Search Results for author: Misha Sra

Found 6 papers, 3 papers with code

TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing

1 code implementation • 17 Apr 2024 • Sherry X. Chen, Yaron Vaxman, Elad Ben Baruch, David Asulin, Aviad Moreshet, Kuo-Chin Lien, Misha Sra, Pradeep Sen

Previous approaches have focused on either fine-tuning pre-trained T2I models on specific datasets to generate certain kinds of images (e.g., with a specific object or person), or on optimizing the weights, text prompts, and/or learning features for each input image in an attempt to coax the image generator to produce the desired result.

XplainLLM: A QA Explanation Dataset for Understanding LLM Decision-Making

no code implementations • 15 Nov 2023 • Zichen Chen, Jianda Chen, Mitali Gaidhani, Ambuj Singh, Misha Sra

The explanation component includes a why-choose explanation, a why-not-choose explanation, and a set of reason-elements that underlie the LLM's decision.

Decision Making, Graph Attention, +4

LMExplainer: a Knowledge-Enhanced Explainer for Language Models

no code implementations • 29 Mar 2023 • Zichen Chen, Ambuj K Singh, Misha Sra

We propose LMExplainer, a knowledge-enhanced explainer for LMs that can provide human-understandable explanations.

Decision Making, Graph Attention

SPOTR: Spatio-temporal Pose Transformers for Human Motion Prediction

no code implementations • 11 Mar 2023 • Avinash Ajit Nargund, Misha Sra

Our contributions are threefold: (i) we frame human motion prediction as a sequence-to-sequence problem and propose a non-autoregressive Transformer to forecast a sequence of poses in parallel; (ii) our method is activity agnostic; (iii) we show that despite its simplicity, our approach is able to make accurate predictions, achieving results better than or comparable to the state of the art on two public datasets, with far fewer parameters and much faster inference.

Autonomous Driving, Human motion prediction, +1

Self-Supervised Knowledge Assimilation for Expert-Layman Text Style Transfer

1 code implementation • 6 Oct 2021 • Wenda Xu, Michael Saxon, Misha Sra, William Yang Wang

This is a particularly notable issue in the medical domain, where laypeople are often confused by medical text online.

Language Modelling, Self-Supervised Learning, +2

Txt2Vid: Ultra-Low Bitrate Compression of Talking-Head Videos via Text

1 code implementation • 26 Jun 2021 • Pulkit Tandon, Shubham Chandak, Pat Pataranutaporn, Yimeng Liu, Anesu M. Mapuranga, Pattie Maes, Tsachy Weissman, Misha Sra

Video represents the majority of internet traffic today, driving a continual race between the generation of higher quality content, transmission of larger file sizes, and the development of network infrastructure.

Talking Face Generation, Talking Head Generation, +2
