Search Results for author: Misha Sra

Found 6 papers, 3 papers with code

TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing

1 code implementation17 Apr 2024 Sherry X. Chen, Yaron Vaxman, Elad Ben Baruch, David Asulin, Aviad Moreshet, Kuo-Chin Lien, Misha Sra, Pradeep Sen

Previous approaches have focused on either fine-tuning pre-trained T2I models on specific datasets to generate certain kinds of images (e. g., with a specific object or person), or on optimizing the weights, text prompts, and/or learning features for each input image in an attempt to coax the image generator to produce the desired result.

XplainLLM: A QA Explanation Dataset for Understanding LLM Decision-Making

no code implementations15 Nov 2023 Zichen Chen, Jianda Chen, Mitali Gaidhani, Ambuj Singh, Misha Sra

The explanation component includes a why-choose explanation, a why-not-choose explanation, and a set of reason-elements that underlie the LLM's decision.

Decision Making Graph Attention +4

LMExplainer: a Knowledge-Enhanced Explainer for Language Models

no code implementations29 Mar 2023 Zichen Chen, Ambuj K Singh, Misha Sra

We propose LMExplainer, a knowledge-enhanced explainer for LMs that can provide human-understandable explanations.

Decision Making Graph Attention

SPOTR: Spatio-temporal Pose Transformers for Human Motion Prediction

no code implementations11 Mar 2023 Avinash Ajit Nargund, Misha Sra

Our contributions are threefold: (i) we frame human motion prediction as a sequence-to-sequence problem and propose a non-autoregressive Transformer to forecast a sequence of poses in parallel; (ii) our method is activity agnostic; (iii) we show that despite its simplicity, our approach is able to make accurate predictions, achieving better or comparable results compared to the state-of-the-art on two public datasets, with far fewer parameters and much faster inference.

Autonomous Driving Human motion prediction +1

Self-Supervised Knowledge Assimilation for Expert-Layman Text Style Transfer

1 code implementation6 Oct 2021 Wenda Xu, Michael Saxon, Misha Sra, William Yang Wang

This is a particularly notable issue in the medical domain, where layman are often confused by medical text online.

Language Modelling Self-Supervised Learning +2

Txt2Vid: Ultra-Low Bitrate Compression of Talking-Head Videos via Text

1 code implementation26 Jun 2021 Pulkit Tandon, Shubham Chandak, Pat Pataranutaporn, Yimeng Liu, Anesu M. Mapuranga, Pattie Maes, Tsachy Weissman, Misha Sra

Video represents the majority of internet traffic today, driving a continual race between the generation of higher quality content, transmission of larger file sizes, and the development of network infrastructure.

Talking Face Generation Talking Head Generation +2

Cannot find the paper you are looking for? You can Submit a new open access paper.