Search Results for author: Sonal Gupta

Found 34 papers, 11 papers with code

EASE: Extractive-Abstractive Summarization End-to-End using the Information Bottleneck Principle

no code implementations • EMNLP (newsum) 2021 • Haoran Li, Arash Einolghozati, Srinivasan Iyer, Bhargavi Paranjape, Yashar Mehdad, Sonal Gupta, Marjan Ghazvininejad

To achieve the best of both worlds, we propose EASE, an extractive-abstractive framework that generates concise abstractive summaries that can be traced back to an extractive summary.

Abstractive Text Summarization Extractive Summarization +1

Paper
Add Code

Improving Text-to-Text Pre-trained Models for the Graph-to-Text Task

no code implementations • ACL (WebNLG, INLG) 2020 • Zixiaofan Yang, Arash Einolghozati, Hakan Inan, Keith Diedrick, Angela Fan, Pinar Donmez, Sonal Gupta

Converting a knowledge graph or sub-graph to natural text is useful when answering questions based on a knowledge base.

Paper
Add Code

Getting to Production with Few-shot Natural Language Generation Models

no code implementations • SIGDIAL (ACL) 2021 • Peyman Heidari, Arash Einolghozati, Shashank Jain, Soumya Batra, Lee Callender, Ankit Arun, Shawn Mei, Sonal Gupta, Pinar Donmez, Vikas Bhardwaj, Anuj Kumar, Michael White

In this paper, we study the utilization of pre-trained language models to enable few-shotNatural Language Generation (NLG) in task-oriented dialog systems.

Language Modelling Text Generation

Paper
Add Code

Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression

no code implementations • 17 Nov 2023 • Animesh Sinha, Bo Sun, Anmol Kalia, Arantxa Casanova, Elliot Blanchard, David Yan, Winnie Zhang, Tony Nelli, Jiahui Chen, Hardik Shah, Licheng Yu, Mitesh Kumar Singh, Ankit Ramchandani, Maziar Sanjabi, Sonal Gupta, Amy Bearman, Dhruv Mahajan

Evaluation results show our method improves visual quality by 14%, prompt alignment by 16. 2% and scene diversity by 15. 3%, compared to prompt engineering the base Emu model for stickers generation.

Image Generation Prompt Engineering

Paper
Add Code

Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation

no code implementations • ICCV 2023 • Samaneh Azadi, Akbar Shah, Thomas Hayes, Devi Parikh, Sonal Gupta

However, existing approaches are limited by their reliance on relatively small-scale motion capture data, leading to poor performance on more diverse, in-the-wild prompts.

Ranked #20 on Motion Synthesis on HumanML3D

Motion Synthesis Text-to-Video Generation +1

Paper
Add Code

Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation

no code implementations • 17 Apr 2023 • Jie An, Songyang Zhang, Harry Yang, Sonal Gupta, Jia-Bin Huang, Jiebo Luo, Xi Yin

In contrast, we propose a parameter-free temporal shift module that can leverage the spatial U-Net as is for video generation.

Super-Resolution Text-to-Image Generation +2

Paper
Add Code

Text-Conditional Contextualized Avatars For Zero-Shot Personalization

no code implementations • 14 Apr 2023 • Samaneh Azadi, Thomas Hayes, Akbar Shah, Guan Pang, Devi Parikh, Sonal Gupta

Recent large-scale text-to-image generation models have made significant improvements in the quality, realism, and diversity of the synthesized images and enable users to control the created content through language.

Text to 3D Text-to-Image Generation

Paper
Add Code

SpaText: Spatio-Textual Representation for Controllable Image Generation

no code implementations • CVPR 2023 • Omri Avrahami, Thomas Hayes, Oran Gafni, Sonal Gupta, Yaniv Taigman, Devi Parikh, Dani Lischinski, Ohad Fried, Xi Yin

Due to lack of large-scale datasets that have a detailed textual description for each region in the image, we choose to leverage the current large-scale text-to-image datasets and base our approach on a novel CLIP-based spatio-textual representation, and show its effectiveness on two state-of-the-art diffusion models: pixel-based and latent-based.

Text-to-Image Generation

Paper
Add Code

Make-A-Video: Text-to-Video Generation without Text-Video Data

2 code implementations • 29 Sep 2022 • Uriel Singer, Adam Polyak, Thomas Hayes, Xi Yin, Jie An, Songyang Zhang, Qiyuan Hu, Harry Yang, Oron Ashual, Oran Gafni, Devi Parikh, Sonal Gupta, Yaniv Taigman

We propose Make-A-Video -- an approach for directly translating the tremendous recent progress in Text-to-Image (T2I) generation to Text-to-Video (T2V).

Ranked #3 on Text-to-Video Generation on MSR-VTT (CLIP-FID metric)

Image Generation Super-Resolution +2

1,838

Paper
Code

CCQA: A New Web-Scale Question Answering Dataset for Model Pre-Training

1 code implementation • Findings (NAACL) 2022 • Patrick Huber, Armen Aghajanyan, Barlas Oğuz, Dmytro Okhonko, Wen-tau Yih, Sonal Gupta, Xilun Chen

Consequently, we propose a novel QA dataset based on the Common Crawl project in this paper.

Open-Domain Question Answering

Paper
Code

Salient Phrase Aware Dense Retrieval: Can a Dense Retriever Imitate a Sparse One?

2 code implementations • 13 Oct 2021 • Xilun Chen, Kushal Lakhotia, Barlas Oğuz, Anchit Gupta, Patrick Lewis, Stan Peshterliev, Yashar Mehdad, Sonal Gupta, Wen-tau Yih

Despite their recent popularity and well-known advantages, dense retrievers still lag behind sparse methods such as BM25 in their ability to reliably match salient phrases and rare entities in the query and to generalize to out-of-domain data.

Ranked #2 on Passage Retrieval on EntityQuestions

Open-Domain Question Answering Passage Retrieval +1

247

Paper
Code

Domain-matched Pre-training Tasks for Dense Retrieval

1 code implementation • Findings (NAACL) 2022 • Barlas Oğuz, Kushal Lakhotia, Anchit Gupta, Patrick Lewis, Vladimir Karpukhin, Aleksandra Piktus, Xilun Chen, Sebastian Riedel, Wen-tau Yih, Sonal Gupta, Yashar Mehdad

Pre-training on larger datasets with ever increasing model size is now a proven recipe for increased performance across almost all NLP tasks.

Ranked #2 on Passage Retrieval on Natural Questions (using extra training data)

Passage Retrieval Retrieval

247

Paper
Code

EASE: Extractive-Abstractive Summarization with Explanations

no code implementations • 14 May 2021 • Haoran Li, Arash Einolghozati, Srinivasan Iyer, Bhargavi Paranjape, Yashar Mehdad, Sonal Gupta, Marjan Ghazvininejad

Current abstractive summarization systems outperform their extractive counterparts, but their widespread adoption is inhibited by the inherent lack of interpretability.

Abstractive Text Summarization Document Summarization +1

Paper
Add Code

El Volumen Louder Por Favor: Code-switching in Task-oriented Semantic Parsing

no code implementations • EACL 2021 • Arash Einolghozati, Abhinav Arora, Lorena Sainz-Maza Lecanda, Anuj Kumar, Sonal Gupta

Being able to parse code-switched (CS) utterances, such as Spanish+English or Hindi+English, is essential to democratize task-oriented semantic parsing systems for certain locales.

Data Augmentation Semantic Parsing

Paper
Add Code

Muppet: Massive Multi-task Representations with Pre-Finetuning

2 code implementations • EMNLP 2021 • Armen Aghajanyan, Anchit Gupta, Akshat Shrivastava, Xilun Chen, Luke Zettlemoyer, Sonal Gupta

We propose pre-finetuning, an additional large-scale learning stage between language model pre-training and fine-tuning.

Ranked #3 on Text Summarization on GigaWord (using extra training data)

Abstractive Text Summarization Common Sense Reasoning +7

Paper
Code

Learning Better Structured Representations Using Low-rank Adaptive Label Smoothing

no code implementations • ICLR 2021 • Asish Ghoshal, Xilun Chen, Sonal Gupta, Luke Zettlemoyer, Yashar Mehdad

Training with soft targets instead of hard targets has been shown to improve performance and calibration of deep neural networks.

Generalization Bounds Machine Translation +4

Paper
Add Code

NeurIPS 2020 EfficientQA Competition: Systems, Analyses and Lessons Learned

no code implementations • 1 Jan 2021 • Sewon Min, Jordan Boyd-Graber, Chris Alberti, Danqi Chen, Eunsol Choi, Michael Collins, Kelvin Guu, Hannaneh Hajishirzi, Kenton Lee, Jennimaria Palomaki, Colin Raffel, Adam Roberts, Tom Kwiatkowski, Patrick Lewis, Yuxiang Wu, Heinrich Küttler, Linqing Liu, Pasquale Minervini, Pontus Stenetorp, Sebastian Riedel, Sohee Yang, Minjoon Seo, Gautier Izacard, Fabio Petroni, Lucas Hosseini, Nicola De Cao, Edouard Grave, Ikuya Yamada, Sonse Shimaoka, Masatoshi Suzuki, Shumpei Miyawaki, Shun Sato, Ryo Takahashi, Jun Suzuki, Martin Fajcik, Martin Docekal, Karel Ondrej, Pavel Smrz, Hao Cheng, Yelong Shen, Xiaodong Liu, Pengcheng He, Weizhu Chen, Jianfeng Gao, Barlas Oguz, Xilun Chen, Vladimir Karpukhin, Stan Peshterliev, Dmytro Okhonko, Michael Schlichtkrull, Sonal Gupta, Yashar Mehdad, Wen-tau Yih

We review the EfficientQA competition from NeurIPS 2020.

Open-Domain Question Answering Retrieval

Paper
Add Code

UniK-QA: Unified Representations of Structured and Unstructured Knowledge for Open-Domain Question Answering

1 code implementation • Findings (NAACL) 2022 • Barlas Oguz, Xilun Chen, Vladimir Karpukhin, Stan Peshterliev, Dmytro Okhonko, Michael Schlichtkrull, Sonal Gupta, Yashar Mehdad, Scott Yih

We study open-domain question answering with structured, unstructured and semi-structured knowledge sources, including text, tables, lists and knowledge bases.

Ranked #1 on Open-Domain Question Answering on WebQuestions (using extra training data)

Knowledge Base Question Answering Open-Domain Question Answering

Paper
Code

Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning

2 code implementations • ACL 2021 • Armen Aghajanyan, Luke Zettlemoyer, Sonal Gupta

Although pretrained language models can be fine-tuned to produce state-of-the-art results for a very wide range of language understanding tasks, the dynamics of this process are not well understood, especially in the low data regime.

Ranked #1 on Transfer Learning on Amazon Review Polarity (Structure Aware Intrinsic Dimension metric)

Generalization Bounds Language Modelling +3

122

Paper
Code

Sound Natural: Content Rephrasing in Dialog Systems

1 code implementation • EMNLP 2020 • Arash Einolghozati, Anchit Gupta, Keith Diedrick, Sonal Gupta

We introduce a new task of rephrasing for a more natural virtual assistant.

Language Modelling Paraphrase Generation

Paper
Code

Low-Resource Domain Adaptation for Compositional Task-Oriented Semantic Parsing

no code implementations • EMNLP 2020 • Xilun Chen, Asish Ghoshal, Yashar Mehdad, Luke Zettlemoyer, Sonal Gupta

Task-oriented semantic parsing is a critical component of virtual assistants, which is responsible for understanding the user's intents (set reminder, play music, etc.).

Domain Adaptation Meta-Learning +2

Paper
Add Code

Conversational Semantic Parsing

no code implementations • EMNLP 2020 • Armen Aghajanyan, Jean Maillard, Akshat Shrivastava, Keith Diedrick, Mike Haeger, Haoran Li, Yashar Mehdad, Ves Stoyanov, Anuj Kumar, Mike Lewis, Sonal Gupta

In this paper, we propose a semantic representation for such task-oriented conversational systems that can represent concepts such as co-reference and context carryover, enabling comprehensive understanding of queries in a session.

dialog state tracking Semantic Parsing

Paper
Add Code

MTOP: A Comprehensive Multilingual Task-Oriented Semantic Parsing Benchmark

no code implementations • EACL 2021 • Haoran Li, Abhinav Arora, Shuohui Chen, Anchit Gupta, Sonal Gupta, Yashar Mehdad

Scaling semantic parsing models for task-oriented dialog systems to new languages is often expensive and time-consuming due to the lack of available datasets.

Benchmarking Semantic Parsing +1

Paper
Add Code

Better Fine-Tuning by Reducing Representational Collapse

3 code implementations • ICLR 2021 • Armen Aghajanyan, Akshat Shrivastava, Anchit Gupta, Naman Goyal, Luke Zettlemoyer, Sonal Gupta

Although widely adopted, existing approaches for fine-tuning pre-trained language models have been shown to be unstable across hyper-parameter settings, motivating recent work on trust region methods.

Ranked #1 on Cross-Lingual Natural Language Inference on XNLI Zero-Shot English-to-Spanish

Abstractive Text Summarization Cross-Lingual Natural Language Inference

29,249

Paper
Code

Likelihood Ratios and Generative Classifiers for Unsupervised Out-of-Domain Detection In Task Oriented Dialog

1 code implementation • 30 Dec 2019 • Varun Gangal, Abhinav Arora, Arash Einolghozati, Sonal Gupta

We are hitherto the first to investigate the use of generative classifiers for OOD detection at test-time.

4k Out of Distribution (OOD) Detection +1

Paper
Code

Improving Robustness of Task Oriented Dialog Systems

no code implementations • 12 Nov 2019 • Arash Einolghozati, Sonal Gupta, Mrinal Mohit, Rushin Shah

However, evaluating a model's robustness to these changes is harder for language since words are discrete and an automated change (e. g. adding `noise') to a query sometimes changes the meaning and thus labels of a query.

Adversarial Attack Data Augmentation +4

Paper
Add Code

Span-based Hierarchical Semantic Parsing for Task-Oriented Dialog

no code implementations • IJCNLP 2019 • Panupong Pasupat, Sonal Gupta, M, Karishma yam, Rushin Shah, Mike Lewis, Luke Zettlemoyer

We propose a semantic parser for parsing compositional utterances into Task Oriented Parse (TOP), a tree representation that has intents and slots as labels of nesting tree nodes.

Semantic Parsing valid

Paper
Add Code

Improving Semantic Parsing for Task Oriented Dialog

no code implementations • 15 Feb 2019 • Arash Einolghozati, Panupong Pasupat, Sonal Gupta, Rushin Shah, Mrinal Mohit, Mike Lewis, Luke Zettlemoyer

Semantic parsing using hierarchical representations has recently been proposed for task oriented dialog with promising results [Gupta et al 2018].

Language Modelling Re-Ranking +1

Paper
Add Code

PyText: A Seamless Path from NLP research to production

2 code implementations • 12 Dec 2018 • Ahmed Aly, Kushal Lakhotia, Shicong Zhao, Mrinal Mohit, Barlas Oguz, Abhinav Arora, Sonal Gupta, Christopher Dewan, Stef Nelson-Lindall, Rushin Shah

We introduce PyText - a deep learning based NLP modeling framework built on PyTorch.

6,348

Paper
Code

Cross-Lingual Transfer Learning for Multilingual Task Oriented Dialog

no code implementations • NAACL 2019 • Sebastian Schuster, Sonal Gupta, Rushin Shah, Mike Lewis

We use this data set to evaluate three different cross-lingual transfer methods: (1) translating the training data, (2) using cross-lingual pre-trained embeddings, and (3) a novel method of using a multilingual machine translation encoder as contextual word representations.

Cross-Lingual Transfer Machine Translation +1