Search Results for author: Xilun Chen

Found 27 papers, 15 papers with code

Towards Safety and Helpfulness Balanced Responses via Controllable Large Language Models

no code implementations 1 Apr 2024 Yi-Lin Tuan, Xilun Chen, Eric Michael Smith, Louis Martin, Soumya Batra, Asli Celikyilmaz, William Yang Wang, Daniel M. Bikel

As large language models (LLMs) become easily accessible, the trade-off between safety and helpfulness can significantly impact user experience.

Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning

no code implementations 18 Mar 2024 Rao Fu, Jingyu Liu, Xilun Chen, Yixin Nie, Wenhan Xiong

This paper introduces Scene-LLM, a 3D-visual-language model that enhances embodied agents' abilities in interactive 3D indoor environments by integrating the reasoning strengths of Large Language Models (LLMs).

Dense Captioning · Language Modelling +1

RA-DIT: Retrieval-Augmented Dual Instruction Tuning

no code implementations 2 Oct 2023 Xi Victoria Lin, Xilun Chen, Mingda Chen, Weijia Shi, Maria Lomeli, Rich James, Pedro Rodriguez, Jacob Kahn, Gergely Szilvasy, Mike Lewis, Luke Zettlemoyer, Scott Yih

Retrieval-augmented language models (RALMs) improve performance by accessing long-tail and up-to-date knowledge from external data stores, but are challenging to build.

Few-Shot Learning · Open-Domain Question Answering +1

Few-Shot Data Synthesis for Open Domain Multi-Hop Question Answering

no code implementations 23 May 2023 Mingda Chen, Xilun Chen, Wen-tau Yih

Few-shot learning for open domain multi-hop question answering typically relies on the in-context learning capability of large language models (LLMs).

Fact Verification · Few-Shot Learning +2

VideoOFA: Two-Stage Pre-Training for Video-to-Text Generation

no code implementations 4 May 2023 Xilun Chen, Lili Yu, Wenhan Xiong, Barlas Oğuz, Yashar Mehdad, Wen-tau Yih

We propose a new two-stage pre-training framework for video-to-text generation tasks such as video captioning and video question answering: A generative encoder-decoder model is first jointly pre-trained on massive image-text data to learn fundamental vision-language concepts, and then adapted to video data in an intermediate video-text pre-training stage to learn video-specific skills such as spatio-temporal reasoning.

Question Answering · Text Generation +3

Hierarchical Video-Moment Retrieval and Step-Captioning

1 code implementation CVPR 2023 Abhay Zala, Jaemin Cho, Satwik Kottur, Xilun Chen, Barlas Oğuz, Yashar Mehdad, Mohit Bansal

Our hierarchical benchmark consists of video retrieval, moment retrieval, and two novel moment segmentation and step captioning tasks.

Information Retrieval · Moment Retrieval +4

How to Train Your DRAGON: Diverse Augmentation Towards Generalizable Dense Retrieval

1 code implementation 15 Feb 2023 Sheng-Chieh Lin, Akari Asai, Minghan Li, Barlas Oguz, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen

We hence propose a new data augmentation (DA) approach with diverse queries and sources of supervision to progressively train a generalizable dense retriever (DR). As a result, DRAGON, our dense retriever trained with diverse augmentation, is the first BERT-base-sized DR to achieve state-of-the-art effectiveness in both supervised and zero-shot evaluations, and even competes with models using more complex late interaction (ColBERTv2 and SPLADE++).

Contrastive Learning · Data Augmentation +1

Nonparametric Masked Language Modeling

1 code implementation 2 Dec 2022 Sewon Min, Weijia Shi, Mike Lewis, Xilun Chen, Wen-tau Yih, Hannaneh Hajishirzi, Luke Zettlemoyer

Existing language models (LMs) predict tokens with a softmax over a finite vocabulary, which can make it difficult to predict rare tokens or phrases.

Language Modelling · Masked Language Modeling +2
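The contrast the abstract draws, a softmax over a fixed vocabulary versus a nonparametric distribution over a reference corpus, can be illustrated with a toy sketch (random vectors throughout; this is not the paper's actual model, and the datastore scoring below merely stands in for its phrase retrieval):

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

rng = np.random.default_rng(0)
hidden = rng.normal(size=16)             # contextual representation of one position

# Standard LM head: score the hidden state against a fixed output vocabulary.
vocab_emb = rng.normal(size=(100, 16))   # 100-word output embedding table
p_vocab = softmax(vocab_emb @ hidden)    # distribution over the finite vocabulary

# Nonparametric alternative (in the spirit of the paper): score the same
# hidden state against embeddings of phrases from a reference corpus, so the
# effective "vocabulary" grows with the datastore and can cover rare phrases.
datastore = rng.normal(size=(5000, 16))  # embeddings of corpus phrases
p_phrase = softmax(datastore @ hidden)   # distribution over datastore entries
```

The point of the sketch is structural: the first distribution can never place mass outside its 100 entries, while the second is bounded only by the datastore size.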

CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for Efficient and Effective Multi-Vector Retrieval

1 code implementation 18 Nov 2022 Minghan Li, Sheng-Chieh Lin, Barlas Oguz, Asish Ghoshal, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen

In this paper, we unify different multi-vector retrieval models from a token routing viewpoint and propose conditional token interaction via dynamic lexical routing, namely CITADEL, for efficient and effective multi-vector retrieval.

Retrieval
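For context on what "multi-vector retrieval" means here, below is a minimal sketch of the standard late-interaction (max-sim) scoring used by models such as ColBERT, which CITADEL's dynamic lexical routing makes sparser. This shows only the generic all-pairs token interaction, not CITADEL's routing itself:

```python
import numpy as np

def maxsim_score(query_vecs, doc_vecs):
    """Late-interaction scoring: match each query token vector to its most
    similar document token vector and sum the maxima. CITADEL restricts which
    token pairs may interact via learned lexical routing; this sketch keeps
    the dense all-pairs interaction for simplicity."""
    sims = query_vecs @ doc_vecs.T        # (num_query_tokens, num_doc_tokens)
    return sims.max(axis=1).sum()         # best doc match per query token

rng = np.random.default_rng(3)
q = rng.normal(size=(4, 16))              # 4 query token vectors
d = rng.normal(size=(30, 16))             # 30 document token vectors
score = maxsim_score(q, d)
```

On a hand-checkable input, two orthogonal unit query vectors against `[[2,0],[0,3],[1,1]]` score 2 + 3 = 5, which is what the max-per-row-then-sum rule predicts.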

Task-aware Retrieval with Instructions

1 code implementation 16 Nov 2022 Akari Asai, Timo Schick, Patrick Lewis, Xilun Chen, Gautier Izacard, Sebastian Riedel, Hannaneh Hajishirzi, Wen-tau Yih

We study the problem of retrieval with instructions, where users of a retrieval system explicitly describe their intent along with their queries.

Retrieval

A Study on the Efficiency and Generalization of Light Hybrid Retrievers

no code implementations 4 Oct 2022 Man Luo, Shashank Jain, Anchit Gupta, Arash Einolghozati, Barlas Oguz, Debojeet Chatterjee, Xilun Chen, Chitta Baral, Peyman Heidari

Driven by this question, we leverage an indexing-efficient dense retriever (i.e., DrBoost) and introduce a LITE retriever that further reduces the memory footprint of DrBoost.

Adversarial Attack · Contrastive Learning +1

Salient Phrase Aware Dense Retrieval: Can a Dense Retriever Imitate a Sparse One?

2 code implementations 13 Oct 2021 Xilun Chen, Kushal Lakhotia, Barlas Oğuz, Anchit Gupta, Patrick Lewis, Stan Peshterliev, Yashar Mehdad, Sonal Gupta, Wen-tau Yih

Despite their recent popularity and well-known advantages, dense retrievers still lag behind sparse methods such as BM25 in their ability to reliably match salient phrases and rare entities in the query and to generalize to out-of-domain data.

Open-Domain Question Answering · Passage Retrieval +1
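To make the comparison concrete, here is a minimal sketch of the BM25 scoring that the sparse baseline relies on. This is a standard textbook formulation with a Lucene-style IDF; the `k1` and `b` values are conventional defaults, not parameters taken from the paper:

```python
import math
from collections import Counter

def bm25_score(query_terms, doc, corpus, k1=1.5, b=0.75):
    """Score one tokenized document against a query with BM25.
    Rewards exact term matches, damped by term saturation (k1)
    and document-length normalization (b)."""
    N = len(corpus)
    avgdl = sum(len(d) for d in corpus) / N   # average document length
    tf = Counter(doc)
    score = 0.0
    for term in query_terms:
        df = sum(1 for d in corpus if term in d)          # document frequency
        idf = math.log((N - df + 0.5) / (df + 0.5) + 1)   # Lucene-style IDF
        f = tf[term]                                      # term frequency
        score += idf * f * (k1 + 1) / (f + k1 * (1 - b + b * len(doc) / avgdl))
    return score

corpus = [
    "the quick brown fox".split(),
    "dense retrievers lag behind sparse methods".split(),
    "bm25 matches salient phrases and rare entities".split(),
]
query = "salient phrases".split()
scores = [bm25_score(query, d, corpus) for d in corpus]
best = max(range(len(corpus)), key=scores.__getitem__)
```

Because BM25 only rewards exact lexical overlap, the third document (the only one containing the query terms) wins outright, which is exactly the salient-phrase matching behavior the abstract says dense retrievers struggle to imitate.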

Low-Resource Domain Adaptation for Compositional Task-Oriented Semantic Parsing

no code implementations EMNLP 2020 Xilun Chen, Asish Ghoshal, Yashar Mehdad, Luke Zettlemoyer, Sonal Gupta

Task-oriented semantic parsing is a critical component of virtual assistants, which is responsible for understanding the user's intents (set reminder, play music, etc.).

Domain Adaptation · Meta-Learning +2

Multi-Source Cross-Lingual Model Transfer: Learning What to Share

1 code implementation ACL 2019 Xilun Chen, Ahmed Hassan Awadallah, Hany Hassan, Wei Wang, Claire Cardie

In this work, we focus on the multilingual transfer setting where training data in multiple source languages is leveraged to further boost target language performance.

Cross-Lingual NER · text-classification +2

Zero-Resource Multilingual Model Transfer: Learning What to Share

no code implementations 27 Sep 2018 Xilun Chen, Ahmed Hassan Awadallah, Hany Hassan, Wei Wang, Claire Cardie

In this work, we propose a zero-resource multilingual transfer learning model that can utilize training data in multiple source languages, while not requiring target language training data nor cross-lingual supervision.

Cross-Lingual Transfer · text-classification +2

Unsupervised Multilingual Word Embeddings

3 code implementations EMNLP 2018 Xilun Chen, Claire Cardie

Multilingual Word Embeddings (MWEs) represent words from multiple languages in a single distributional vector space.

Multilingual Word Embeddings · Translation +2
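A common building block for placing two embedding spaces in one shared space is the orthogonal Procrustes solution. The sketch below shows that closed form on synthetic data; the paper's unsupervised multilingual method is considerably more involved, and this supervised alignment step is only illustrative:

```python
import numpy as np

def procrustes_map(src, tgt):
    """Closed-form orthogonal map W minimizing ||src @ W - tgt||_F:
    with src.T @ tgt = U S V^T (SVD), the minimizer is W = U @ V^T."""
    u, _, vt = np.linalg.svd(src.T @ tgt)
    return u @ vt

rng = np.random.default_rng(1)
en = rng.normal(size=(50, 8))                 # toy "English" embeddings
true_rot = np.linalg.qr(rng.normal(size=(8, 8)))[0]  # a hidden orthogonal map
fr = en @ true_rot                            # toy "French" space: rotated English
W = procrustes_map(fr, en)                    # recover map French -> English space
err = np.linalg.norm(fr @ W - en)             # near zero: rotation recovered
```

Since the synthetic "French" space is an exact rotation of the "English" one, the recovered `W` is orthogonal and the residual is numerically zero; real cross-lingual spaces are only approximately isometric, which is part of what makes the unsupervised setting hard.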

Adversarial Deep Averaging Networks for Cross-Lingual Sentiment Classification

2 code implementations TACL 2018 Xilun Chen, Yu Sun, Ben Athiwaratkun, Claire Cardie, Kilian Weinberger

To tackle the sentiment classification problem in low-resource languages without adequate annotated data, we propose an Adversarial Deep Averaging Network (ADAN) to transfer the knowledge learned from labeled data on a resource-rich source language to low-resource languages where only unlabeled data exists.

Classification · Cross-Lingual Document Classification +5
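The Deep Averaging Network that ADAN builds on can be sketched as a forward pass (all weights and the tiny vocabulary below are untrained random stand-ins; ADAN additionally trains a language discriminator adversarially via gradient reversal, which this sketch omits):

```python
import numpy as np

rng = np.random.default_rng(2)
vocab = {"great": 0, "movie": 1, "terrible": 2, "plot": 3}
emb = rng.normal(size=(len(vocab), 8))   # shared (bilingual) word embeddings
W_hidden = rng.normal(size=(8, 4))       # hypothetical hidden-layer weights

def dan_features(tokens):
    """Deep Averaging Network feature extractor: average the word vectors
    of a sentence, then apply a feed-forward layer. In ADAN, these shared
    features feed both a sentiment classifier and an adversarial language
    discriminator so that they become language-invariant."""
    avg = emb[[vocab[t] for t in tokens]].mean(axis=0)
    return np.tanh(avg @ W_hidden)

feat = dan_features("great movie".split())   # 4-dim shared feature vector
```

The design choice worth noting is that averaging makes the extractor order-invariant and very cheap, which is why the adversarial language-invariance objective, rather than the encoder, carries most of the cross-lingual transfer.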
