Search Results for author: Noah A. Smith

Found 255 papers, 121 papers with code

Expected Validation Performance and Estimation of a Random Variable’s Maximum

no code implementations • Findings (EMNLP) 2021 • Jesse Dodge, Suchin Gururangan, Dallas Card, Roy Schwartz, Noah A. Smith

We find that the two biased estimators lead to the fewest incorrect conclusions, which hints at the importance of minimizing variance and MSE.

Paper
Add Code

The Engage Corpus: A Social Media Dataset for Text-Based Recommender Systems

no code implementations • LREC 2022 • Daniel Cheng, Kyle Yan, Phillip Keung, Noah A. Smith

Social media platforms play an increasingly important role as forums for public discourse.

Misinformation Recommendation Systems

Paper
Add Code

Writing Strategies for Science Communication: Data and Computational Analysis

no code implementations • EMNLP 2020 • Tal August, Lauren Kim, Katharina Reinecke, Noah A. Smith

We collect a corpus of 128k science writing documents in English and annotate a subset of this corpus.

Paper
Add Code

Domain Mismatch Doesn’t Always Prevent Cross-lingual Transfer Learning

no code implementations • LREC 2022 • Daniel Edmiston, Phillip Keung, Noah A. Smith

Cross-lingual transfer learning without labeled target language data or parallel text has been surprisingly effective in zero-shot cross-lingual classification, question answering, unsupervised machine translation, etc.

Bilingual Lexicon Induction Cross-Lingual Transfer +5

Paper
Add Code

A Taxonomy of Ambiguity Types for NLP

no code implementations • 21 Mar 2024 • Margaret Y. Li, Alisa Liu, Zhaofeng Wu, Noah A. Smith

Ambiguity is an critical component of language that allows for more effective communication between speakers, but is often ignored in NLP.

Paper
Add Code

RewardBench: Evaluating Reward Models for Language Modeling

1 code implementation • 20 Mar 2024 • Nathan Lambert, Valentina Pyatkin, Jacob Morrison, LJ Miranda, Bill Yuchen Lin, Khyathi Chandu, Nouha Dziri, Sachin Kumar, Tom Zick, Yejin Choi, Noah A. Smith, Hannaneh Hajishirzi

In this paper, we present RewardBench, a benchmark dataset and code-base for evaluation, to enhance scientific understanding of reward models.

Instruction Following Language Modelling

163

Paper
Code

Third-Party Language Model Performance Prediction from Instruction

1 code implementation • 19 Mar 2024 • Rahul Nadkarni, Yizhong Wang, Noah A. Smith

Language model-based instruction-following systems have lately shown increasing performance on many benchmark tasks, demonstrating the capability of adapting to a broad variety of instructions.

Instruction Following Language Modelling

Paper
Code

Encode Once and Decode in Parallel: Efficient Transformer Decoding

1 code implementation • 19 Mar 2024 • Bo-Ru Lu, Nikita Haduong, Chien-Yu Lin, Hao Cheng, Noah A. Smith, Mari Ostendorf

Transformer-based NLP models are powerful but have high computational costs that limit deployment scenarios.

Dialogue State Tracking Question Answering

Paper
Code

Set the Clock: Temporal Alignment of Pretrained Language Models

1 code implementation • 26 Feb 2024 • Bowen Zhao, Zander Brumbaugh, Yizhong Wang, Hannaneh Hajishirzi, Noah A. Smith

We then develop several methods, from prompting to finetuning, to align LMs to use their most recent knowledge when answering questions, and investigate various factors in this alignment.

Paper
Code

OLMo: Accelerating the Science of Language Models

2 code implementations • 1 Feb 2024 • Dirk Groeneveld, Iz Beltagy, Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, Hamish Ivison, Ian Magnusson, Yizhong Wang, Shane Arora, David Atkinson, Russell Authur, Khyathi Raghavi Chandu, Arman Cohan, Jennifer Dumas, Yanai Elazar, Yuling Gu, Jack Hessel, Tushar Khot, William Merrill, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Valentina Pyatkin, Abhilasha Ravichander, Dustin Schwenk, Saurabh Shah, Will Smith, Emma Strubell, Nishant Subramani, Mitchell Wortsman, Pradeep Dasigi, Nathan Lambert, Kyle Richardson, Luke Zettlemoyer, Jesse Dodge, Kyle Lo, Luca Soldaini, Noah A. Smith, Hannaneh Hajishirzi

Given the importance of these details in scientifically studying these models, including their biases and potential risks, we believe it is essential for the research community to have access to powerful, truly open LMs.

Language Modelling

3,897

Paper
Code

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

1 code implementation • 31 Jan 2024 • Luca Soldaini, Rodney Kinney, Akshita Bhagia, Dustin Schwenk, David Atkinson, Russell Authur, Ben Bogin, Khyathi Chandu, Jennifer Dumas, Yanai Elazar, Valentin Hofmann, Ananya Harsh Jha, Sachin Kumar, Li Lucy, Xinxi Lyu, Nathan Lambert, Ian Magnusson, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Abhilasha Ravichander, Kyle Richardson, Zejiang Shen, Emma Strubell, Nishant Subramani, Oyvind Tafjord, Pete Walsh, Luke Zettlemoyer, Noah A. Smith, Hannaneh Hajishirzi, Iz Beltagy, Dirk Groeneveld, Jesse Dodge, Kyle Lo

Language models have become a critical technology to tackling a wide range of natural language processing tasks, yet many details about how the best-performing language models were developed are not reported.

Language Modelling

744

Paper
Code

Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models

no code implementations • 19 Jan 2024 • Terra Blevins, Tomasz Limisiewicz, Suchin Gururangan, Margaret Li, Hila Gonen, Noah A. Smith, Luke Zettlemoyer

Despite their popularity in non-English NLP, multilingual language models often underperform monolingual ones due to inter-language competition for model parameters.

Paper
Add Code

Tuning Language Models by Proxy

1 code implementation • 16 Jan 2024 • Alisa Liu, Xiaochuang Han, Yizhong Wang, Yulia Tsvetkov, Yejin Choi, Noah A. Smith

Despite the general capabilities of large pretrained language models, they consistently benefit from further adaptation to better achieve desired behaviors.

Domain Adaptation Math +1

Paper
Code

Time is Encoded in the Weights of Finetuned Language Models

1 code implementation • 20 Dec 2023 • Kai Nylund, Suchin Gururangan, Noah A. Smith

We present time vectors, a simple tool to customize language models to new time periods.

Language Modelling

Paper
Code

Paloma: A Benchmark for Evaluating Language Model Fit

no code implementations • 16 Dec 2023 • Ian Magnusson, Akshita Bhagia, Valentin Hofmann, Luca Soldaini, Ananya Harsh Jha, Oyvind Tafjord, Dustin Schwenk, Evan Pete Walsh, Yanai Elazar, Kyle Lo, Dirk Groeneveld, Iz Beltagy, Hannaneh Hajishirzi, Noah A. Smith, Kyle Richardson, Jesse Dodge

We invite submissions to our benchmark and organize results by comparability based on compliance with guidelines such as removal of benchmark contamination from pretraining.

Language Modelling

Paper
Add Code

Language Models: A Guide for the Perplexed

no code implementations • 29 Nov 2023 • Sofia Serrano, Zander Brumbaugh, Noah A. Smith

Given the growing importance of AI literacy, we decided to write this tutorial to help narrow the gap between the discourse among those who study language models -- the core technology underlying ChatGPT and similar products -- and those who are intrigued and want to learn more about them.

Language Modelling

Paper
Add Code

Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2

2 code implementations • 17 Nov 2023 • Hamish Ivison, Yizhong Wang, Valentina Pyatkin, Nathan Lambert, Matthew Peters, Pradeep Dasigi, Joel Jang, David Wadden, Noah A. Smith, Iz Beltagy, Hannaneh Hajishirzi

Since the release of T\"ULU [Wang et al., 2023b], open resources for instruction tuning have developed quickly, from better base models to new finetuning techniques.

976

Paper
Code

Measuring and Improving Attentiveness to Partial Inputs with Counterfactuals

no code implementations • 16 Nov 2023 • Yanai Elazar, Bhargavi Paranjape, Hao Peng, Sarah Wiegreffe, Khyathi Raghavi, Vivek Srikumar, Sameer Singh, Noah A. Smith

Previous work has found that datasets with paired inputs are prone to correlations between a specific part of the input (e. g., the hypothesis in NLI) and the label; consequently, models trained only on those outperform chance.

counterfactual In-Context Learning +2

Paper
Add Code

ACID: Abstractive, Content-Based IDs for Document Retrieval with Language Models

no code implementations • 14 Nov 2023 • Haoxin Li, Phillip Keung, Daniel Cheng, Jungo Kasai, Noah A. Smith

Our results demonstrate the effectiveness of human-readable, natural-language IDs in generative retrieval with LMs.

Language Modelling Large Language Model +2

Paper
Add Code

What's In My Big Data?

1 code implementation • 31 Oct 2023 • Yanai Elazar, Akshita Bhagia, Ian Magnusson, Abhilasha Ravichander, Dustin Schwenk, Alane Suhr, Pete Walsh, Dirk Groeneveld, Luca Soldaini, Sameer Singh, Hanna Hajishirzi, Noah A. Smith, Jesse Dodge

We open-source WIMBD's code and artifacts to provide a standard set of evaluations for new text-based corpora and to encourage more analyses and transparency around them.

Benchmarking

122

Paper
Code

That was the last straw, we need more: Are Translation Systems Sensitive to Disambiguating Context?

1 code implementation • 23 Oct 2023 • Jaechan Lee, Alisa Liu, Orevaoghene Ahia, Hila Gonen, Noah A. Smith

In experiments, we compare MT-specific models and language models for (i) their preference when given an ambiguous subsentence, (ii) their sensitivity to disambiguating context, and (iii) the performance disparity between figurative and literal source sentences.

Translation

Paper
Code

In-Context Pretraining: Language Modeling Beyond Document Boundaries

no code implementations • 16 Oct 2023 • Weijia Shi, Sewon Min, Maria Lomeli, Chunting Zhou, Margaret Li, Gergely Szilvasy, Rich James, Xi Victoria Lin, Noah A. Smith, Luke Zettlemoyer, Scott Yih, Mike Lewis

Large language models (LMs) are currently trained to predict tokens given document prefixes, enabling them to directly perform long-form generation and prompting-style tasks which can be reduced to document completion.

In-Context Learning Language Modelling +1

Paper
Add Code

SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore

1 code implementation • 8 Aug 2023 • Sewon Min, Suchin Gururangan, Eric Wallace, Hannaneh Hajishirzi, Noah A. Smith, Luke Zettlemoyer

SILO is built by (1) training a parametric LM on Open License Corpus (OLC), a new corpus we curate with 228B tokens of public domain and permissively licensed text and (2) augmenting it with a more general and easily modifiable nonparametric datastore (e. g., containing copyrighted books or news) that is only queried during inference.

Language Modelling Sentence

Paper
Code

Efficiency Pentathlon: A Standardized Arena for Efficiency Evaluation

no code implementations • 19 Jul 2023 • Hao Peng, Qingqing Cao, Jesse Dodge, Matthew E. Peters, Jared Fernandez, Tom Sherborne, Kyle Lo, Sam Skjonsberg, Emma Strubell, Darrell Plessas, Iz Beltagy, Evan Pete Walsh, Noah A. Smith, Hannaneh Hajishirzi

In response, we introduce Pentathlon, a benchmark for holistic and realistic evaluation of model efficiency.

Paper
Add Code

Does Collaborative Human-LM Dialogue Generation Help Information Extraction from Human Dialogues?

no code implementations • 13 Jul 2023 • Bo-Ru Lu, Nikita Haduong, Chia-Hsuan Lee, Zeqiu Wu, Hao Cheng, Paul Koester, Jean Utke, Tao Yu, Noah A. Smith, Mari Ostendorf

The capabilities of pretrained language models have opened opportunities to explore new application areas, but applications involving human-human interaction are limited by the fact that most data is protected from public release for privacy reasons.

Dialogue Generation Dialogue State Tracking +1

Paper
Add Code

Estimating the Causal Effect of Early ArXiving on Paper Acceptance

2 code implementations • 24 Jun 2023 • Yanai Elazar, Jiayao Zhang, David Wadden, Bo Zhang, Noah A. Smith

However, since quality is a challenging construct to estimate, we use the negative outcome control method, using paper citation count as a control variable to debias the quality confounding effect.

Causal Inference

Paper
Code

Reproducibility in NLP: What Have We Learned from the Checklist?

no code implementations • 16 Jun 2023 • Ian Magnusson, Noah A. Smith, Jesse Dodge

Scientific progress in NLP rests on the reproducibility of researchers' claims.

Paper
Add Code

Morphosyntactic probing of multilingual BERT models

1 code implementation • 9 Jun 2023 • Judit Acs, Endre Hamerlik, Roy Schwartz, Noah A. Smith, Andras Kornai

We introduce an extensive dataset for multilingual probing of morphological information in language models (247 tasks across 42 languages from 10 families), each consisting of a sentence with a target word and a morphological tag as the desired label, derived from the Universal Dependencies treebanks.

Sentence TAG

Paper
Code

How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources

1 code implementation • NeurIPS 2023 • Yizhong Wang, Hamish Ivison, Pradeep Dasigi, Jack Hessel, Tushar Khot, Khyathi Raghavi Chandu, David Wadden, Kelsey MacMillan, Noah A. Smith, Iz Beltagy, Hannaneh Hajishirzi

Our evaluations show that the best model in any given evaluation reaches on average 87% of ChatGPT performance, and 73% of GPT-4 performance, suggesting that further investment in building better base models and instruction-tuning data is required to close the gap.

Instruction Following

976

Paper
Code

Stubborn Lexical Bias in Data and Models

no code implementations • 3 Jun 2023 • Sofia Serrano, Jesse Dodge, Noah A. Smith

Using a new statistical method, we examine whether such spurious patterns in data appear in models trained on the data.

Natural Language Inference

Paper
Add Code

Fine-Grained Human Feedback Gives Better Rewards for Language Model Training

no code implementations • NeurIPS 2023 • Zeqiu Wu, Yushi Hu, Weijia Shi, Nouha Dziri, Alane Suhr, Prithviraj Ammanabrolu, Noah A. Smith, Mari Ostendorf, Hannaneh Hajishirzi

We introduce Fine-Grained RLHF, a framework that enables training and learning from reward functions that are fine-grained in two respects: (1) density, providing a reward after every segment (e. g., a sentence) is generated; and (2) incorporating multiple reward models associated with different feedback types (e. g., factual incorrectness, irrelevance, and information incompleteness).

Language Modelling Long Form Question Answering +2

Paper
Add Code

Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models

no code implementations • 23 May 2023 • Orevaoghene Ahia, Sachin Kumar, Hila Gonen, Jungo Kasai, David R. Mortensen, Noah A. Smith, Yulia Tsvetkov

Language models have graduated from being research prototypes to commercialized products offered as web APIs, and recent works have highlighted the multilingual capabilities of these products.

Fairness Language Modelling

Paper
Add Code

How Language Model Hallucinations Can Snowball

1 code implementation • 22 May 2023 • Muru Zhang, Ofir Press, William Merrill, Alisa Liu, Noah A. Smith

A major risk of using language models in practical applications is their tendency to hallucinate incorrect statements.

Hallucination Language Modelling +1

Paper
Code

Vera: A General-Purpose Plausibility Estimation Model for Commonsense Statements

1 code implementation • 5 May 2023 • Jiacheng Liu, Wenya Wang, Dianzhuo Wang, Noah A. Smith, Yejin Choi, Hannaneh Hajishirzi

Despite the much discussed capabilities of today's language models, they are still prone to silly and unexpected commonsense failures.

Paper
Code

We're Afraid Language Models Aren't Modeling Ambiguity

1 code implementation • 27 Apr 2023 • Alisa Liu, Zhaofeng Wu, Julian Michael, Alane Suhr, Peter West, Alexander Koller, Swabha Swayamdipta, Noah A. Smith, Yejin Choi

We find that the task remains extremely challenging, including for GPT-4, whose generated disambiguations are considered correct only 32% of the time in human evaluation, compared to 90% for disambiguations in our dataset.

Sentence

Paper
Code

Scaling Expert Language Models with Unsupervised Domain Discovery

1 code implementation • 24 Mar 2023 • Suchin Gururangan, Margaret Li, Mike Lewis, Weijia Shi, Tim Althoff, Noah A. Smith, Luke Zettlemoyer

Large language models are typically trained densely: all parameters are updated with respect to all inputs.

Language Modelling

104

Paper
Code

NarrowBERT: Accelerating Masked Language Model Pretraining and Inference

1 code implementation • 11 Jan 2023 • Haoxin Li, Phillip Keung, Daniel Cheng, Jungo Kasai, Noah A. Smith

We propose NarrowBERT, a modified transformer encoder that increases the throughput for masked language model pretraining by more than $2\times$.

Language Modelling NER +2

Paper
Code

PromptCap: Prompt-Guided Image Captioning for VQA with GPT-3

no code implementations • ICCV 2023 • Yushi Hu, Hang Hua, Zhengyuan Yang, Weijia Shi, Noah A. Smith, Jiebo Luo

PromptCap outperforms generic captions by a large margin and achieves state-of-the-art accuracy on knowledge-based VQA tasks (60. 4% on OK-VQA and 59. 6% on A-OKVQA).

Image Captioning Question Answering +3

Paper
Add Code

Self-Instruct: Aligning Language Models with Self-Generated Instructions

16 code implementations • 20 Dec 2022 • Yizhong Wang, Yeganeh Kordi, Swaroop Mishra, Alisa Liu, Noah A. Smith, Daniel Khashabi, Hannaneh Hajishirzi

Applying our method to the vanilla GPT3, we demonstrate a 33% absolute improvement over the original model on Super-NaturalInstructions, on par with the performance of InstructGPT-001, which was trained with private user data and human annotations.

Instruction Following Language Modelling

28,723

Paper
Code

One Embedder, Any Task: Instruction-Finetuned Text Embeddings

3 code implementations • 19 Dec 2022 • Hongjin Su, Weijia Shi, Jungo Kasai, Yizhong Wang, Yushi Hu, Mari Ostendorf, Wen-tau Yih, Noah A. Smith, Luke Zettlemoyer, Tao Yu

Our analysis suggests that INSTRUCTOR is robust to changes in instructions, and that instruction finetuning mitigates the challenge of training a single model on diverse datasets.

Information Retrieval Learning Word Embeddings +3

3,997

Paper
Code

Demystifying Prompts in Language Models via Perplexity Estimation

no code implementations • 8 Dec 2022 • Hila Gonen, Srini Iyer, Terra Blevins, Noah A. Smith, Luke Zettlemoyer

Language models can be prompted to perform a wide variety of zero- and few-shot learning problems.

Few-Shot Learning

Paper
Add Code

Data-Efficient Finetuning Using Cross-Task Nearest Neighbors

1 code implementation • 1 Dec 2022 • Hamish Ivison, Noah A. Smith, Hannaneh Hajishirzi, Pradeep Dasigi

Obtaining labeled data to train a model for a task of interest is often expensive.

Paper
Code

Domain Mismatch Doesn't Always Prevent Cross-Lingual Transfer Learning

no code implementations • 30 Nov 2022 • Daniel Edmiston, Phillip Keung, Noah A. Smith

Bilingual Lexicon Induction Cross-Lingual Transfer +5

Paper
Add Code

How Much Does Attention Actually Attend? Questioning the Importance of Attention in Pretrained Transformers

1 code implementation • 7 Nov 2022 • Michael Hassid, Hao Peng, Daniel Rotem, Jungo Kasai, Ivan Montero, Noah A. Smith, Roy Schwartz

Our results motivate research on simpler alternatives to input-dependent attention, as well as on methods for better utilization of this mechanism in the Transformer architecture.

Paper
Code

Modeling Context With Linear Attention for Scalable Document-Level Translation

1 code implementation • 16 Oct 2022 • Zhaofeng Wu, Hao Peng, Nikolaos Pappas, Noah A. Smith

Document-level machine translation leverages inter-sentence dependencies to produce more coherent and consistent translations.

Document Level Machine Translation Document Translation +4

Paper
Code

Transparency Helps Reveal When Language Models Learn Meaning

1 code implementation • 14 Oct 2022 • Zhaofeng Wu, William Merrill, Hao Peng, Iz Beltagy, Noah A. Smith

Many current NLP systems are built from language models trained to optimize unsupervised objectives on large amounts of raw text.

Paper
Code

Measuring and Narrowing the Compositionality Gap in Language Models

1 code implementation • 7 Oct 2022 • Ofir Press, Muru Zhang, Sewon Min, Ludwig Schmidt, Noah A. Smith, Mike Lewis

We investigate the ability of language models to perform compositional reasoning tasks where the overall solution depends on correctly composing the answers to sub-problems.

Ranked #4 on Question Answering on Bamboogle

Question Answering

281

Paper
Code

Binding Language Models in Symbolic Languages

1 code implementation • 6 Oct 2022 • Zhoujun Cheng, Tianbao Xie, Peng Shi, Chengzu Li, Rahul Nadkarni, Yushi Hu, Caiming Xiong, Dragomir Radev, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Tao Yu

We propose Binder, a training-free neural-symbolic framework that maps the task input to a program, which (1) allows binding a unified API of language model (LM) functionalities to a programming language (e. g., SQL, Python) to extend its grammar coverage and thus tackle more diverse questions, (2) adopts an LM as both the program parser and the underlying model called by the API during execution, and (3) requires only a few in-context exemplar annotations.

Ranked #4 on Table-based Fact Verification on TabFact

Language Modelling Semantic Parsing +1

274

Paper
Code

Selective Annotation Makes Language Models Better Few-Shot Learners

1 code implementation • 5 Sep 2022 • Hongjin Su, Jungo Kasai, Chen Henry Wu, Weijia Shi, Tianlu Wang, Jiayi Xin, Rui Zhang, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Tao Yu

Departing from recent in-context learning methods, we formulate an annotation-efficient, two-step framework: selective annotation that chooses a pool of examples to annotate from unlabeled data in advance, followed by prompt retrieval that retrieves task examples from the annotated pool at test time.

Code Generation In-Context Learning +1

Paper
Code

Elaboration-Generating Commonsense Question Answering at Scale

1 code implementation • 2 Sep 2022 • Wenya Wang, Vivek Srikumar, Hanna Hajishirzi, Noah A. Smith

In question answering requiring common sense, language models (e. g., GPT-3) have been used to generate text expressing background knowledge that helps improve performance.

Common Sense Reasoning Question Answering

Paper
Code

Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models

2 code implementations • 5 Aug 2022 • Margaret Li, Suchin Gururangan, Tim Dettmers, Mike Lewis, Tim Althoff, Noah A. Smith, Luke Zettlemoyer

New ELMs are learned by branching from (mixtures of) ELMs in the current set, further training the parameters on data for the new domain, and then merging the resulting model back into the set for future use.

155

Paper
Code

RealTime QA: What's the Answer Right Now?

1 code implementation • NeurIPS 2023 • Jungo Kasai, Keisuke Sakaguchi, Yoichi Takahashi, Ronan Le Bras, Akari Asai, Xinyan Yu, Dragomir Radev, Noah A. Smith, Yejin Choi, Kentaro Inui

We introduce REALTIME QA, a dynamic question answering (QA) platform that announces questions and evaluates systems on a regular basis (weekly in this version).

Information Retrieval Question Answering +1

Paper
Code

Measuring the Carbon Intensity of AI in Cloud Instances

no code implementations • 10 Jun 2022 • Jesse Dodge, Taylor Prewitt, Remi Tachet des Combes, Erika Odmark, Roy Schwartz, Emma Strubell, Alexandra Sasha Luccioni, Noah A. Smith, Nicole DeCario, Will Buchanan

By providing unprecedented access to computational resources, cloud computing has enabled rapid growth in technologies such as machine learning, the computational demands of which incur a high energy cost and a commensurate carbon footprint.

Cloud Computing Language Modelling

Paper
Add Code

Unsupervised Learning of Hierarchical Conversation Structure

1 code implementation • 24 May 2022 • Bo-Ru Lu, Yushi Hu, Hao Cheng, Noah A. Smith, Mari Ostendorf

Human conversations can evolve in many different ways, creating challenges for automatic understanding and summarization.

Paper
Code

Twist Decoding: Diverse Generators Guide Each Other

1 code implementation • 19 May 2022 • Jungo Kasai, Keisuke Sakaguchi, Ronan Le Bras, Hao Peng, Ximing Lu, Dragomir Radev, Yejin Choi, Noah A. Smith

Our extensive evaluations on machine translation and scientific paper summarization demonstrate that Twist decoding substantially outperforms each model decoded in isolation over various scenarios, including cases where domain-specific and general-purpose models are both available.

Machine Translation Text Generation +1

Paper
Code

Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

7 code implementations • 16 Apr 2022 • Yizhong Wang, Swaroop Mishra, Pegah Alipoormolabashi, Yeganeh Kordi, Amirreza Mirzaei, Anjana Arunkumar, Arjun Ashok, Arut Selvan Dhanasekaran, Atharva Naik, David Stap, Eshaan Pathak, Giannis Karamanolakis, Haizhi Gary Lai, Ishan Purohit, Ishani Mondal, Jacob Anderson, Kirby Kuznia, Krima Doshi, Maitreya Patel, Kuntal Kumar Pal, Mehrad Moradshahi, Mihir Parmar, Mirali Purohit, Neeraj Varshney, Phani Rohitha Kaza, Pulkit Verma, Ravsehaj Singh Puri, Rushang Karia, Shailaja Keyur Sampat, Savan Doshi, Siddhartha Mishra, Sujan Reddy, Sumanta Patro, Tanay Dixit, Xudong Shen, Chitta Baral, Yejin Choi, Noah A. Smith, Hannaneh Hajishirzi, Daniel Khashabi

This large and diverse collection of tasks enables rigorous benchmarking of cross-task generalization under instructions -- training models to follow instructions on a subset of tasks and evaluating them on the remaining unseen ones.

Benchmarking Instruction Following

894

Paper
Code

A Call for Clarity in Beam Search: How It Works and When It Stops

1 code implementation • 11 Apr 2022 • Jungo Kasai, Keisuke Sakaguchi, Ronan Le Bras, Dragomir Radev, Yejin Choi, Noah A. Smith

Based on this finding, we introduce a patience factor, a simple modification to this beam decoding implementation, that generalizes the stopping criterion and provides flexibility to the depth of search.

Machine Translation Text Generation +2

Paper
Code

In-Context Learning for Few-Shot Dialogue State Tracking

1 code implementation • 16 Mar 2022 • Yushi Hu, Chia-Hsuan Lee, Tianbao Xie, Tao Yu, Noah A. Smith, Mari Ostendorf

In this work, we propose an in-context learning (ICL) framework for zero-shot and few-shot learning DST, where a large pre-trained language model (LM) takes a test instance and a few exemplars as input, and directly decodes the dialogue state without any parameter updates.

Dialogue State Tracking Few-Shot Learning +3

Paper
Code

Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection

no code implementations • 25 Jan 2022 • Suchin Gururangan, Dallas Card, Sarah K. Dreier, Emily K. Gade, Leroy Z. Wang, Zeyu Wang, Luke Zettlemoyer, Noah A. Smith

Language models increasingly rely on massive web dumps for diverse text data.

Language Modelling

Paper
Add Code

UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models

1 code implementation • 16 Jan 2022 • Tianbao Xie, Chen Henry Wu, Peng Shi, Ruiqi Zhong, Torsten Scholak, Michihiro Yasunaga, Chien-Sheng Wu, Ming Zhong, Pengcheng Yin, Sida I. Wang, Victor Zhong, Bailin Wang, Chengzu Li, Connor Boyle, Ansong Ni, Ziyu Yao, Dragomir Radev, Caiming Xiong, Lingpeng Kong, Rui Zhang, Noah A. Smith, Luke Zettlemoyer, Tao Yu

Structured knowledge grounding (SKG) leverages structured knowledge to complete user requests, such as semantic parsing over databases and question answering over knowledge bases.

Ranked #1 on Task-Oriented Dialogue Systems on KVRET

Few-Shot Learning Question Answering +3

529

Paper
Code

WANLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation

1 code implementation • 16 Jan 2022 • Alisa Liu, Swabha Swayamdipta, Noah A. Smith, Yejin Choi

Starting with an existing dataset, MultiNLI for natural language inference (NLI), our approach uses dataset cartography to automatically identify examples that demonstrate challenging reasoning patterns, and instructs GPT-3 to compose new examples with similar patterns.

Natural Language Inference Text Generation

Paper
Code

Imagined versus Remembered Stories: Quantifying Differences in Narrative Flow

no code implementations • 7 Jan 2022 • Maarten Sap, Anna Jafarpour, Yejin Choi, Noah A. Smith, James W. Pennebaker, Eric Horvitz

We quantify the differences between autobiographical and imagined stories by introducing sequentiality, a measure of narrative flow of events, drawing probabilistic inferences from a cutting-edge large language model (GPT-3).

Language Modelling Large Language Model +2

Paper
Add Code

NeuroLogic A*esque Decoding: Constrained Text Generation with Lookahead Heuristics

1 code implementation • NAACL 2022 • Ximing Lu, Sean Welleck, Peter West, Liwei Jiang, Jungo Kasai, Daniel Khashabi, Ronan Le Bras, Lianhui Qin, Youngjae Yu, Rowan Zellers, Noah A. Smith, Yejin Choi

To enable constrained generation, we build on NeuroLogic decoding (Lu et al., 2021), combining its flexibility in incorporating logical constraints with A*esque estimates of future constraint satisfaction.

Ranked #1 on Text Generation on ROCStories

Machine Translation Table-to-Text Generation

Paper
Code

Bidimensional Leaderboards: Generate and Evaluate Language Hand in Hand

2 code implementations • NAACL 2022 • Jungo Kasai, Keisuke Sakaguchi, Ronan Le Bras, Lavinia Dunagan, Jacob Morrison, Alexander R. Fabbri, Yejin Choi, Noah A. Smith

We therefore propose a generalization of leaderboards, bidimensional leaderboards (Billboards), that simultaneously tracks progress in language generation models and metrics for their evaluation.

Image Captioning Machine Translation +1

Paper
Code

Transparent Human Evaluation for Image Captioning

2 code implementations • NAACL 2022 • Jungo Kasai, Keisuke Sakaguchi, Lavinia Dunagan, Jacob Morrison, Ronan Le Bras, Yejin Choi, Noah A. Smith

We establish THumB, a rubric-based human evaluation protocol for image captioning models.

Image Captioning

Paper
Code

Annotators with Attitudes: How Annotator Beliefs And Identities Bias Toxic Language Detection

no code implementations • NAACL 2022 • Maarten Sap, Swabha Swayamdipta, Laura Vianna, Xuhui Zhou, Yejin Choi, Noah A. Smith

The perceived toxicity of language can vary based on someone's identity and beliefs, but this variation is often ignored when collecting toxic language datasets, resulting in dataset and model biases.

Paper
Add Code

Time Waits for No One! Analysis and Challenges of Temporal Misalignment

1 code implementation • NAACL 2022 • Kelvin Luu, Daniel Khashabi, Suchin Gururangan, Karishma Mandyam, Noah A. Smith

When an NLP model is trained on text data from one time period and tested or deployed on data from another, the resulting temporal misalignment can degrade end-task performance.

Paper
Code

ABC: Attention with Bounded-memory Control

no code implementations • ACL 2022 • Hao Peng, Jungo Kasai, Nikolaos Pappas, Dani Yogatama, Zhaofeng Wu, Lingpeng Kong, Roy Schwartz, Noah A. Smith

One way to improve the efficiency is to bound the memory size.

Language Modelling Machine Translation

Paper
Add Code

Expected Validation Performance and Estimation of a Random Variable's Maximum

no code implementations • 1 Oct 2021 • Jesse Dodge, Suchin Gururangan, Dallas Card, Roy Schwartz, Noah A. Smith

We find that the two biased estimators lead to the fewest incorrect conclusions, which hints at the importance of minimizing variance and MSE.

Paper
Add Code

Sentence Bottleneck Autoencoders from Transformer Language Models

1 code implementation • EMNLP 2021 • Ivan Montero, Nikolaos Pappas, Noah A. Smith

Representation learning for text via pretraining a language model on a large corpus has become a standard starting point for building NLP systems.

Denoising Language Modelling +6

Paper
Code

Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation

7 code implementations • ICLR 2022 • Ofir Press, Noah A. Smith, Mike Lewis

Since the introduction of the transformer model by Vaswani et al. (2017), a fundamental question has yet to be answered: how does a model achieve extrapolation at inference time for sequences that are longer than it saw during training?

Inductive Bias Playing the Game of 2048 +2

47,189

Paper
Code

DEMix Layers: Disentangling Domains for Modular Language Modeling

2 code implementations • NAACL 2022 • Suchin Gururangan, Mike Lewis, Ari Holtzman, Noah A. Smith, Luke Zettlemoyer

We introduce a new domain expert mixture (DEMix) layer that enables conditioning a language model (LM) on the domain of the input text.

Language Modelling

Paper
Code

All That's `Human' Is Not Gold: Evaluating Human Evaluation of Generated Text

no code implementations • ACL 2021 • Elizabeth Clark, Tal August, Sofia Serrano, Nikita Haduong, Suchin Gururangan, Noah A. Smith

Human evaluations are typically considered the gold standard in natural language generation, but as models{'} fluency improves, how well can evaluators detect and judge machine-generated text?

nlg evaluation Text Generation

Paper
Add Code

Is GPT-3 Text Indistinguishable from Human Text? Scarecrow: A Framework for Scrutinizing Machine Text

no code implementations • ACL 2022 • Yao Dou, Maxwell Forbes, Rik Koncel-Kedziorski, Noah A. Smith, Yejin Choi

To support the broad range of real machine errors that can be identified by laypeople, the ten error categories of Scarecrow -- such as redundancy, commonsense errors, and incoherence -- are identified through several rounds of crowd annotation experiments without a predefined ontology.

Math Text Generation

Paper
Add Code

All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated Text

no code implementations • 30 Jun 2021 • Elizabeth Clark, Tal August, Sofia Serrano, Nikita Haduong, Suchin Gururangan, Noah A. Smith

Human evaluations are typically considered the gold standard in natural language generation, but as models' fluency improves, how well can evaluators detect and judge machine-generated text?

nlg evaluation Text Generation

Paper
Add Code

Saturated Transformers are Constant-Depth Threshold Circuits

no code implementations • 30 Jun 2021 • William Merrill, Ashish Sabharwal, Noah A. Smith

Transformers have become a standard neural network architecture for many NLP problems, motivating theoretical analysis of their power in terms of formal languages.

Hard Attention

Paper
Add Code

Scientific Language Models for Biomedical Knowledge Base Completion: An Empirical Study

1 code implementation • AKBC 2021 • Rahul Nadkarni, David Wadden, Iz Beltagy, Noah A. Smith, Hannaneh Hajishirzi, Tom Hope

Biomedical knowledge graphs (KGs) hold rich information on entities such as diseases, drugs, and genes.

Knowledge Base Completion Knowledge Graphs +1

Paper
Code

Specializing Multilingual Language Models: An Empirical Study

1 code implementation • EMNLP (MRL) 2021 • Ethan C. Chau, Noah A. Smith

Pretrained multilingual language models have become a common tool in transferring NLP capabilities to low-resource languages, often with adaptations.

Dependency Parsing named-entity-recognition +5

Paper
Code

Choose Your Own Adventure: Paired Suggestions in Collaborative Writing for Evaluating Story Generation Models

no code implementations • NAACL 2021 • Elizabeth Clark, Noah A. Smith

Story generation is an open-ended and subjective task, which poses a challenge for evaluating story generation models.

Story Generation

Paper
Add Code

DExperts: Decoding-Time Controlled Text Generation with Experts and Anti-Experts

1 code implementation • ACL 2021 • Alisa Liu, Maarten Sap, Ximing Lu, Swabha Swayamdipta, Chandra Bhagavatula, Noah A. Smith, Yejin Choi

Despite recent advances in natural language generation, it remains challenging to control attributes of generated text.

Language Modelling Text Generation

104

Paper
Code

A Dataset of Information-Seeking Questions and Answers Anchored in Research Papers

1 code implementation • NAACL 2021 • Pradeep Dasigi, Kyle Lo, Iz Beltagy, Arman Cohan, Noah A. Smith, Matt Gardner

Readers of academic research papers often read with the goal of answering specific questions.

Ranked #1 on Evidence Selection on QASPER

Evidence Selection Question Answering

Paper
Code

Provable Limitations of Acquiring Meaning from Ungrounded Form: What Will Future Language Models Understand?

no code implementations • 22 Apr 2021 • William Merrill, Yoav Goldberg, Roy Schwartz, Noah A. Smith

We study whether assertions enable a system to emulate representations preserving semantic relations like equivalence.

Paper
Add Code

Go Forth and Prosper: Language Modeling with Ancient Textual History

1 code implementation • 18 Apr 2021 • Rik Koncel-Kedziorski, Noah A. Smith

This method can improve perplexity of pretrained LMs with no updates to the LM's own parameters.

Language Modelling

Paper
Code

Competency Problems: On Finding and Removing Artifacts in Language Data

no code implementations • EMNLP 2021 • Matt Gardner, William Merrill, Jesse Dodge, Matthew E. Peters, Alexis Ross, Sameer Singh, Noah A. Smith

In this work we argue that for complex language understanding tasks, all simple feature correlations are spurious, and we formalize this notion into a class of problems which we call competency problems.

Negation

Paper
Add Code

Probing Across Time: What Does RoBERTa Know and When?

1 code implementation • Findings (EMNLP) 2021 • Leo Z. Liu, Yizhong Wang, Jungo Kasai, Hannaneh Hajishirzi, Noah A. Smith

Models of language trained on very large corpora have been demonstrated useful for NLP.

Language Modelling

Paper
Code

Finetuning Pretrained Transformers into RNNs

1 code implementation • EMNLP 2021 • Jungo Kasai, Hao Peng, Yizhe Zhang, Dani Yogatama, Gabriel Ilharco, Nikolaos Pappas, Yi Mao, Weizhu Chen, Noah A. Smith

Specifically, we propose a swap-then-finetune procedure: in an off-the-shelf pretrained transformer, we replace the softmax attention with its linear-complexity recurrent alternative and then finetune.

Ranked #2 on Machine Translation on WMT2017 Chinese-English

Language Modelling Machine Translation +1

Paper
Code

Random Feature Attention

no code implementations • ICLR 2021 • Hao Peng, Nikolaos Pappas, Dani Yogatama, Roy Schwartz, Noah A. Smith, Lingpeng Kong

RFA can be used as a drop-in replacement for conventional softmax attention and offers a straightforward way of learning with recency bias through an optional gating mechanism.

Ranked #27 on Machine Translation on IWSLT2014 German-English

Language Modelling Machine Translation +3

Paper
Add Code

Challenges in Automated Debiasing for Toxic Language Detection

2 code implementations • EACL 2021 • Xuhui Zhou, Maarten Sap, Swabha Swayamdipta, Noah A. Smith, Yejin Choi

Overall, our findings show that debiasing a model trained on biased toxic language data is not as effective as simply relabeling the data to remove existing biases.

Fairness text-classification +1

Paper
Code

GENIE: Toward Reproducible and Standardized Human Evaluation for Text Generation

2 code implementations • 17 Jan 2021 • Daniel Khashabi, Gabriel Stanovsky, Jonathan Bragg, Nicholas Lourie, Jungo Kasai, Yejin Choi, Noah A. Smith, Daniel S. Weld

While often assumed a gold standard, effective human evaluation of text generation remains an important, open area for research.

Machine Translation Reading Comprehension +2

Paper
Code

Shortformer: Better Language Modeling using Shorter Inputs

1 code implementation • ACL 2021 • Ofir Press, Noah A. Smith, Mike Lewis

Increasing the input length has been a driver of progress in language modeling with transformers.

Ranked #26 on Language Modelling on WikiText-103

Language Modelling Position +1

146

Paper
Code

Infusing Finetuning with Semantic Dependencies

1 code implementation • 10 Dec 2020 • Zhaofeng Wu, Hao Peng, Noah A. Smith

For natural language processing systems, two kinds of evidence support the use of text representations from neural language models "pretrained" on large unannotated corpora: performance on application-inspired benchmarks (Peters et al., 2018, inter alia), and the emergence of syntactic abstractions in those representations (Tenney et al., 2019, inter alia).

Natural Language Understanding

Paper
Code

Thinking Like a Skeptic: Defeasible Inference in Natural Language

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Rachel Rudinger, Vered Shwartz, Jena D. Hwang, Chandra Bhagavatula, Maxwell Forbes, Ronan Le Bras, Noah A. Smith, Yejin Choi

Defeasible inference is a mode of reasoning in which an inference (X is a bird, therefore X flies) may be weakened or overturned in light of new evidence (X is a penguin).

Common Sense Reasoning Natural Language Inference +1

Paper
Code

Measuring Association Between Labels and Free-Text Rationales

1 code implementation • EMNLP 2021 • Sarah Wiegreffe, Ana Marasović, Noah A. Smith

In interpretable NLP, we require faithful rationales that reflect the model's decision-making process for an explained instance.

Decision Making Feature Importance +2

Paper
Code

Natural Language Rationales with Full-Stack Visual Reasoning: From Pixels to Semantic Frames to Commonsense Graphs

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Ana Marasović, Chandra Bhagavatula, Jae Sung Park, Ronan Le Bras, Noah A. Smith, Yejin Choi

Natural language rationales could provide intuitive, higher-level explanations that are easily understandable by humans, complementing the more broadly studied lower-level explanations based on gradients or attention weights.

Language Modelling Natural Language Inference +4

Paper
Code

Unsupervised Bitext Mining and Translation via Self-trained Contextual Embeddings

no code implementations • 15 Oct 2020 • Phillip Keung, Julian Salazar, Yichao Lu, Noah A. Smith

We then improve an XLM-based unsupervised neural MT system pre-trained on Wikipedia by supplementing it with pseudo-parallel text mined from the same corpus, boosting unsupervised translation performance by up to 3. 5 BLEU on the WMT'14 French-English and WMT'16 German-English tasks and outperforming the previous state-of-the-art.

Machine Translation Sentence +2

Paper
Add Code

The Multilingual Amazon Reviews Corpus

1 code implementation • EMNLP 2020 • Phillip Keung, Yichao Lu, György Szarvas, Noah A. Smith

We present the Multilingual Amazon Reviews Corpus (MARC), a large-scale collection of Amazon reviews for multilingual text classification.

General Classification Multilingual text classification +4

Paper
Code

Plug and Play Autoencoders for Conditional Text Generation

1 code implementation • EMNLP 2020 • Florian Mai, Nikolaos Pappas, Ivan Montero, Noah A. Smith, James Henderson

Text autoencoders are commonly used for conditional generation tasks such as style transfer.

Conditional Text Generation Navigate +1

Paper
Code

Multilevel Text Alignment with Cross-Document Attention

1 code implementation • EMNLP 2020 • Xuhui Zhou, Nikolaos Pappas, Noah A. Smith

Text alignment finds application in tasks such as citation recommendation and plagiarism detection.

Citation Recommendation Sentence

Paper
Code

Evaluating NLP Models via Contrast Sets

no code implementations • 1 Oct 2020 • Matt Gardner, Yoav Artzi, Victoria Basmova, Jonathan Berant, Ben Bogin, Sihao Chen, Pradeep Dasigi, Dheeru Dua, Yanai Elazar, Ananth Gottumukkala, Nitish Gupta, Hanna Hajishirzi, Gabriel Ilharco, Daniel Khashabi, Kevin Lin, Jiangming Liu, Nelson F. Liu, Phoebe Mulcaire, Qiang Ning, Sameer Singh, Noah A. Smith, Sanjay Subramanian, Reut Tsarfaty, Eric Wallace, A. Zhang, Ben Zhou

Unfortunately, when a dataset has systematic gaps (e. g., annotation artifacts), these evaluations are misleading: a model can learn simple decision rules that perform well on the test set but do not capture a dataset's intended capabilities.

Reading Comprehension Sentiment Analysis

Paper
Add Code

Parsing with Multilingual BERT, a Small Corpus, and a Small Treebank

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Ethan C. Chau, Lucy H. Lin, Noah A. Smith

Pretrained multilingual contextual representations have shown great success, but due to the limits of their pretraining data, their benefits do not apply equally to all language varieties.

Dependency Parsing

Paper
Code

Grounded Compositional Outputs for Adaptive Language Modeling

1 code implementation • EMNLP 2020 • Nikolaos Pappas, Phoebe Mulcaire, Noah A. Smith

To our knowledge, the result is the first word-level language model with a size that does not depend on the training vocabulary.

Language Modelling

Paper
Code

RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models

2 code implementations • Findings of the Association for Computational Linguistics 2020 • Samuel Gehman, Suchin Gururangan, Maarten Sap, Yejin Choi, Noah A. Smith

We investigate the extent to which pretrained LMs can be prompted to generate toxic language, and the effectiveness of controllable text generation algorithms at preventing such toxic degeneration.

Sentence Text Generation

163

Paper
Code

Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics

6 code implementations • EMNLP 2020 • Swabha Swayamdipta, Roy Schwartz, Nicholas Lourie, Yizhong Wang, Hannaneh Hajishirzi, Noah A. Smith, Yejin Choi

Experiments across four datasets show that these model-dependent measures reveal three distinct regions in the data map, each with pronounced characteristics.

Model Optimization Out-of-Distribution Generalization

183

Paper
Code

A Mixture of h - 1 Heads is Better than h Heads

no code implementations • ACL 2020 • Hao Peng, Roy Schwartz, Dianqi Li, Noah A. Smith

Multi-head attentive neural architectures have achieved state-of-the-art results on a variety of natural language processing tasks.

Language Modelling Machine Translation +1

Paper
Add Code

Exploring the Effect of Author and Reader Identity in Online Story Writing: the STORIESINTHEWILD Corpus.

no code implementations • WS 2020 • Tal August, Maarten Sap, Elizabeth Clark, Katharina Reinecke, Noah A. Smith

We analyze the effect of author and reader characteristics and story writing setup on the quality of stories in a short storytelling task.

Paper
Add Code

Recollection versus Imagination: Exploring Human Memory and Cognition via Neural Language Models

no code implementations • ACL 2020 • Maarten Sap, Eric Horvitz, Yejin Choi, Noah A. Smith, James Pennebaker

We introduce a measure of narrative flow and use this to examine the narratives for imagined and recalled events.

Paper
Add Code

Deep Encoder, Shallow Decoder: Reevaluating Non-autoregressive Machine Translation

2 code implementations • ICLR 2021 • Jungo Kasai, Nikolaos Pappas, Hao Peng, James Cross, Noah A. Smith

We show that the speed disadvantage for autoregressive baselines compared to non-autoregressive methods has been overestimated in three aspects: suboptimal layer allocation, insufficient speed measurement, and lack of knowledge distillation.

Knowledge Distillation Machine Translation +1

Paper
Code

Multilingual and Interlingual Semantic Representations for Natural Language Processing: A Brief Introduction

no code implementations • CL 2020 • Marta R. Costa-juss{\`a}, Cristina Espa{\~n}a-Bonet, Pascale Fung, Noah A. Smith

We introduce the Computational Linguistics special issue on Multilingual and Interlingual Semantic Representations for Natural Language Processing.

Paper
Add Code

A Mixture of $h-1$ Heads is Better than $h$ Heads

no code implementations • 13 May 2020 • Hao Peng, Roy Schwartz, Dianqi Li, Noah A. Smith

Multi-head attentive neural architectures have achieved state-of-the-art results on a variety of natural language processing tasks.

Language Modelling Machine Translation +1

Paper
Add Code

Don't Stop Pretraining: Adapt Language Models to Domains and Tasks

6 code implementations • ACL 2020 • Suchin Gururangan, Ana Marasović, Swabha Swayamdipta, Kyle Lo, Iz Beltagy, Doug Downey, Noah A. Smith

Language models pretrained on text from a wide variety of sources form the foundation of today's NLP.

Citation Intent Classification

519

Paper
Code

A Formal Hierarchy of RNN Architectures

no code implementations • ACL 2020 • William Merrill, Gail Weiss, Yoav Goldberg, Roy Schwartz, Noah A. Smith, Eran Yahav

While formally extending these findings to unsaturated RNNs is left to future work, we hypothesize that the practical learnable capacity of unsaturated RNNs obeys a similar hierarchy.

Paper
Add Code

The Right Tool for the Job: Matching Model and Instance Complexities

1 code implementation • ACL 2020 • Roy Schwartz, Gabriel Stanovsky, Swabha Swayamdipta, Jesse Dodge, Noah A. Smith

Our method presents a favorable speed/accuracy tradeoff in almost all cases, producing models which are up to five times faster than the state of the art, while preserving their accuracy.

Natural Language Inference text-classification +1

Paper
Code

Evaluating Models' Local Decision Boundaries via Contrast Sets

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Matt Gardner, Yoav Artzi, Victoria Basmova, Jonathan Berant, Ben Bogin, Sihao Chen, Pradeep Dasigi, Dheeru Dua, Yanai Elazar, Ananth Gottumukkala, Nitish Gupta, Hanna Hajishirzi, Gabriel Ilharco, Daniel Khashabi, Kevin Lin, Jiangming Liu, Nelson F. Liu, Phoebe Mulcaire, Qiang Ning, Sameer Singh, Noah A. Smith, Sanjay Subramanian, Reut Tsarfaty, Eric Wallace, Ally Zhang, Ben Zhou

Reading Comprehension Sentiment Analysis

Paper
Code

Multi-View Learning for Vision-and-Language Navigation

no code implementations • 2 Mar 2020 • Qiaolin Xia, Xiujun Li, Chunyuan Li, Yonatan Bisk, Zhifang Sui, Jianfeng Gao, Yejin Choi, Noah A. Smith

Learning to navigate in a visual environment following natural language instructions is a challenging task because natural language instructions are highly variable, ambiguous, and under-specified.

MULTI-VIEW LEARNING Navigate +1

Paper
Add Code

Explaining Relationships Between Scientific Documents

1 code implementation • ACL 2021 • Kelvin Luu, Xinyi Wu, Rik Koncel-Kedziorski, Kyle Lo, Isabel Cachola, Noah A. Smith

We address the task of explaining relationships between two scientific documents using natural language text.

Language Modelling Large Language Model +1

Paper
Code

On Consequentialism and Fairness

no code implementations • 2 Jan 2020 • Dallas Card, Noah A. Smith

In this paper we provide a consequentialist critique of common definitions of fairness within machine learning, as well as a machine learning perspective on consequentialism.

BIG-bench Machine Learning Decision Making +2

Paper
Add Code

Improving Transformer Models by Reordering their Sublayers

2 code implementations • ACL 2020 • Ofir Press, Noah A. Smith, Omer Levy

Multilayer transformer networks consist of interleaved self-attention and feedforward sublayers.

Ranked #7 on Language Modelling on enwik8

Language Modelling Machine Translation +1

Paper
Code

Social Bias Frames: Reasoning about Social and Power Implications of Language

no code implementations • ACL 2020 • Maarten Sap, Saadia Gabriel, Lianhui Qin, Dan Jurafsky, Noah A. Smith, Yejin Choi

We introduce Social Bias Frames, a new conceptual formalism that aims to model the pragmatic frames in which people project social biases and stereotypes onto others.

Paper
Add Code

Situating Sentence Embedders with Nearest Neighbor Overlap

no code implementations • ICLR 2020 • Lucy H. Lin, Noah A. Smith

As distributed approaches to natural language semantics have developed and diversified, embedders for linguistic units larger than words have come to play an increasingly important role.

Sentence

Paper
Add Code

Low-Resource Parsing with Crosslingual Contextualized Representations

no code implementations • CONLL 2019 • Phoebe Mulcaire, Jungo Kasai, Noah A. Smith

Despite advances in dependency parsing, languages with small treebanks still present challenges.

Dependency Parsing

Paper
Add Code

Improving Natural Language Inference with a Pretrained Parser

1 code implementation • 18 Sep 2019 • Deric Pang, Lucy H. Lin, Noah A. Smith

We introduce a novel approach to incorporate syntax into natural language inference (NLI) models.

Natural Language Inference

Paper
Code

Knowledge Enhanced Contextual Word Representations

1 code implementation • IJCNLP 2019 • Matthew E. Peters, Mark Neumann, Robert L. Logan IV, Roy Schwartz, Vidur Joshi, Sameer Singh, Noah A. Smith

Contextual word representations, typically trained on unstructured, unlabeled text, do not contain any explicit grounding to real world entities and are often unable to remember facts about those entities.

Ranked #9 on Relation Classification on TACRED

Entity Linking Entity Typing +3

361

Paper
Code

Show Your Work: Improved Reporting of Experimental Results

4 code implementations • IJCNLP 2019 • Jesse Dodge, Suchin Gururangan, Dallas Card, Roy Schwartz, Noah A. Smith

Research in natural language processing proceeds, in part, by demonstrating that new models achieve superior performance (e. g., accuracy) on held-out test data, compared to previous results.

2,093

Paper
Code

RNN Architecture Learning with Sparse Regularization

1 code implementation • IJCNLP 2019 • Jesse Dodge, Roy Schwartz, Hao Peng, Noah A. Smith

Our method also highlights the interpretable properties of rational RNNs.

Sentiment Analysis

Paper
Code

PaLM: A Hybrid Parser and Language Model

1 code implementation • IJCNLP 2019 • Hao Peng, Roy Schwartz, Noah A. Smith

We present PaLM, a hybrid parser and neural language model.

Language Modelling

Paper
Code

Topics to Avoid: Demoting Latent Confounds in Text Classification

1 code implementation • IJCNLP 2019 • Sachin Kumar, Shuly Wintner, Noah A. Smith, Yulia Tsvetkov

Despite impressive performance on many text classification tasks, deep neural networks tend to learn frequent superficial patterns that are specific to the training data and do not always generalize well.

General Classification Native Language Identification +2

Paper
Code

Shallow Syntax in Deep Water

no code implementations • 29 Aug 2019 • Swabha Swayamdipta, Matthew Peters, Brendan Roof, Chris Dyer, Noah A. Smith

Shallow syntax provides an approximation of phrase-syntactic structure of sentences; it can be produced with high accuracy, and is computationally cheap to obtain.

Paper
Add Code

Quoref: A Reading Comprehension Dataset with Questions Requiring Coreferential Reasoning

1 code implementation • IJCNLP 2019 • Pradeep Dasigi, Nelson F. Liu, Ana Marasović, Noah A. Smith, Matt Gardner

Machine comprehension of texts longer than a single sentence often requires coreference resolution.

coreference-resolution Reading Comprehension +1

Paper
Code

Green AI

2 code implementations • 22 Jul 2019 • Roy Schwartz, Jesse Dodge, Noah A. Smith, Oren Etzioni

Moreover, the financial cost of the computations can make it difficult for academics, students, and researchers, in particular those from emerging economies, to engage in deep learning research.

4,755

Paper
Code

Sentence Mover's Similarity: Automatic Evaluation for Multi-Sentence Texts

no code implementations • ACL 2019 • Elizabeth Clark, Asli Celikyilmaz, Noah A. Smith

For evaluating machine-generated texts, automatic methods hold the promise of avoiding collection of human judgments, which can be expensive and time-consuming.

Paper
Add Code

The Risk of Racial Bias in Hate Speech Detection

no code implementations • ACL 2019 • Maarten Sap, Dallas Card, Saadia Gabriel, Yejin Choi, Noah A. Smith

We investigate how annotators{'} insensitivity to differences in dialect can lead to racial bias in automatic hate speech detection models, potentially amplifying harm against minority populations.

Hate Speech Detection

Paper
Add Code

Is Attention Interpretable?

1 code implementation • ACL 2019 • Sofia Serrano, Noah A. Smith

Attention mechanisms have recently boosted performance on a range of NLP tasks.

General Classification text-classification +1

Paper
Code

Variational Pretraining for Semi-supervised Text Classification

1 code implementation • ACL 2019 • Suchin Gururangan, Tam Dang, Dallas Card, Noah A. Smith

We accompany this paper with code to pretrain and use VAMPIRE embeddings in downstream tasks.

General Classification Semi-Supervised Text Classification

175

Paper
Code

Evaluating Gender Bias in Machine Translation

1 code implementation • ACL 2019 • Gabriel Stanovsky, Noah A. Smith, Luke Zettlemoyer

We present the first challenge set and evaluation protocol for the analysis of gender bias in machine translation (MT).

coreference-resolution Machine Translation +2

Paper
Code

Inoculation by Fine-Tuning: A Method for Analyzing Challenge Datasets

no code implementations • NAACL 2019 • Nelson F. Liu, Roy Schwartz, Noah A. Smith

Several datasets have recently been constructed to expose brittleness in models trained on existing benchmarks.

Paper
Add Code

Linguistic Knowledge and Transferability of Contextual Representations

no code implementations • NAACL 2019 • Nelson F. Liu, Matt Gardner, Yonatan Belinkov, Matthew E. Peters, Noah A. Smith

Contextual word representations derived from large-scale neural language models are successful across a diverse set of NLP tasks, suggesting that they encode useful and transferable features of language.

Language Modelling

Paper
Add Code

To Tune or Not to Tune? Adapting Pretrained Representations to Diverse Tasks

no code implementations • WS 2019 • Matthew E. Peters, Sebastian Ruder, Noah A. Smith

While most previous work has focused on different pretraining objectives and architectures for transfer learning, we ask how to best adapt the pretrained model to a given target task.

Transfer Learning

Paper
Add Code

Measuring Online Debaters' Persuasive Skill from Text over Time

no code implementations • TACL 2019 • Kelvin Luu, Chenhao Tan, Noah A. Smith

We build on a widely used model of skill in two-player games and augment it with linguistic features of a debater{'}s content.

Paper
Add Code

Polyglot Contextual Representations Improve Crosslingual Transfer

1 code implementation • NAACL 2019 • Phoebe Mulcaire, Jungo Kasai, Noah A. Smith

We introduce Rosita, a method to produce multilingual contextual word representations by training a single language model on text from multiple languages.

Dependency Parsing Language Modelling +5

Paper
Code

Contextual Word Representations: A Contextual Introduction

3 code implementations • 15 Feb 2019 • Noah A. Smith

This introduction aims to tell the story of how we put words into computers.

Question Answering Translation +1

186

Paper
Code

Deep Weighted Averaging Classifiers

2 code implementations • 6 Nov 2018 • Dallas Card, Michael Zhang, Noah A. Smith

Recent advances in deep learning have achieved impressive gains in classification accuracy on a variety of types of data, including images and text.

General Classification

Paper
Code

You May Not Need Attention

1 code implementation • 31 Oct 2018 • Ofir Press, Noah A. Smith

In NMT, how far can we get without attention and without separate encoding and decoding?

NMT Translation

292

Paper
Code

ATOMIC: An Atlas of Machine Commonsense for If-Then Reasoning

2 code implementations • 31 Oct 2018 • Maarten Sap, Ronan LeBras, Emily Allaway, Chandra Bhagavatula, Nicholas Lourie, Hannah Rashkin, Brendan Roof, Noah A. Smith, Yejin Choi

We present ATOMIC, an atlas of everyday commonsense reasoning, organized through 877k textual descriptions of inferential knowledge.

Relation

Paper
Code

Syntactic Scaffolds for Semantic Structures

1 code implementation • EMNLP 2018 • Swabha Swayamdipta, Sam Thomson, Kenton Lee, Luke Zettlemoyer, Chris Dyer, Noah A. Smith

We introduce the syntactic scaffold, an approach to incorporating syntactic information into semantic tasks.

coreference-resolution

Paper
Code

Neural Cross-Lingual Named Entity Recognition with Minimal Resources

1 code implementation • EMNLP 2018 • Jiateng Xie, Zhilin Yang, Graham Neubig, Noah A. Smith, Jaime Carbonell

To improve robustness to word order differences, we propose to use self-attention, which allows for a degree of flexibility with respect to word order.

named-entity-recognition Named Entity Recognition +2

Paper
Code

Semantic Matching Against a Corpus: New Applications and Methods

no code implementations • 28 Aug 2018 • Lucy H. Lin, Scott Miles, Noah A. Smith

We consider the case of a domain expert who wishes to explore the extent to which a particular idea is expressed in a text collection.

Paper
Add Code

Rational Recurrences

1 code implementation • EMNLP 2018 • Hao Peng, Roy Schwartz, Sam Thomson, Noah A. Smith

We characterize this connection formally, defining rational recurrences to be recurrent hidden state update functions that can be written as the Forward calculation of a finite set of WFSAs.

Language Modelling text-classification +1

Paper
Code

Bridging CNNs, RNNs, and Weighted Finite-State Machines

no code implementations • ACL 2018 • Roy Schwartz, Sam Thomson, Noah A. Smith

Recurrent and convolutional neural networks comprise two distinct families of models that have proven to be useful for encoding natural language utterances.

General Classification Representation Learning +3

Paper
Add Code

Discovering Phonesthemes with Sparse Regularization

no code implementations • WS 2018 • Nelson F. Liu, Gina-Anne Levow, Noah A. Smith

We introduce a simple method for extracting non-arbitrary form-meaning representations from a collection of semantic vectors.

feature selection

Paper
Add Code

Neural Text Generation in Stories Using Entity Representations as Context

no code implementations • NAACL 2018 • Elizabeth Clark, Yangfeng Ji, Noah A. Smith

We introduce an approach to neural text generation that explicitly represents entities mentioned in the text.

Dialogue Generation Representation Learning +1

Paper
Add Code

The Importance of Calibration for Estimating Proportions from Annotations

no code implementations • NAACL 2018 • Dallas Card, Noah A. Smith

Estimating label proportions in a target corpus is a type of measurement that is useful for answering certain types of social-scientific questions.

Sentiment Analysis Text Categorization

Paper
Add Code

LSTMs Exploit Linguistic Attributes of Data

no code implementations • WS 2018 • Nelson F. Liu, Omer Levy, Roy Schwartz, Chenhao Tan, Noah A. Smith

While recurrent neural networks have found success in a variety of natural language processing applications, they are general models of sequential data.

Memorization Open-Ended Question Answering

Paper
Add Code

Toward Abstractive Summarization Using Semantic Representations

1 code implementation • HLT 2015 • Fei Liu, Jeffrey Flanigan, Sam Thomson, Norman Sadeh, Noah A. Smith

We present a novel abstractive summarization framework that draws on the recent development of a treebank for the Abstract Meaning Representation (AMR).

Abstractive Text Summarization

Paper
Code

Event2Mind: Commonsense Inference on Events, Intents, and Reactions

no code implementations • ACL 2018 • Hannah Rashkin, Maarten Sap, Emily Allaway, Noah A. Smith, Yejin Choi

We investigate a new commonsense inference task: given an event described in a short free-form text ("X drinks coffee in the morning"), a system reasons about the likely intents ("X wants to stay awake") and reactions ("X feels alert") of the event's participants.

Ranked #1 on Common Sense Reasoning on Event2Mind test

Common Sense Reasoning

Paper
Add Code

SoPa: Bridging CNNs, RNNs, and Weighted Finite-State Machines

2 code implementations • 15 May 2018 • Roy Schwartz, Sam Thomson, Noah A. Smith

Recurrent and convolutional neural networks comprise two distinct families of models that have proven to be useful for encoding natural language utterances.

Explainable artificial intelligence General Classification +3

Paper
Code

Backpropagating through Structured Argmax using a SPIGOT

1 code implementation • ACL 2018 • Hao Peng, Sam Thomson, Noah A. Smith

We introduce the structured projection of intermediate gradients optimization technique (SPIGOT), a new method for backpropagating through neural networks that include hard-decision structured predictions (e. g., parsing) in intermediate layers.

Dependency Parsing Semantic Dependency Parsing +2

Paper
Code

Sounding Board: A User-Centric and Content-Driven Social Chatbot

no code implementations • NAACL 2018 • Hao Fang, Hao Cheng, Maarten Sap, Elizabeth Clark, Ari Holtzman, Yejin Choi, Noah A. Smith, Mari Ostendorf

We present Sounding Board, a social chatbot that won the 2017 Amazon Alexa Prize.

Chatbot Dialogue Management +2

Paper
Add Code

Parsing Tweets into Universal Dependencies

1 code implementation • NAACL 2018 • Yijia Liu, Yi Zhu, Wanxiang Che, Bing Qin, Nathan Schneider, Noah A. Smith

Nonetheless, using the new treebank, we build a pipeline system to parse raw tweets into UD.

Ranked #2 on Dependency Parsing on Tweebank

Computational Efficiency Dependency Parsing +1

Paper
Code

Learning Joint Semantic Parsers from Disjoint Data

2 code implementations • NAACL 2018 • Hao Peng, Sam Thomson, Swabha Swayamdipta, Noah A. Smith

We present a new approach to learning semantic parsers from multiple datasets, even when the target semantic formalisms are drastically different, and the underlying corpora do not overlap.

Dependency Parsing Semantic Dependency Parsing

Paper
Code

Annotation Artifacts in Natural Language Inference Data

no code implementations • NAACL 2018 • Suchin Gururangan, Swabha Swayamdipta, Omer Levy, Roy Schwartz, Samuel R. Bowman, Noah A. Smith

Large-scale datasets for natural language inference are created by presenting crowd workers with a sentence (premise), and asking them to generate three new sentences (hypotheses) that it entails, contradicts, or is logically neutral with respect to.

Natural Language Inference Negation +2

Paper
Add Code

"You are no Jack Kennedy": On Media Selection of Highlights from Presidential Debates

no code implementations • 23 Feb 2018 • Chenhao Tan, Hao Peng, Noah A. Smith

We first examine the effect of wording and propose a binary classification framework that controls for both the speaker and the debate situation.

Binary Classification

Paper
Add Code

Dynamic Entity Representations in Neural Language Models

2 code implementations • EMNLP 2017 • Yangfeng Ji, Chenhao Tan, Sebastian Martschat, Yejin Choi, Noah A. Smith

Understanding a long document requires tracking how entities are introduced and evolve over time.

Language Modelling

129

Paper
Code

End-to-End Neural Segmental Models for Speech Recognition

no code implementations • 1 Aug 2017 • Hao Tang, Liang Lu, Lingpeng Kong, Kevin Gimpel, Karen Livescu, Chris Dyer, Noah A. Smith, Steve Renals

Segmental models are an alternative to frame-based models for sequence prediction, where hypothesized path weights are based on entire segment scores rather than a single frame at a time.

speech-recognition Speech Recognition

Paper
Add Code

Frame-Semantic Parsing with Softmax-Margin Segmental RNNs and a Syntactic Scaffold

10 code implementations • 29 Jun 2017 • Swabha Swayamdipta, Sam Thomson, Chris Dyer, Noah A. Smith

We present a new, efficient frame-semantic parser that labels semantic arguments to FrameNet predicates.

Semantic Parsing

221

Paper
Code

Open Loop Hyperparameter Optimization and Determinantal Point Processes

no code implementations • ICLR 2018 • Jesse Dodge, Kevin Jamieson, Noah A. Smith

Driven by the need for parallelizable hyperparameter optimization methods, this paper studies \emph{open loop} search methods: sequences that are predetermined and can be generated before a single configuration is evaluated.

Hyperparameter Optimization Point Processes

Paper
Add Code

Greedy Transition-Based Dependency Parsing with Stack LSTMs

no code implementations • CL 2017 • Miguel Ballesteros, Chris Dyer, Yoav Goldberg, Noah A. Smith

During training, dynamic oracles alternate between sampling parser states from the training data and from the model as it is being learned, making the model more robust to the kinds of errors that will be made at test time.

Transition-Based Dependency Parsing

Paper
Add Code

Neural Models for Documents with Metadata

3 code implementations • ACL 2018 • Dallas Card, Chenhao Tan, Noah A. Smith

Most real-world document collections involve various types of metadata, such as author, source, and date, and yet the most commonly-used approaches to modeling text corpora ignore this information.

Topic Models Variational Inference

101

Paper
Code

Friendships, Rivalries, and Trysts: Characterizing Relations between Ideas in Texts

1 code implementation • ACL 2017 • Chenhao Tan, Dallas Card, Noah A. Smith

Combining two statistics --- cooccurrence within documents and prevalence correlation over time --- our approach reveals a number of different ways in which ideas can cooperate and compete.

Paper
Code

Deep Multitask Learning for Semantic Dependency Parsing

1 code implementation • ACL 2017 • Hao Peng, Sam Thomson, Noah A. Smith

We present a deep neural architecture that parses sentences into three semantic dependency graph formalisms.

Dependency Parsing Semantic Dependency Parsing

Paper
Code

Story Cloze Task: UW NLP System

no code implementations • WS 2017 • Roy Schwartz, Maarten Sap, Ioannis Konstas, Leila Zilles, Yejin Choi, Noah A. Smith

This paper describes University of Washington NLP{'}s submission for the Linking Models of Lexical, Sentential and Discourse-level Semantics (LSDSem 2017) shared task{---}the Story Cloze Task.

Language Modelling

Paper
Add Code

Multitask Learning with CTC and Segmental CRF for Speech Recognition

no code implementations • 21 Feb 2017 • Liang Lu, Lingpeng Kong, Chris Dyer, Noah A. Smith

Segmental conditional random fields (SCRFs) and connectionist temporal classification (CTC) are two sequence labeling methods used for end-to-end training of speech recognition models.

speech-recognition Speech Recognition

Paper
Add Code

The Effect of Different Writing Tasks on Linguistic Style: A Case Study of the ROC Story Cloze Task

1 code implementation • CONLL 2017 • Roy Schwartz, Maarten Sap, Ioannis Konstas, Li Zilles, Yejin Choi, Noah A. Smith

A writer's style depends not just on personal traits but also on her intent and mental state.

Language Modelling

Paper
Code

What Do Recurrent Neural Network Grammars Learn About Syntax?

1 code implementation • EACL 2017 • Adhiguna Kuncoro, Miguel Ballesteros, Lingpeng Kong, Chris Dyer, Graham Neubig, Noah A. Smith

We investigate what information they learn, from a linguistic perspective, through various ablations to the model and the data, and by augmenting the model with an attention mechanism (GA-RNNG) to enable closer inspection.

Ranked #20 on Constituency Parsing on Penn Treebank

Constituency Parsing Dependency Parsing +1

186

Paper
Code

A Neural Model for Language Identification in Code-Switched Tweets

no code implementations • WS 2016 • Aaron Jaech, George Mulcaire, Mari Ostendorf, Noah A. Smith

Language Identification Language Modelling +2

Paper
Add Code

Character Sequence Models for Colorful Words

no code implementations • EMNLP 2016 • Kazuya Kawakami, Chris Dyer, Bryan Routledge, Noah A. Smith

Language Modelling

Paper
Add Code

Semi-Supervised Learning of Sequence Models with Method of Moments

no code implementations • EMNLP 2016 • Zita Marinho, Andr{\'e} F. T. Martins, Shay B. Cohen, Noah A. Smith

Part-Of-Speech Tagging

Paper
Add Code

Friends with Motives: Using Text to Infer Influence on SCOTUS

no code implementations • EMNLP 2016 • Yanchuan Sim, Bryan Routledge, Noah A. Smith

Decision Making

Paper
Add Code

Analyzing Framing through the Casts of Characters in the News

no code implementations • EMNLP 2016 • Dallas Card, Justin Gross, Amber Boydstun, Noah A. Smith

Clustering Model Selection +1

Paper
Add Code

Training with Exploration Improves a Greedy Stack LSTM Parser

no code implementations • EMNLP 2016 • Miguel Ballesteros, Yoav Goldberg, Chris Dyer, Noah A. Smith

Paper
Add Code

Character Sequence Models for ColorfulWords

no code implementations • 28 Sep 2016 • Kazuya Kawakami, Chris Dyer, Bryan R. Routledge, Noah A. Smith

We present a neural network architecture to predict a point in color space from the sequence of characters in the color's name.

Paper
Add Code

Distilling an Ensemble of Greedy Dependency Parsers into One MST Parser

1 code implementation • EMNLP 2016 • Adhiguna Kuncoro, Miguel Ballesteros, Lingpeng Kong, Chris Dyer, Noah A. Smith

We introduce two first-order graph-based dependency parsers achieving a new state of the art.

Ranked #17 on Dependency Parsing on Penn Treebank

Dependency Parsing

Paper
Code

Hierarchical Character-Word Models for Language Identification

1 code implementation • WS 2016 • Aaron Jaech, George Mulcaire, Shobhit Hathi, Mari Ostendorf, Noah A. Smith

Social media messages' brevity and unconventional spelling pose a challenge to language identification.

Language Identification

Paper
Code

Greedy, Joint Syntactic-Semantic Parsing with Stack LSTMs

1 code implementation • CONLL 2016 • Swabha Swayamdipta, Miguel Ballesteros, Chris Dyer, Noah A. Smith

We present a transition-based parser that jointly produces syntactic and semantic dependencies.

Semantic Parsing

Paper
Code

CMU at SemEval-2016 Task 8: Graph-based AMR Parsing with Infinite Ramp Loss

no code implementations • SEMEVAL 2016 • Jeffrey Flanigan, Chris Dyer, Noah A. Smith, Jaime Carbonell

Ranked #3 on AMR Parsing on LDC2015E86

AMR Parsing Structured Prediction

Paper
Add Code

UW-CSE at SemEval-2016 Task 10: Detecting Multiword Expressions and Supersenses using Double-Chained Conditional Random Fields

1 code implementation • SEMEVAL 2016 • Mohammad Javad Hosseini, Noah A. Smith, Su-In Lee

Paper
Code

Generation from Abstract Meaning Representation using Tree Transducers

no code implementations • NAACL 2016 • Jeffrey Flanigan, Chris Dyer, Noah A. Smith, Jaime Carbonell

Language Modelling Machine Translation +1

Paper
Add Code

Training with Exploration Improves a Greedy Stack-LSTM Parser

no code implementations • 11 Mar 2016 • Miguel Ballesteros, Yoav Goldberg, Chris Dyer, Noah A. Smith

We adapt the greedy Stack-LSTM dependency parser of Dyer et al. (2015) to support a training-with-exploration procedure using dynamic oracles(Goldberg and Nivre, 2013) instead of cross-entropy minimization.

Ranked #2 on Chinese Dependency Parsing on Chinese Pennbank

Chinese Dependency Parsing Dependency Parsing

Paper
Add Code

Segmental Recurrent Neural Networks for End-to-end Speech Recognition

no code implementations • 1 Mar 2016 • Liang Lu, Lingpeng Kong, Chris Dyer, Noah A. Smith, Steve Renals

This model connects the segmental conditional random field (CRF) with a recurrent neural network (RNN) used for feature extraction.

Ranked #16 on Speech Recognition on TIMIT

Acoustic Modelling Language Modelling +2

Paper
Add Code

Recurrent Neural Network Grammars

6 code implementations • NAACL 2016 • Chris Dyer, Adhiguna Kuncoro, Miguel Ballesteros, Noah A. Smith

We introduce recurrent neural network grammars, probabilistic models of sentences with explicit phrase structure.

Ranked #25 on Constituency Parsing on Penn Treebank

Constituency Parsing Language Modelling

186

Paper
Code

Massively Multilingual Word Embeddings

1 code implementation • 5 Feb 2016 • Waleed Ammar, George Mulcaire, Yulia Tsvetkov, Guillaume Lample, Chris Dyer, Noah A. Smith

We introduce new methods for estimating and evaluating embeddings of words in more than fifty languages in a single shared embedding space.

Multilingual Word Embeddings Text Categorization

Paper
Code

Many Languages, One Parser

1 code implementation • TACL 2016 • Waleed Ammar, George Mulcaire, Miguel Ballesteros, Chris Dyer, Noah A. Smith

We train one multilingual model for dependency parsing and use it to parse sentences in several languages.

Ranked #2 on Cross-lingual zero-shot dependency parsing on Universal Dependency Treebank

Cross-lingual zero-shot dependency parsing POS

Paper
Code

Annotating Character Relationships in Literary Texts

no code implementations • 2 Dec 2015 • Philip Massey, Patrick Xia, David Bamman, Noah A. Smith

We present a dataset of manually annotated relationships between characters in literary texts, in order to support the training and evaluation of automatic methods for relation type prediction in this domain (Makazhanov et al., 2014; Kokkinakis, 2013) and the broader computational analysis of literary character (Elson et al., 2010; Bamman et al., 2014; Vala et al., 2015; Flekova and Gurevych, 2015).

Type prediction

Paper
Add Code

Segmental Recurrent Neural Networks

2 code implementations • 18 Nov 2015 • Lingpeng Kong, Chris Dyer, Noah A. Smith

Representations of the input segments (i. e., contiguous subsequences of the input) are computed by encoding their constituent tokens using bidirectional recurrent neural nets, and these "segment embeddings" are used to define compatibility scores with output labels.

Chinese Word Segmentation Handwriting Recognition +2