Search Results for author: Ryan Cotterell

Found 211 papers, 98 papers with code

SIGMORPHON–UniMorph 2022 Shared Task 0: Generalization and Typologically Diverse Morphological Inflection

1 code implementation • NAACL (SIGMORPHON) 2022 • Jordan Kodner, Salam Khalifa, Khuyagbaatar Batsuren, Hossep Dolatian, Ryan Cotterell, Faruk Akkus, Antonios Anastasopoulos, Taras Andrushko, Aryaman Arora, Nona Atanalov, Gábor Bella, Elena Budianskaya, Yustinus Ghanggo Ate, Omer Goldman, David Guriel, Simon Guriel, Silvia Guriel-Agiashvili, Witold Kieraś, Andrew Krizhanovsky, Natalia Krizhanovsky, Igor Marchenko, Magdalena Markowska, Polina Mashkovtseva, Maria Nepomniashchaya, Daria Rodionova, Karina Scheifer, Alexandra Sorova, Anastasia Yemelina, Jeremiah Young, Ekaterina Vylomova

The 2022 SIGMORPHON–UniMorph shared task on large scale morphological inflection generation included a wide range of typologically diverse languages: 33 languages from 11 top-level language families: Arabic (Modern Standard), Assamese, Braj, Chukchi, Eastern Armenian, Evenki, Georgian, Gothic, Gujarati, Hebrew, Hungarian, Itelmen, Karelian, Kazakh, Ket, Khalkha Mongolian, Kholosi, Korean, Lamahalot, Low German, Ludic, Magahi, Middle Low German, Old English, Old High German, Old Norse, Polish, Pomak, Slovak, Turkish, Upper Sorbian, Veps, and Xibe.

Morphological Inflection

Paper
Code

Conditional Poisson Stochastic Beams

no code implementations • EMNLP 2021 • Clara Meister, Afra Amini, Tim Vieira, Ryan Cotterell

Beam search is the default decoding strategy for many sequence generation tasks in NLP.

Paper
Add Code

A surprisal–duration trade-off across and within the world’s languages

1 code implementation • EMNLP 2021 • Tiago Pimentel, Clara Meister, Elizabeth Salesky, Simone Teufel, Damián Blasi, Ryan Cotterell

We thus conclude that there is strong evidence of a surprisal–duration trade-off in operation, both across and within the world’s languages.

Paper
Code

Efficient Sampling of Dependency Structure

1 code implementation • EMNLP 2021 • Ran Zmigrod, Tim Vieira, Ryan Cotterell

In this paper, we adapt two spanning tree sampling algorithms to faithfully sample dependency trees from a graph subject to the root constraint.

Paper
Code

The SIGTYP 2022 Shared Task on the Prediction of Cognate Reflexes

1 code implementation • NAACL (SIGTYP) 2022 • Johann-Mattis List, Ekaterina Vylomova, Robert Forkel, Nathan Hill, Ryan Cotterell

This study describes the structure and the results of the SIGTYP 2022 shared task on the prediction of cognate reflexes from multilingual wordlists.

Image Restoration

Paper
Code

High probability or low information? The probability–quality paradox in language generation

no code implementations • ACL 2022 • Clara Meister, Gian Wiher, Tiago Pimentel, Ryan Cotterell

When generating natural language from neural probabilistic models, high probability does not always coincide with high quality.

Text Generation

Paper
Add Code

SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages

no code implementations • ACL (SIGMORPHON) 2021 • Tiago Pimentel, Maria Ryskina, Sabrina J. Mielke, Shijie Wu, Eleanor Chodroff, Brian Leonard, Garrett Nicolai, Yustinus Ghanggo Ate, Salam Khalifa, Nizar Habash, Charbel El-Khaissi, Omer Goldman, Michael Gasser, William Lane, Matt Coler, Arturo Oncevay, Jaime Rafael Montoya Samame, Gema Celeste Silva Villegas, Adam Ek, Jean-Philippe Bernardy, Andrey Shcherbakov, Aziyana Bayyr-ool, Karina Sheifer, Sofya Ganieva, Matvey Plugaryov, Elena Klyachko, Ali Salehi, Andrew Krizhanovsky, Natalia Krizhanovsky, Clara Vania, Sardana Ivanova, Aelita Salchak, Christopher Straughn, Zoey Liu, Jonathan North Washington, Duygu Ataman, Witold Kieraś, Marcin Woliński, Totok Suhardijanto, Niklas Stoehr, Zahroh Nuriah, Shyam Ratan, Francis M. Tyers, Edoardo M. Ponti, Grant Aiton, Richard J. Hatcher, Emily Prud'hommeaux, Ritesh Kumar, Mans Hulden, Botond Barta, Dorina Lakatos, Gábor Szolnok, Judit Ács, Mohit Raj, David Yarowsky, Ryan Cotterell, Ben Ambridge, Ekaterina Vylomova

This year's iteration of the SIGMORPHON Shared Task on morphological reinflection focuses on typological diversity and cross-lingual variation of morphosyntactic features.

Paper
Add Code

On the Machine Learning of Ethical Judgments from Natural Language

no code implementations • NAACL 2022 • Zeerak Talat, Hagen Blix, Josef Valvoda, Maya Indira Ganesh, Ryan Cotterell, Adina Williams

Ethics is one of the longest standing intellectual endeavors of humanity.

BIG-bench Machine Learning Decision Making +1

Paper
Add Code

Measuring the Similarity of Grammatical Gender Systems by Comparing Partitions

no code implementations • EMNLP 2020 • Arya D. McCarthy, Adina Williams, Shijia Liu, David Yarowsky, Ryan Cotterell

Of particular interest, languages on the same branch of our phylogenetic tree are notably similar, whereas languages from separate branches are no more similar than chance.

Community Detection

Paper
Add Code

Low-Resource Named Entity Recognition with Cross-Lingual, Character-Level Neural Conditional Random Fields

no code implementations • IJCNLP 2017 • Ryan Cotterell, Kevin Duh

Low-resource named entity recognition is still an open problem in NLP.

Low Resource Named Entity Recognition named-entity-recognition +2

Paper
Add Code

Labeled Morphological Segmentation with Semi-Markov Models

no code implementations • CONLL 2015 • Ryan Cotterell, Thomas Müller, Alexander Fraser, Hinrich Schütze

We present labeled morphological segmentation, an alternative view of morphological processing that unifies several tasks.

Segmentation TAG

Paper
Add Code

[Call for Papers] The 2nd BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus

no code implementations • 9 Apr 2024 • Leshem Choshen, Ryan Cotterell, Michael Y. Hu, Tal Linzen, Aaron Mueller, Candace Ross, Alex Warstadt, Ethan Wilcox, Adina Williams, Chengxu Zhuang

The big changes for this year's competition are as follows: First, we replace the loose track with a paper track, which allows (for example) non-model-based submissions, novel cognitively-inspired benchmarks, or analysis techniques.

Paper
Add Code

Context versus Prior Knowledge in Language Models

no code implementations • 6 Apr 2024 • Kevin Du, Vésteinn Snæbjarnarson, Niklas Stoehr, Jennifer C. White, Aaron Schein, Ryan Cotterell

To answer a question, language models often need to integrate prior knowledge learned during pretraining and new information presented in context.

Paper
Add Code

The Role of $n$-gram Smoothing in the Age of Neural Networks

no code implementations • 25 Mar 2024 • Luca Malagutti, Andrius Buinovskij, Anej Svete, Clara Meister, Afra Amini, Ryan Cotterell

For nearly three decades, language models derived from the $n$-gram assumption held the state of the art on the task.

Language Modelling Machine Translation

Paper
Add Code

Towards Explainability in Legal Outcome Prediction Models

1 code implementation • 25 Mar 2024 • Josef Valvoda, Ryan Cotterell

Current legal outcome prediction models - a staple of legal NLP - do not explain their reasoning.

Paper
Code

On the Challenges and Opportunities in Generative AI

no code implementations • 28 Feb 2024 • Laura Manduchi, Kushagra Pandey, Robert Bamler, Ryan Cotterell, Sina Däubener, Sophie Fellenz, Asja Fischer, Thomas Gärtner, Matthias Kirchler, Marius Kloft, Yingzhen Li, Christoph Lippert, Gerard de Melo, Eric Nalisnick, Björn Ommer, Rajesh Ranganath, Maja Rudolph, Karen Ullrich, Guy Van Den Broeck, Julia E Vogt, Yixin Wang, Florian Wenzel, Frank Wood, Stephan Mandt, Vincent Fortuin

The field of deep generative modeling has grown rapidly and consistently over the years.

Paper
Add Code

A Theoretical Result on the Inductive Bias of RNN Language Models

no code implementations • 24 Feb 2024 • Anej Svete, Robin Shing Moon Chan, Ryan Cotterell

However, a closer inspection of Hewitt et al.'s (2020) construction shows that it is not limited to hierarchical LMs, posing the question of what \emph{other classes} of LMs can be efficiently represented by RNNs.

Inductive Bias

Paper
Add Code

What Changed? Converting Representational Interventions to Natural Language

no code implementations • 17 Feb 2024 • Matan Avitan, Ryan Cotterell, Yoav Goldberg, Shauli Ravfogel

Interventions targeting the representation space of language models (LMs) have emerged as effective means to influence model behavior.

counterfactual

Paper
Add Code

Direct Preference Optimization with an Offset

no code implementations • 16 Feb 2024 • Afra Amini, Tim Vieira, Ryan Cotterell

DPO, as originally formulated, relies on binary preference data and fine-tunes a language model to increase the likelihood of a preferred response over a dispreferred response.

Language Modelling

Paper
Add Code

MiMiC: Minimally Modified Counterfactuals in the Representation Space

no code implementations • 15 Feb 2024 • Shashwat Singh, Shauli Ravfogel, Jonathan Herzig, Roee Aharoni, Ryan Cotterell, Ponnurangam Kumaraguru

We demonstrate the effectiveness of the proposed approaches in mitigating bias in multiclass classification and in reducing the generation of toxic language, outperforming strong baselines.

Paper
Add Code

Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners?

no code implementations • 31 Jan 2024 • Andreas Opedal, Alessandro Stolfo, Haruki Shirakami, Ying Jiao, Ryan Cotterell, Bernhard Schölkopf, Abulhair Saparov, Mrinmaya Sachan

We find evidence that LLMs, with and without instruction-tuning, exhibit human-like biases in both the text-comprehension and the solution-planning steps of the solving process, but not during the final step which relies on the problem's arithmetic expressions (solution execution).

Reading Comprehension

Paper
Add Code

Principled Gradient-based Markov Chain Monte Carlo for Text Generation

no code implementations • 29 Dec 2023 • Li Du, Afra Amini, Lucas Torroba Hennigen, Xinyan Velocity Yu, Jason Eisner, Holden Lee, Ryan Cotterell

Recent papers have demonstrated the possibility of energy-based text generation by adapting gradient-based sampling algorithms, a paradigm of MCMC algorithms that promises fast convergence.

Language Modelling Text Generation

Paper
Add Code

Revisiting the Optimality of Word Lengths

no code implementations • 6 Dec 2023 • Tiago Pimentel, Clara Meister, Ethan Gotlieb Wilcox, Kyle Mahowald, Ryan Cotterell

Under this method, we find that a language's word lengths should instead be proportional to the surprisal's expectation plus its variance-to-mean ratio.

Paper
Add Code

The Ethics of Automating Legal Actors

no code implementations • 1 Dec 2023 • Josef Valvoda, Alec Thompson, Ryan Cotterell, Simone Teufel

The introduction of large public legal datasets has brought about a renaissance in legal NLP.

Ethics

Paper
Add Code

Grammatical Gender's Influence on Distributional Semantics: A Causal Perspective

no code implementations • 30 Nov 2023 • Karolina Stańczak, Kevin Du, Adina Williams, Isabelle Augenstein, Ryan Cotterell

However, when we control for the meaning of the noun, we find that grammatical gender has a near-zero effect on adjective choice, thereby calling the neo-Whorfian hypothesis into question.

Paper
Add Code

Quantifying the redundancy between prosody and text

1 code implementation • 28 Nov 2023 • Lukas Wolf, Tiago Pimentel, Evelina Fedorenko, Ryan Cotterell, Alex Warstadt, Ethan Wilcox, Tamar Regev

Using a large spoken corpus of English audiobooks, we extract prosodic features aligned to individual words and test how well they can be predicted from LLM embeddings, compared to non-contextual word embeddings.

Word Embeddings

Paper
Code

An Exploration of Left-Corner Transformations

no code implementations • 27 Nov 2023 • Andreas Opedal, Eleftheria Tsipidi, Tiago Pimentel, Ryan Cotterell, Tim Vieira

The left-corner transformation (Rosenkrantz and Lewis, 1970) is used to remove left recursion from context-free grammars, which is an important step towards making the grammar parsable top-down with simple techniques.

Paper
Add Code

Formal Aspects of Language Modeling

no code implementations • 7 Nov 2023 • Ryan Cotterell, Anej Svete, Clara Meister, Tianyu Liu, Li Du

Large language models have become one of the most commonly deployed NLP inventions.

Language Modelling

Paper
Add Code

Efficient Algorithms for Recognizing Weighted Tree-Adjoining Languages

no code implementations • 23 Oct 2023 • Alexandra Butoi, Tim Vieira, Ryan Cotterell, David Chiang

From these, we also immediately obtain stringsum and allsum algorithms for TAG, LIG, PAA, and EPDA.

TAG

Paper
Add Code

On the Representational Capacity of Recurrent Neural Language Models

1 code implementation • 19 Oct 2023 • Franz Nowak, Anej Svete, Li Du, Ryan Cotterell

We extend the Turing completeness result to the probabilistic case, showing how a rationally weighted RLM with unbounded computation time can simulate any deterministic probabilistic Turing machine (PTM) with rationally weighted transitions.

Paper
Code

Recurrent Neural Language Models as Probabilistic Finite-state Automata

1 code implementation • 8 Oct 2023 • Anej Svete, Ryan Cotterell

These results present a first step towards characterizing the classes of distributions RNN LMs can represent and thus help us understand their capabilities and limitations.

Paper
Code

An Analysis of On-the-fly Determinization of Finite-state Automata

no code implementations • 27 Aug 2023 • Ivan Baburin, Ryan Cotterell

In this paper we establish an abstraction of on-the-fly determinization of finite-state automata using transition monoids and demonstrate how it can be applied to bound the asymptotics.

Paper
Add Code

A Geometric Notion of Causal Probing

no code implementations • 27 Jul 2023 • Clément Guerner, Anej Svete, Tianyu Liu, Alexander Warstadt, Ryan Cotterell

The linear subspace hypothesis (Bolukbasi et al., 2016) states that, in a language model's representation space, all information about a concept such as verbal number is encoded in a linear subspace.

counterfactual Language Modelling

Paper
Add Code

Testing the Predictions of Surprisal Theory in 11 Languages

no code implementations • 7 Jul 2023 • Ethan Gotlieb Wilcox, Tiago Pimentel, Clara Meister, Ryan Cotterell, Roger P. Levy

We address this gap in the current literature by investigating the relationship between surprisal and reading times in eleven different languages, distributed across five language families.

Paper
Add Code

On the Efficacy of Sampling Adapters

1 code implementation • 7 Jul 2023 • Clara Meister, Tiago Pimentel, Luca Malagutti, Ethan G. Wilcox, Ryan Cotterell

While this trade-off is not reflected in standard metrics of distribution quality (such as perplexity), we find that several precision-emphasizing measures indeed indicate that sampling adapters can lead to probability distributions more aligned with the true distribution.

Text Generation

Paper
Code

Efficient Semiring-Weighted Earley Parsing

1 code implementation • 6 Jul 2023 • Andreas Opedal, Ran Zmigrod, Tim Vieira, Ryan Cotterell, Jason Eisner

This paper provides a reference description, in the form of a deduction system, of Earley's (1970) context-free parsing algorithm with various speed-ups.

Sentence

Paper
Code

Generalizing Backpropagation for Gradient-Based Interpretability

1 code implementation • 6 Jul 2023 • Kevin Du, Lucas Torroba Hennigen, Niklas Stoehr, Alexander Warstadt, Ryan Cotterell

Many popular feature-attribution methods for interpreting deep neural networks rely on computing the gradients of a model's output with respect to its inputs.

Paper
Code

A Formal Perspective on Byte-Pair Encoding

1 code implementation • 29 Jun 2023 • Vilém Zouhar, Clara Meister, Juan Luis Gastaldi, Li Du, Tim Vieira, Mrinmaya Sachan, Ryan Cotterell

Via submodular functions, we prove that the iterative greedy version is a $\frac{1}{{\sigma(\boldsymbol{\mu}^\star)}}(1-e^{-{\sigma(\boldsymbol{\mu}^\star)}})$-approximation of an optimal merge sequence, where ${\sigma(\boldsymbol{\mu}^\star)}$ is the total backward curvature with respect to the optimal merge sequence $\boldsymbol{\mu}^\star$.

Combinatorial Optimization

Paper
Code

Tokenization and the Noiseless Channel

1 code implementation • 29 Jun 2023 • Vilém Zouhar, Clara Meister, Juan Luis Gastaldi, Li Du, Mrinmaya Sachan, Ryan Cotterell

Subword tokenization is a key part of many NLP pipelines.

Machine Translation

Paper
Code

Hexatagging: Projective Dependency Parsing as Tagging

1 code implementation • 8 Jun 2023 • Afra Amini, Tianyu Liu, Ryan Cotterell

We introduce a novel dependency parser, the hexatagger, that constructs dependency trees by tagging the words in a sentence with elements from a finite set of possible tags.

Computational Efficiency Dependency Parsing +2

Paper
Code

Convergence and Diversity in the Control Hierarchy

no code implementations • 6 Jun 2023 • Alexandra Butoi, Ryan Cotterell, David Chiang

Furthermore, using an even stricter notion of equivalence called d-strong equivalence, we make precise the intuition that a CFG controlling a CFG is a TAG, a PDA controlling a PDA is an embedded PDA, and a PDA controlling a CFG is a LIG.

TAG

Paper
Add Code

A Cross-Linguistic Pressure for Uniform Information Density in Word Order

1 code implementation • 6 Jun 2023 • Thomas Hikaru Clark, Clara Meister, Tiago Pimentel, Michael Hahn, Ryan Cotterell, Richard Futrell, Roger Levy

Here, we ask whether a pressure for UID may have influenced word order patterns cross-linguistically.

counterfactual

Paper
Code

LEACE: Perfect linear concept erasure in closed form

1 code implementation • NeurIPS 2023 • Nora Belrose, David Schneider-Joseph, Shauli Ravfogel, Ryan Cotterell, Edward Raff, Stella Biderman

Concept erasure aims to remove specified features from a representation.

Fairness

186

Paper
Code

Structured Voronoi Sampling

1 code implementation • NeurIPS 2023 • Afra Amini, Li Du, Ryan Cotterell

In this paper, we take an important step toward building a principled approach for sampling from language models with gradient-based methods.

Text Generation

Paper
Code

Linear-Time Modeling of Linguistic Structure: An Order-Theoretic Perspective

no code implementations • 24 May 2023 • Tianyu Liu, Afra Amini, Mrinmaya Sachan, Ryan Cotterell

We show that these exhaustive comparisons can be avoided, and, moreover, the complexity of such tasks can be reduced to linear by casting the relation between tokens as a partial order over the string.

coreference-resolution Dependency Parsing +1

Paper
Add Code

All Roads Lead to Rome? Exploring the Invariance of Transformers' Representations

1 code implementation • 23 May 2023 • Yuxin Ren, Qipeng Guo, Zhijing Jin, Shauli Ravfogel, Mrinmaya Sachan, Bernhard Schölkopf, Ryan Cotterell

Transformer models bring propelling advances in various NLP tasks, thus inducing lots of interpretability research on the learned representations of the models.

Paper
Code

RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text

2 code implementations • 22 May 2023 • Wangchunshu Zhou, Yuchen Eleanor Jiang, Peng Cui, Tiannan Wang, Zhenxin Xiao, Yifan Hou, Ryan Cotterell, Mrinmaya Sachan

In addition to producing AI-generated content (AIGC), we also demonstrate the possibility of using RecurrentGPT as an interactive fiction that directly interacts with consumers.

Language Modelling Large Language Model

878

Paper
Code

Efficient Prompting via Dynamic In-Context Learning

no code implementations • 18 May 2023 • Wangchunshu Zhou, Yuchen Eleanor Jiang, Ryan Cotterell, Mrinmaya Sachan

To achieve this, we train a meta controller that predicts the number of in-context examples suitable for the generalist model to make a good prediction based on the performance-efficiency trade-off for a specific input.

In-Context Learning

Paper
Add Code

Discourse Centric Evaluation of Machine Translation with a Densely Annotated Parallel Corpus

1 code implementation • 18 May 2023 • Yuchen Eleanor Jiang, Tianyu Liu, Shuming Ma, Dongdong Zhang, Mrinmaya Sachan, Ryan Cotterell

Several recent papers claim human parity at sentence-level Machine Translation (MT), especially in high-resource languages.

Machine Translation Sentence +1

Paper
Code

Controlled Text Generation with Natural Language Instructions

1 code implementation • 27 Apr 2023 • Wangchunshu Zhou, Yuchen Eleanor Jiang, Ethan Wilcox, Ryan Cotterell, Mrinmaya Sachan

Large language models generate fluent texts and can follow natural language instructions to solve a wide range of tasks without task-specific training.

In-Context Learning Language Modelling +1

Paper
Code

Discriminative Class Tokens for Text-to-Image Diffusion Models

1 code implementation • ICCV 2023 • Idan Schwartz, Vésteinn Snæbjarnarson, Hila Chefer, Ryan Cotterell, Serge Belongie, Lior Wolf, Sagie Benaim

This approach has two disadvantages: (i) supervised datasets are generally small compared to large-scale scraped text-image datasets on which text-to-image models are trained, affecting the quality and diversity of the generated images, or (ii) the input is a hard-coded label, as opposed to free-form text, limiting the control over the generated images.

Paper
Code

Algorithms for Acyclic Weighted Finite-State Automata with Failure Arcs

1 code implementation • 17 Jan 2023 • Anej Svete, Benjamin Dayan, Tim Vieira, Ryan Cotterell, Jason Eisner

The pathsum in ordinary acyclic WFSAs is efficiently computed by the backward algorithm in time $O(|E|)$, where $E$ is the set of transitions.

Paper
Code

A Measure-Theoretic Characterization of Tight Language Models

no code implementations • 20 Dec 2022 • Li Du, Lucas Torroba Hennigen, Tiago Pimentel, Clara Meister, Jason Eisner, Ryan Cotterell

Language modeling, a central task in natural language processing, involves estimating a probability distribution over strings.

Language Modelling

Paper
Add Code

The Ordered Matrix Dirichlet for State-Space Models

1 code implementation • 8 Dec 2022 • Niklas Stoehr, Benjamin J. Radford, Ryan Cotterell, Aaron Schein

For discrete data, SSMs commonly do so through a state-to-action emission matrix and a state-to-state transition matrix.

Paper
Code

On the Effect of Anticipation on Reading Times

1 code implementation • 25 Nov 2022 • Tiago Pimentel, Clara Meister, Ethan G. Wilcox, Roger Levy, Ryan Cotterell

We assess the effect of anticipation on reading by comparing how well surprisal and contextual entropy predict reading times on four naturalistic reading datasets: two self-paced and two eye-tracking.

Paper
Code

Schrödinger's Bat: Diffusion Models Sometimes Generate Polysemous Words in Superposition

1 code implementation • 23 Nov 2022 • Jennifer C. White, Ryan Cotterell

Recent work has shown that despite their impressive capabilities, text-to-image diffusion models such as DALL-E 2 (Ramesh et al., 2022) can display strange behaviours when a prompt contains a word with multiple possible meanings, often generating images containing both senses of the word (Rassin et al., 2022).

Paper
Code

On Parsing as Tagging

1 code implementation • 14 Nov 2022 • Afra Amini, Ryan Cotterell

There have been many proposals to reduce constituency parsing to tagging in the literature.

Constituency Parsing

Paper
Code

The Architectural Bottleneck Principle

no code implementations • 11 Nov 2022 • Tiago Pimentel, Josef Valvoda, Niklas Stoehr, Ryan Cotterell

This shift in perspective leads us to propose a new principle for probing, the architectural bottleneck principle: In order to estimate how much information a given component could extract, a probe should look exactly like the component.

Open-Ended Question Answering

Paper
Add Code

Autoregressive Structured Prediction with Language Models

1 code implementation • 26 Oct 2022 • Tianyu Liu, Yuchen Jiang, Nicholas Monath, Ryan Cotterell, Mrinmaya Sachan

Recent years have seen a paradigm shift in NLP towards using pretrained language models ({PLM}) for a wide range of tasks.

Ranked #1 on Relation Extraction on CoNLL04 (RE+ Micro F1 metric)

Named Entity Recognition Named Entity Recognition (NER) +2

Paper
Code

A Bilingual Parallel Corpus with Discourse Annotations

1 code implementation • 26 Oct 2022 • Yuchen Eleanor Jiang, Tianyu Liu, Shuming Ma, Dongdong Zhang, Mrinmaya Sachan, Ryan Cotterell

The BWB corpus consists of Chinese novels translated by experts into English, and the annotated test set is designed to probe the ability of machine translation systems to model various discourse phenomena.

Document Level Machine Translation Machine Translation +2

Paper
Code

Investigating the Role of Centering Theory in the Context of Neural Coreference Resolution Systems

no code implementations • 26 Oct 2022 • Yuchen Eleanor Jiang, Ryan Cotterell, Mrinmaya Sachan

Our analysis further shows that contextualized embeddings contain much of the coherence information, which helps explain why CT can only provide little gains to modern neural coreference resolvers which make use of pretrained representations.

coreference-resolution World Knowledge

Paper
Add Code

Mutual Information Alleviates Hallucinations in Abstractive Summarization

2 code implementations • 24 Oct 2022 • Liam van der Poel, Ryan Cotterell, Clara Meister

Despite significant progress in the quality of language generated from abstractive summarization models, these models still exhibit the tendency to hallucinate, i. e., output content not supported by the source document.

Abstractive Text Summarization

Paper
Code

Log-linear Guardedness and its Implications

no code implementations • 18 Oct 2022 • Shauli Ravfogel, Yoav Goldberg, Ryan Cotterell

Methods for erasing human-interpretable concepts from neural representations that assume linearity have been found to be tractable and useful.

Paper
Add Code

Algorithms for Weighted Pushdown Automata

1 code implementation • 13 Oct 2022 • Alexandra Butoi, Brian DuSell, Tim Vieira, Ryan Cotterell, David Chiang

Weighted pushdown automata (WPDAs) are at the core of many natural language processing tasks, like syntax-based statistical machine translation and transition-based dependency parsing.

Machine Translation Transition-Based Dependency Parsing

Paper
Code

An Ordinal Latent Variable Model of Conflict Intensity

1 code implementation • 8 Oct 2022 • Niklas Stoehr, Lucas Torroba Hennigen, Josef Valvoda, Robert West, Ryan Cotterell, Aaron Schein

It is based only on the action category ("what") and disregards the subject ("who") and object ("to whom") of an event, as well as contextual information, like associated casualty count, that should contribute to the perception of an event's "intensity".

Event Extraction

Paper
Code

State-of-the-art generalisation research in NLP: A taxonomy and review

no code implementations • 6 Oct 2022 • Dieuwke Hupkes, Mario Giulianelli, Verna Dankers, Mikel Artetxe, Yanai Elazar, Tiago Pimentel, Christos Christodoulopoulos, Karim Lasri, Naomi Saphra, Arabella Sinclair, Dennis Ulmer, Florian Schottmann, Khuyagbaatar Batsuren, Kaiser Sun, Koustuv Sinha, Leila Khalatbari, Maria Ryskina, Rita Frieske, Ryan Cotterell, Zhijing Jin

We present a taxonomy for characterising and understanding generalisation research in NLP.

Paper
Add Code

Equivariant Transduction through Invariant Alignment

1 code implementation • COLING 2022 • Jennifer C. White, Ryan Cotterell

The ability to generalize compositionally is key to understanding the potentially infinite number of sentences that can be constructed in a human language from only a finite number of words.

Inductive Bias

Paper
Code

On the Intersection of Context-Free and Regular Languages

1 code implementation • 14 Sep 2022 • Clemente Pasti, Andreas Opedal, Tiago Pimentel, Tim Vieira, Jason Eisner, Ryan Cotterell

It shows, by a simple construction, that the intersection of a context-free language and a regular language is itself context-free.

Paper
Code

On the Role of Negative Precedent in Legal Outcome Prediction

1 code implementation • 17 Aug 2022 • Josef Valvoda, Ryan Cotterell, Simone Teufel

In contrast, we turn our focus to negative outcomes here, and introduce a new task of negative outcome prediction.

Paper
Code

Benchmarking Compositionality with Formal Languages

1 code implementation • COLING 2022 • Josef Valvoda, Naomi Saphra, Jonathan Rawski, Adina Williams, Ryan Cotterell

Recombining known primitive concepts into larger novel combinations is a quintessentially human cognitive capability.

Benchmarking Open-Ended Question Answering

Paper
Code

Visual Comparison of Language Model Adaptation

no code implementations • 17 Aug 2022 • Rita Sevastjanova, Eren Cakmak, Shauli Ravfogel, Ryan Cotterell, Mennatallah El-Assady

The simplicity of adapter training and composition comes along with new challenges, such as maintaining an overview of adapter properties and effectively comparing their produced embedding spaces.

Language Modelling

Paper
Add Code

Probing via Prompting

1 code implementation • NAACL 2022 • Jiaoda Li, Ryan Cotterell, Mrinmaya Sachan

We then examine the usefulness of a specific linguistic property for pre-training by removing the heads that are essential to that property and evaluating the resulting model's performance on language modeling.

Language Modelling

Paper
Code

The SIGMORPHON 2022 Shared Task on Morpheme Segmentation

1 code implementation • NAACL (SIGMORPHON) 2022 • Khuyagbaatar Batsuren, Gábor Bella, Aryaman Arora, Viktor Martinović, Kyle Gorman, Zdeněk Žabokrtský, Amarsanaa Ganbold, Šárka Dohnalová, Magda Ševčíková, Kateřina Pelegrinová, Fausto Giunchiglia, Ryan Cotterell, Ekaterina Vylomova

The SIGMORPHON 2022 shared task on morpheme segmentation challenged systems to decompose a word into a sequence of morphemes and covered most types of morphology: compounds, derivations, and inflections.

Ranked #8 on Morpheme Segmentaiton on UniMorph 4.0

Morpheme Segmentaiton Segmentation +1

Paper
Code

On the Usefulness of Embeddings, Clusters and Strings for Text Generator Evaluation

1 code implementation • 31 May 2022 • Tiago Pimentel, Clara Meister, Ryan Cotterell

As we show, however, this is not a tight approximation -- in either theory or practice.

Language Modelling Text Generation

Paper
Code

Naturalistic Causal Probing for Morpho-Syntax

1 code implementation • 14 May 2022 • Afra Amini, Tiago Pimentel, Clara Meister, Ryan Cotterell

Probing has become a go-to methodology for interpreting and analyzing deep neural models in natural language processing.

Sentence

Paper
Code

A Structured Span Selector

1 code implementation • NAACL 2022 • Tianyu Liu, Yuchen Eleanor Jiang, Ryan Cotterell, Mrinmaya Sachan

Many natural language processing tasks, e. g., coreference resolution and semantic role labeling, require selecting text spans and making decisions about them.

coreference-resolution Inductive Bias +1

Paper
Code

UniMorph 4.0: Universal Morphology

no code implementations • LREC 2022 • Khuyagbaatar Batsuren, Omer Goldman, Salam Khalifa, Nizar Habash, Witold Kieraś, Gábor Bella, Brian Leonard, Garrett Nicolai, Kyle Gorman, Yustinus Ghanggo Ate, Maria Ryskina, Sabrina J. Mielke, Elena Budianskaya, Charbel El-Khaissi, Tiago Pimentel, Michael Gasser, William Lane, Mohit Raj, Matt Coler, Jaime Rafael Montoya Samame, Delio Siticonatzi Camaiteri, Benoît Sagot, Esaú Zumaeta Rojas, Didier López Francis, Arturo Oncevay, Juan López Bautista, Gema Celeste Silva Villegas, Lucas Torroba Hennigen, Adam Ek, David Guriel, Peter Dirix, Jean-Philippe Bernardy, Andrey Scherbakov, Aziyana Bayyr-ool, Antonios Anastasopoulos, Roberto Zariquiey, Karina Sheifer, Sofya Ganieva, Hilaria Cruz, Ritván Karahóǧa, Stella Markantonatou, George Pavlidis, Matvey Plugaryov, Elena Klyachko, Ali Salehi, Candy Angulo, Jatayu Baxi, Andrew Krizhanovsky, Natalia Krizhanovskaya, Elizabeth Salesky, Clara Vania, Sardana Ivanova, Jennifer White, Rowan Hall Maudslay, Josef Valvoda, Ran Zmigrod, Paula Czarnowska, Irene Nikkarinen, Aelita Salchak, Brijesh Bhatt, Christopher Straughn, Zoey Liu, Jonathan North Washington, Yuval Pinter, Duygu Ataman, Marcin Wolinski, Totok Suhardijanto, Anna Yablonskaya, Niklas Stoehr, Hossep Dolatian, Zahroh Nuriah, Shyam Ratan, Francis M. Tyers, Edoardo M. Ponti, Grant Aiton, Aryaman Arora, Richard J. Hatcher, Ritesh Kumar, Jeremiah Young, Daria Rodionova, Anastasia Yemelina, Taras Andrushko, Igor Marchenko, Polina Mashkovtseva, Alexandra Serova, Emily Prud'hommeaux, Maria Nepomniashchaya, Fausto Giunchiglia, Eleanor Chodroff, Mans Hulden, Miikka Silfverberg, Arya D. McCarthy, David Yarowsky, Ryan Cotterell, Reut Tsarfaty, Ekaterina Vylomova

The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation and a type-level resource of annotated data in diverse languages realizing that schema.

Morphological Inflection

Paper
Add Code

Same Neurons, Different Languages: Probing Morphosyntax in Multilingual Pre-trained Models

1 code implementation • NAACL 2022 • Karolina Stańczak, Edoardo Ponti, Lucas Torroba Hennigen, Ryan Cotterell, Isabelle Augenstein

The success of multilingual pre-trained models is underpinned by their ability to learn representations shared by multiple languages even in absence of any explicit supervision.

Paper
Code

Exact Paired-Permutation Testing for Structured Test Statistics

1 code implementation • NAACL 2022 • Ran Zmigrod, Tim Vieira, Ryan Cotterell

However, practitioners rely on Monte Carlo approximation to perform this test due to a lack of a suitable exact algorithm.

Paper
Code

Probing for the Usage of Grammatical Number

no code implementations • ACL 2022 • Karim Lasri, Tiago Pimentel, Alessandro Lenci, Thierry Poibeau, Ryan Cotterell

We also find that BERT uses a separate encoding of grammatical number for nouns and verbs.

Paper
Add Code

Estimating the Entropy of Linguistic Distributions

no code implementations • ACL 2022 • Aryaman Arora, Clara Meister, Ryan Cotterell

Shannon entropy is often a quantity of interest to linguists studying the communicative capacity of human language.

Paper
Add Code

Analyzing Wrap-Up Effects through an Information-Theoretic Lens

no code implementations • ACL 2022 • Clara Meister, Tiago Pimentel, Thomas Hikaru Clark, Ryan Cotterell, Roger Levy

Numerous analyses of reading time (RT) data have been implemented -- all in an effort to better understand the cognitive processes driving reading comprehension.

Reading Comprehension Sentence

Paper
Add Code

On the probability-quality paradox in language generation

no code implementations • 31 Mar 2022 • Clara Meister, Gian Wiher, Tiago Pimentel, Ryan Cotterell

Specifically, we posit that human-like language should contain an amount of information (quantified as negative log-probability) that is close to the entropy of the distribution over natural strings.

Text Generation

Paper
Add Code

On Decoding Strategies for Neural Text Generators

no code implementations • 29 Mar 2022 • Gian Wiher, Clara Meister, Ryan Cotterell

For example, the nature of the diversity-quality trade-off in language generation is very task-specific; the length bias often attributed to beam search is not constant across tasks.

Machine Translation Story Generation

Paper
Add Code

Locally Typical Sampling

3 code implementations • 1 Feb 2022 • Clara Meister, Tiago Pimentel, Gian Wiher, Ryan Cotterell

Automatic and human evaluations show that, in comparison to nucleus and top-k sampling, locally typical sampling offers competitive performance (in both abstractive summarization and story generation) in terms of quality while consistently reducing degenerate repetitions.

Abstractive Text Summarization Story Generation

825

Paper
Code

Linear Adversarial Concept Erasure

2 code implementations • 28 Jan 2022 • Shauli Ravfogel, Michael Twiton, Yoav Goldberg, Ryan Cotterell

Modern neural models trained on textual data rely on pre-trained representations that emerge without direct supervision.

Paper
Code

Kernelized Concept Erasure

1 code implementation • 28 Jan 2022 • Shauli Ravfogel, Francisco Vargas, Yoav Goldberg, Ryan Cotterell

One prominent approach for the identification of concepts in neural representations is searching for a linear subspace whose erasure prevents the prediction of the concept from the representations.

Paper
Code

A Latent-Variable Model for Intrinsic Probing

2 code implementations • 20 Jan 2022 • Karolina Stańczak, Lucas Torroba Hennigen, Adina Williams, Ryan Cotterell, Isabelle Augenstein

The success of pre-trained contextualized representations has prompted researchers to analyze them for the presence of linguistic information.

Attribute

Paper
Code

A Word on Machine Ethics: A Response to Jiang et al. (2021)

no code implementations • 7 Nov 2021 • Zeerak Talat, Hagen Blix, Josef Valvoda, Maya Indira Ganesh, Ryan Cotterell, Adina Williams

Ethics is one of the longest standing intellectual endeavors of humanity.

Ethics

Paper
Add Code

Probing as Quantifying Inductive Bias

1 code implementation • ACL 2022 • Alexander Immer, Lucas Torroba Hennigen, Vincent Fortuin, Ryan Cotterell

Such performance improvements have motivated researchers to quantify and understand the linguistic information encoded in these representations.

Bayesian Inference Inductive Bias

Paper
Code

A surprisal--duration trade-off across and within the world's languages

1 code implementation • 30 Sep 2021 • Tiago Pimentel, Clara Meister, Elizabeth Salesky, Simone Teufel, Damián Blasi, Ryan Cotterell

We thus conclude that there is strong evidence of a surprisal--duration trade-off in operation, both across and within the world's languages.

Paper
Code

On Homophony and Rényi Entropy

1 code implementation • EMNLP 2021 • Tiago Pimentel, Clara Meister, Simone Teufel, Ryan Cotterell

Homophony's widespread presence in natural languages is a controversial topic.

Paper
Code

Classifying Dyads for Militarized Conflict Analysis

1 code implementation • EMNLP 2021 • Niklas Stoehr, Lucas Torroba Hennigen, Samin Ahbab, Robert West, Ryan Cotterell

We do this by devising a set of textual and graph-based features which represent each of the causes.

Edge Classification

Paper
Code

Revisiting the Uniform Information Density Hypothesis

no code implementations • EMNLP 2021 • Clara Meister, Tiago Pimentel, Patrick Haller, Lena Jäger, Ryan Cotterell, Roger Levy

The uniform information density (UID) hypothesis posits a preference among language users for utterances structured such that information is distributed uniformly across a signal.

Linguistic Acceptability Sentence

Paper
Add Code

Conditional Poisson Stochastic Beam Search

1 code implementation • 22 Sep 2021 • Clara Meister, Afra Amini, Tim Vieira, Ryan Cotterell

In this work, we propose a new method for turning beam search into a stochastic process: Conditional Poisson stochastic beam search.

Paper
Code

A Plug-and-Play Method for Controlled Text Generation

1 code implementation • Findings (EMNLP) 2021 • Damian Pascual, Beni Egressy, Clara Meister, Ryan Cotterell, Roger Wattenhofer

Large pre-trained language models have repeatedly shown their ability to produce fluent text.

Sentence Story Generation

Paper
Code

Efficient Sampling of Dependency Structures

no code implementations • 14 Sep 2021 • Ran Zmigrod, Tim Vieira, Ryan Cotterell

Colbourn (1996)'s sampling algorithm has a running time of $\mathcal{O}(N^3)$, which is often greater than the mean hitting time of a directed graph.

Paper
Add Code

Searching for More Efficient Dynamic Programs

no code implementations • Findings (EMNLP) 2021 • Tim Vieira, Ryan Cotterell, Jason Eisner

To this end, we describe a set of program transformations, a simple metric for assessing the efficiency of a transformed program, and a heuristic search procedure to improve this metric.

Paper
Add Code

A Bayesian Framework for Information-Theoretic Probing

1 code implementation • EMNLP 2021 • Tiago Pimentel, Ryan Cotterell

Pimentel et al. (2020) recently analysed probing from an information-theoretic perspective.

Paper
Code

Differentiable Subset Pruning of Transformer Heads

2 code implementations • 10 Aug 2021 • Jiaoda Li, Ryan Cotterell, Mrinmaya Sachan

Multi-head attention, a collection of several attention mechanisms that independently attend to different parts of the input, is the key ingredient in the Transformer.

Machine Translation Natural Language Inference +1

Paper
Code

Towards Zero-shot Language Modeling

no code implementations • IJCNLP 2019 • Edoardo Maria Ponti, Ivan Vulić, Ryan Cotterell, Roi Reichart, Anna Korhonen

Motivated by this question, we aim at constructing an informative prior over neural weights, in order to adapt quickly to held-out languages in the task of character-level language modeling.

Language Modelling

Paper
Add Code

On Finding the K-best Non-projective Dependency Trees

1 code implementation • ACL 2021 • Ran Zmigrod, Tim Vieira, Ryan Cotterell

Furthermore, we present a novel extension of the algorithm for decoding the K-best dependency trees of a graph which are subject to a root constraint.

Dependency Parsing Sentence

Paper
Code

Determinantal Beam Search

no code implementations • ACL 2021 • Clara Meister, Martina Forster, Ryan Cotterell

Beam search is a go-to strategy for decoding neural sequence models.

Point Processes Text Generation

Paper
Add Code

SIGTYP 2021 Shared Task: Robust Spoken Language Identification

no code implementations • NAACL (SIGTYP) 2021 • Elizabeth Salesky, Badr M. Abdullah, Sabrina J. Mielke, Elena Klyachko, Oleg Serikov, Edoardo Ponti, Ritesh Kumar, Ryan Cotterell, Ekaterina Vylomova

While language identification is a fundamental speech and language processing task, for many languages and language families it remains a challenging task.

Domain Adaptation Language Identification +1

Paper
Add Code

Do Syntactic Probes Probe Syntax? Experiments with Jabberwocky Probing

no code implementations • NAACL 2021 • Rowan Hall Maudslay, Ryan Cotterell

One method of doing so, which is frequently cited to support the claim that models like BERT encode syntax, is called probing; probes are small supervised models trained to extract linguistic information from another model's output.

Paper
Add Code

Modeling the Unigram Distribution

1 code implementation • Findings (ACL) 2021 • Irene Nikkarinen, Tiago Pimentel, Damián E. Blasi, Ryan Cotterell

The unigram distribution is the non-contextual probability of finding a specific word form in a corpus.

Paper
Code

Examining the Inductive Bias of Neural Language Models with Artificial Languages

1 code implementation • ACL 2021 • Jennifer C. White, Ryan Cotterell

Since language models are used to model a wide variety of languages, it is natural to ask whether the neural architectures used for the task have inductive biases towards modeling particular types of languages.

Inductive Bias

Paper
Code

Is Sparse Attention more Interpretable?

no code implementations • ACL 2021 • Clara Meister, Stefan Lazov, Isabelle Augenstein, Ryan Cotterell

Sparse attention has been claimed to increase model interpretability under the assumption that it highlights influential inputs.

text-classification Text Classification

Paper
Add Code

Higher-order Derivatives of Weighted Finite-state Machines

1 code implementation • ACL 2021 • Ran Zmigrod, Tim Vieira, Ryan Cotterell

In the case of second-order derivatives, our scheme runs in the optimal $\mathcal{O}(A^2 N^4)$ time where $A$ is the alphabet size and $N$ is the number of states.

Paper
Code

On Finding the $K$-best Non-projective Dependency Trees

1 code implementation • 1 Jun 2021 • Ran Zmigrod, Tim Vieira, Ryan Cotterell

Furthermore, we present a novel extension of the algorithm for decoding the $K$-best dependency trees of a graph which are subject to a root constraint.

Dependency Parsing Sentence

Paper
Code

Language Model Evaluation Beyond Perplexity

no code implementations • ACL 2021 • Clara Meister, Ryan Cotterell

As concrete examples, text generated under the nucleus sampling scheme adheres more closely to the type--token relationship of natural language than text produced using standard ancestral sampling; text from LSTMs reflects the natural language distributions over length, stopwords, and symbols surprisingly well.

Language Modelling

Paper
Add Code

A Non-Linear Structural Probe

no code implementations • NAACL 2021 • Jennifer C. White, Tiago Pimentel, Naomi Saphra, Ryan Cotterell

Probes are models devised to investigate the encoding of knowledge -- e. g. syntactic structure -- in contextual representations.

Paper
Add Code

A Cognitive Regularizer for Language Modeling

no code implementations • ACL 2021 • Jason Wei, Clara Meister, Ryan Cotterell

The uniform information density (UID) hypothesis, which posits that speakers behaving optimally tend to distribute information uniformly across a linguistic signal, has gained traction in psycholinguistics as an explanation for certain syntactic, morphological, and prosodic choices.

Inductive Bias Language Modelling

Paper
Add Code

How (Non-)Optimal is the Lexicon?

no code implementations • NAACL 2021 • Tiago Pimentel, Irene Nikkarinen, Kyle Mahowald, Ryan Cotterell, Damián Blasi

Examining corpora from 7 typologically diverse languages, we use those upper bounds to quantify the lexicon's optimality and to explore the relative costs of major constraints on natural codes.

Paper
Add Code

Quantifying Gender Bias Towards Politicians in Cross-Lingual Language Models

1 code implementation • 15 Apr 2021 • Karolina Stańczak, Sagnik Ray Choudhury, Tiago Pimentel, Ryan Cotterell, Isabelle Augenstein

Recent research has demonstrated that large pre-trained language models reflect societal biases expressed in natural language.

Language Modelling Probing Language Models

Paper
Code

Finding Concept-specific Biases in Form--Meaning Associations

2 code implementations • NAACL 2021 • Tiago Pimentel, Brian Roark, Søren Wichmann, Ryan Cotterell, Damián Blasi

It is not a new idea that there are small, cross-linguistic associations between the forms and meanings of words.

Paper
Code

BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation

2 code implementations • NAACL 2022 • Yuchen Eleanor Jiang, Tianyu Liu, Shuming Ma, Dongdong Zhang, Jian Yang, Haoyang Huang, Rico Sennrich, Ryan Cotterell, Mrinmaya Sachan, Ming Zhou

Standard automatic metrics, e. g. BLEU, are not reliable for document-level MT evaluation.

Document Level Machine Translation Machine Translation +2

Paper
Code

Searching for Search Errors in Neural Morphological Inflection

no code implementations • EACL 2021 • Martina Forster, Clara Meister, Ryan Cotterell

Yet, on word-level tasks, exact inference of these models reveals the empty string is often the global optimum.

Morphological Inflection Text Generation

Paper
Add Code

Differentiable Generative Phonology

1 code implementation • 10 Feb 2021 • Shijie Wu, Edoardo Maria Ponti, Ryan Cotterell

As the main contribution of our work, we implement the phonological generative system as a neural model differentiable end-to-end, rather than as a set of rules or constraints.

Paper
Code

Disambiguatory Signals are Stronger in Word-initial Positions

1 code implementation • EACL 2021 • Tiago Pimentel, Ryan Cotterell, Brian Roark

Psycholinguistic studies of human word processing and lexical access provide ample evidence of the preferred nature of word-initial versus word-final segments, e. g., in terms of attention paid by listeners (greater) or the likelihood of reduction by speakers (lower).

Informativeness

Paper
Code

Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-Language BERTs

3 code implementations • 30 Nov 2020 • Emanuele Bugliarello, Ryan Cotterell, Naoaki Okazaki, Desmond Elliott

Large-scale pretraining and task-specific fine-tuning is now the standard methodology for many tasks in computer vision and natural language processing.

111

Paper
Code

Morphologically Aware Word-Level Translation

no code implementations • COLING 2020 • Paula Czarnowska, Sebastian Ruder, Ryan Cotterell, Ann Copestake

We propose a novel morphologically aware probability model for bilingual lexicon induction, which jointly models lexeme translation and inflectional morphology in a structured way.

Bilingual Lexicon Induction Translation

Paper
Add Code

SIGTYP 2020 Shared Task: Prediction of Typological Features

no code implementations • EMNLP (SIGTYP) 2020 • Johannes Bjerva, Elizabeth Salesky, Sabrina J. Mielke, Aditi Chaudhary, Giuseppe G. A. Celano, Edoardo M. Ponti, Ekaterina Vylomova, Ryan Cotterell, Isabelle Augenstein

Typological knowledge bases (KBs) such as WALS (Dryer and Haspelmath, 2013) contain information about linguistic properties of the world's languages.

Cross-Lingual Transfer Transfer Learning

Paper
Add Code

Investigating Cross-Linguistic Adjective Ordering Tendencies with a Latent-Variable Model

no code implementations • EMNLP 2020 • Jun Yen Leung, Guy Emerson, Ryan Cotterell

Across languages, multiple consecutive adjectives modifying a noun (e. g. "the big red dog") follow certain unmarked ordering rules.

Paper
Add Code

Intrinsic Probing through Dimension Selection

1 code implementation • EMNLP 2020 • Lucas Torroba Hennigen, Adina Williams, Ryan Cotterell

Most modern NLP systems make use of pre-trained contextual representations that attain astonishingly high performance on a variety of tasks.

Word Embeddings

Paper
Code

Please Mind the Root: Decoding Arborescences for Dependency Parsing

1 code implementation • EMNLP 2020 • Ran Zmigrod, Tim Vieira, Ryan Cotterell

The connection between dependency trees and spanning trees is exploited by the NLP community to train and to decode graph-based dependency parsers.

Dependency Parsing

Paper
Code

If beam search is the answer, what was the question?

1 code implementation • EMNLP 2020 • Clara Meister, Tim Vieira, Ryan Cotterell

This implies that the MAP objective alone does not express the properties we desire in text, which merits the question: if beam search is the answer, what was the question?

Machine Translation Text Generation +1

Paper
Code

Speakers Fill Lexical Semantic Gaps with Context

1 code implementation • EMNLP 2020 • Tiago Pimentel, Rowan Hall Maudslay, Damián Blasi, Ryan Cotterell

For a language to be clear and efficiently encoded, we posit that the lexical ambiguity of a word type should correlate with how much information context provides about it, on average.

Paper
Code

Pareto Probing: Trading Off Accuracy for Complexity

1 code implementation • EMNLP 2020 • Tiago Pimentel, Naomi Saphra, Adina Williams, Ryan Cotterell

In our contribution to this discussion, we argue for a probe metric that reflects the fundamental trade-off between probe complexity and performance: the Pareto hypervolume.

Dependency Parsing

Paper
Code

Exploring the Linear Subspace Hypothesis in Gender Bias Mitigation

1 code implementation • EMNLP 2020 • Francisco Vargas, Ryan Cotterell

Their method takes pre-trained word embeddings as input and attempts to isolate a linear subspace that captures most of the gender bias in the embeddings.

Word Embeddings

Paper
Code

Efficient Computation of Expectations under Spanning Tree Distributions

no code implementations • 29 Aug 2020 • Ran Zmigrod, Tim Vieira, Ryan Cotterell

We propose unified algorithms for the important cases of first-order expectations and second-order expectations in edge-factored, non-projective spanning-tree models.

Sentence

Paper
Add Code

Best-First Beam Search

1 code implementation • 8 Jul 2020 • Clara Meister, Tim Vieira, Ryan Cotterell

Decoding for many NLP tasks requires an effective heuristic algorithm for approximating exact search since the problem of searching the full output space is often intractable, or impractical in many settings.

Paper
Code

Metaphor Detection using Context and Concreteness

no code implementations • WS 2020 • Rowan Hall Maudslay, Tiago Pimentel, Ryan Cotterell, Simone Teufel

We report the results of our system on the Metaphor Detection Shared Task at the Second Workshop on Figurative Language Processing 2020.

Paper
Add Code

SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection

1 code implementation • WS 2020 • Ekaterina Vylomova, Jennifer White, Elizabeth Salesky, Sabrina J. Mielke, Shijie Wu, Edoardo Ponti, Rowan Hall Maudslay, Ran Zmigrod, Josef Valvoda, Svetlana Toldova, Francis Tyers, Elena Klyachko, Ilya Yegorov, Natalia Krizhanovsky, Paula Czarnowska, Irene Nikkarinen, Andrew Krizhanovsky, Tiago Pimentel, Lucas Torroba Hennigen, Christo Kirov, Garrett Nicolai, Adina Williams, Antonios Anastasopoulos, Hilaria Cruz, Eleanor Chodroff, Ryan Cotterell, Miikka Silfverberg, Mans Hulden

Systems were developed using data from 45 languages and just 5 language families, fine-tuned with data from an additional 45 languages and 10 language families (13 in total), and evaluated on all 90 languages.

Hallucination Morphological Inflection

Paper
Code

A Corpus for Large-Scale Phonetic Typology

no code implementations • ACL 2020 • Elizabeth Salesky, Eleanor Chodroff, Tiago Pimentel, Matthew Wiesner, Ryan Cotterell, Alan W. black, Jason Eisner

A major hurdle in data-driven research on typology is having sufficient data in many languages to draw meaningful conclusions.

Paper
Add Code

Applying the Transformer to Character-level Transduction

2 code implementations • EACL 2021 • Shijie Wu, Ryan Cotterell, Mans Hulden

The transformer has been shown to outperform recurrent neural network-based sequence-to-sequence models in various word-level NLP tasks.

Morphological Inflection Transliteration

Paper
Code

Phonotactic Complexity and its Trade-offs

1 code implementation • TACL 2020 • Tiago Pimentel, Brian Roark, Ryan Cotterell

We present methods for calculating a measure of phonotactic complexity---bits per phoneme---that permits a straightforward cross-linguistic comparison.

Paper
Code

It's Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information

1 code implementation • ACL 2020 • Emanuele Bugliarello, Sabrina J. Mielke, Antonios Anastasopoulos, Ryan Cotterell, Naoaki Okazaki

The performance of neural machine translation systems is commonly evaluated in terms of BLEU.

Machine Translation NMT +1

Paper
Code

The Paradigm Discovery Problem

1 code implementation • ACL 2020 • Alexander Erdmann, Micha Elsner, Shijie Wu, Ryan Cotterell, Nizar Habash

Our benchmark system first makes use of word embeddings and string similarity to cluster forms by cell and by paradigm.

Clustering Word Embeddings

Paper
Code

A Tale of a Probe and a Parser

1 code implementation • ACL 2020 • Rowan Hall Maudslay, Josef Valvoda, Tiago Pimentel, Adina Williams, Ryan Cotterell

One such probe is the structural probe (Hewitt and Manning, 2019), designed to quantify the extent to which syntactic information is encoded in contextualised word representations.

Contextualised Word Representations

Paper
Code

On the Relationships Between the Grammatical Genders of Inanimate Nouns and Their Co-Occurring Adjectives and Verbs

no code implementations • 3 May 2020 • Adina Williams, Ryan Cotterell, Lawrence Wolf-Sonkin, Damián Blasi, Hanna Wallach

We also find that there are statistically significant relationships between the grammatical genders of inanimate nouns and the verbs that take those nouns as direct objects, as indirect objects, and as subjects.

Paper
Add Code

Generalized Entropy Regularization or: There's Nothing Special about Label Smoothing

no code implementations • ACL 2020 • Clara Meister, Elizabeth Salesky, Ryan Cotterell

Prior work has explored directly regularizing the output distributions of probabilistic models to alleviate peaky (i. e. over-confident) predictions, a common sign of overfitting.

Text Generation

Paper
Add Code

UniMorph 3.0: Universal Morphology

no code implementations • LREC 2020 • Arya D. McCarthy, Christo Kirov, Matteo Grella, Amrit Nidhi, Patrick Xia, Kyle Gorman, Ekaterina Vylomova, Sabrina J. Mielke, Garrett Nicolai, Miikka Silfverberg, Timofey Arkhangelskiy, Nataly Krizhanovsky, Andrew Krizhanovsky, Elena Klyachko, Alexey Sorokin, John Mansfield, Valts Ern{\v{s}}treits, Yuval Pinter, Cass Jacobs, ra L., Ryan Cotterell, Mans Hulden, David Yarowsky

Paper
Add Code

Predicting Declension Class from Form and Meaning

1 code implementation • ACL 2020 • Adina Williams, Tiago Pimentel, Arya D. McCarthy, Hagen Blix, Eleanor Chodroff, Ryan Cotterell

We find for two Indo-European languages (Czech and German) that form and meaning respectively share significant amounts of information with class (and contribute additional information above and beyond gender).

Paper
Code

Information-Theoretic Probing for Linguistic Structure

1 code implementation • ACL 2020 • Tiago Pimentel, Josef Valvoda, Rowan Hall Maudslay, Ran Zmigrod, Adina Williams, Ryan Cotterell

The success of neural networks on a diverse set of NLP tasks has led researchers to question how much these networks actually ``know'' about natural language.

Word Embeddings

Paper
Code

Parameter Space Factorization for Zero-Shot Learning across Tasks and Languages

1 code implementation • 30 Jan 2020 • Edoardo M. Ponti, Ivan Vulić, Ryan Cotterell, Marinela Parovic, Roi Reichart, Anna Korhonen

In this work, we propose a Bayesian generative model for the space of neural parameters.

named-entity-recognition Named Entity Recognition +7

Paper
Code

Morphological Segmentation Inside-Out

no code implementations • EMNLP 2016 • Ryan Cotterell, Arun Kumar, Hinrich Schütze

Morphological segmentation has traditionally been modeled with non-hierarchical models, which yield flat segmentations as output.

Morphological Analysis Segmentation

Paper
Add Code

Weird Inflects but OK: Making Sense of Morphological Generation Errors

no code implementations • CONLL 2019 • Kyle Gorman, Arya D. McCarthy, Ryan Cotterell, Ekaterina Vylomova, Miikka Silfverberg, Magdalena Markowska

We conduct a manual error analysis of the CoNLL-SIGMORPHON Shared Task on Morphological Reinflection.

Text Generation

Paper
Add Code

Quantifying the Semantic Core of Gender Systems

no code implementations • IJCNLP 2019 • Adina Williams, Ryan Cotterell, Lawrence Wolf-Sonkin, Damián Blasi, Hanna Wallach

To that end, we use canonical correlation analysis to correlate the grammatical gender of inanimate nouns with an externally grounded definition of their lexical semantics.

Paper
Add Code

The SIGMORPHON 2019 Shared Task: Morphological Analysis in Context and Cross-Lingual Transfer for Inflection

no code implementations • WS 2019 • Arya D. McCarthy, Ekaterina Vylomova, Shijie Wu, Chaitanya Malaviya, Lawrence Wolf-Sonkin, Garrett Nicolai, Christo Kirov, Miikka Silfverberg, Sabrina J. Mielke, Jeffrey Heinz, Ryan Cotterell, Mans Hulden

The SIGMORPHON 2019 shared task on cross-lingual transfer and contextual analysis in morphology examined transfer learning of inflection between 100 language pairs, as well as contextual lemmatization and morphosyntactic description in 66 languages.

Cross-Lingual Transfer Lemmatization +3

Paper
Add Code

Don't Forget the Long Tail! A Comprehensive Analysis of Morphological Generalization in Bilingual Lexicon Induction

no code implementations • IJCNLP 2019 • Paula Czarnowska, Sebastian Ruder, Edouard Grave, Ryan Cotterell, Ann Copestake

Human translators routinely have to translate rare inflections of words - due to the Zipfian distribution of words in a language.

Bilingual Lexicon Induction Translation

Paper
Add Code

Examining Gender Bias in Languages with Grammatical Gender

1 code implementation • IJCNLP 2019 • Pei Zhou, Weijia Shi, Jieyu Zhao, Kuan-Hao Huang, Muhao Chen, Ryan Cotterell, Kai-Wei Chang

Recent studies have shown that word embeddings exhibit gender bias inherited from the training corpora.

Translation Word Embeddings +2

Paper
Code

It's All in the Name: Mitigating Gender Bias with Name-Based Counterfactual Data Substitution

no code implementations • IJCNLP 2019 • Rowan Hall Maudslay, Hila Gonen, Ryan Cotterell, Simone Teufel

An alternative approach is Counterfactual Data Augmentation (CDA), in which a corpus is duplicated and augmented to remove bias, e. g. by swapping all inherently-gendered words in the copy.

counterfactual Data Augmentation +1

Paper
Add Code

Rethinking Phonotactic Complexity

no code implementations • WS 2019 • Tiago Pimentel, Brian Roark, Ryan Cotterell

In this work, we propose the use of phone-level language models to estimate phonotactic complexity{---}measured in bits per phoneme{---}which makes cross-linguistic comparison straightforward.

Paper
Add Code

Morphological Word Embeddings

no code implementations • 4 Jul 2019 • Ryan Cotterell, Hinrich Schütze

Linguistic similarity is multi-faceted.

Word Embeddings

Paper
Add Code

On the Distribution of Deep Clausal Embeddings: A Large Cross-linguistic Study

no code implementations • ACL 2019 • Damian Blasi, Ryan Cotterell, Lawrence Wolf-Sonkin, Sabine Stoll, Balthasar Bickel, Marco Baroni

Embedding a clause inside another ({``}the girl [who likes cars [that run fast]] has arrived{''}) is a fundamental resource that has been argued to be a key driver of linguistic expressiveness.

Paper
Add Code

Morphological Irregularity Correlates with Frequency

1 code implementation • ACL 2019 • Shijie Wu, Ryan Cotterell, Timothy J. O'Donnell

We present a study of morphological irregularity.

Paper
Code

Uncovering Probabilistic Implications in Typological Knowledge Bases

no code implementations • ACL 2019 • Johannes Bjerva, Yova Kementchedjhieva, Ryan Cotterell, Isabelle Augenstein

The study of linguistic typology is rooted in the implications we find between linguistic features, such as the fact that languages with object-verb word ordering tend to have post-positions.

Knowledge Base Population

Paper
Add Code

Meaning to Form: Measuring Systematicity as Information

1 code implementation • ACL 2019 • Tiago Pimentel, Arya D. McCarthy, Damián E. Blasi, Brian Roark, Ryan Cotterell

A longstanding debate in semiotics centers on the relationship between linguistic signs and their corresponding semantics: is there an arbitrary relationship between a word form and its meaning, or does some systematic phenomenon pervade?

Paper
Code

Unsupervised Discovery of Gendered Language through Latent-Variable Modeling

no code implementations • ACL 2019 • Alexander Hoyle, Wolf-Sonkin, Hanna Wallach, Isabelle Augenstein, Ryan Cotterell

Studying the ways in which language is gendered has long been an area of interest in sociolinguistics.

Paper
Add Code

Counterfactual Data Augmentation for Mitigating Gender Stereotypes in Languages with Rich Morphology

no code implementations • ACL 2019 • Ran Zmigrod, Sabrina J. Mielke, Hanna Wallach, Ryan Cotterell

Gender stereotypes are manifest in most of the world's languages and are consequently propagated or amplified by NLP systems.

counterfactual Data Augmentation

Paper
Add Code

What Kind of Language Is Hard to Language-Model?

no code implementations • ACL 2019 • Sabrina J. Mielke, Ryan Cotterell, Kyle Gorman, Brian Roark, Jason Eisner

Trying to answer the question of what features difficult languages have in common, we try and fail to reproduce our earlier (Cotterell et al., 2018) observation about morphological complexity and instead reveal far simpler statistics of the data that seem to drive complexity in a much larger sample.

Language Modelling Sentence

Paper
Add Code

Exact Hard Monotonic Attention for Character-Level Transduction

2 code implementations • ACL 2019 • Shijie Wu, Ryan Cotterell

Our models achieve state-of-the-art performance on morphological inflection.

Hard Attention Inductive Bias +1

Paper
Code

Contextualization of Morphological Inflection

no code implementations • NAACL 2019 • Ekaterina Vylomova, Ryan Cotterell, Timothy Baldwin, Trevor Cohn, Jason Eisner

Critical to natural language generation is the production of correctly inflected text.

Morphological Inflection Sentence +1

Paper
Add Code

Gender Bias in Contextualized Word Embeddings

2 code implementations • NAACL 2019 • Jieyu Zhao, Tianlu Wang, Mark Yatskar, Ryan Cotterell, Vicente Ordonez, Kai-Wei Chang

In this paper, we quantify, analyze and mitigate gender bias exhibited in ELMo's contextualized word vectors.

Word Embeddings

Paper
Code

Combining Sentiment Lexica with a Multi-View Variational Autoencoder

1 code implementation • NAACL 2019 • Alexander Hoyle, Lawrence Wolf-Sonkin, Hanna Wallach, Ryan Cotterell, Isabelle Augenstein

When assigning quantitative labels to a dataset, different methodologies may rely on different scales.

General Classification Sentiment Analysis +2

Paper
Code

A Simple Joint Model for Improved Contextual Neural Lemmatization

no code implementations • NAACL 2019 • Chaitanya Malaviya, Shijie Wu, Ryan Cotterell

English verbs have multiple forms.

LEMMA Lemmatization +1

Paper
Add Code

A Probabilistic Generative Model of Linguistic Typology

1 code implementation • NAACL 2019 • Johannes Bjerva, Yova Kementchedjhieva, Ryan Cotterell, Isabelle Augenstein

In the principles-and-parameters framework, the structural features of languages depend on parameters that may be toggled on or off, with a single parameter often dictating the status of multiple features.

Paper
Code

On the Idiosyncrasies of the Mandarin Chinese Classifier System

no code implementations • NAACL 2019 • Shijia Liu, Hongyuan Mei, Adina Williams, Ryan Cotterell

While idiosyncrasies of the Chinese classifier system have been a richly studied topic among linguists (Adams and Conklin, 1973; Erbaugh, 1986; Lakoff, 1986), not much work has been done to quantify them with statistical methods.

Paper
Add Code

UniMorph 2.0: Universal Morphology

3 code implementations • LREC 2018 • Christo Kirov, Ryan Cotterell, John Sylak-Glassman, Géraldine Walther, Ekaterina Vylomova, Patrick Xia, Manaal Faruqui, Sabrina J. Mielke, Arya D. McCarthy, Sandra Kübler, David Yarowsky, Jason Eisner, Mans Hulden

The Universal Morphology UniMorph project is a collaborative effort to improve how NLP handles complex morphology across the world's languages.

LEMMA

Paper
Code

The CoNLL--SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection

no code implementations • CONLL 2018 • Ryan Cotterell, Christo Kirov, John Sylak-Glassman, Géraldine Walther, Ekaterina Vylomova, Arya D. McCarthy, Katharina Kann, Sabrina J. Mielke, Garrett Nicolai, Miikka Silfverberg, David Yarowsky, Jason Eisner, Mans Hulden

Apart from extending the number of languages involved in earlier supervised tasks of generating inflected forms, this year the shared task also featured a new second task which asked participants to inflect words in sentential context, similar to a cloze task.

LEMMA Task 2

Paper
Add Code

Marrying Universal Dependencies and Universal Morphology

no code implementations • WS 2018 • Arya D. McCarthy, Miikka Silfverberg, Ryan Cotterell, Mans Hulden, David Yarowsky

The Universal Dependencies (UD) and Universal Morphology (UniMorph) projects each present schemata for annotating the morphosyntactic details of language.

Paper
Add Code

Generalizing Procrustes Analysis for Better Bilingual Dictionary Induction

1 code implementation • CONLL 2018 • Yova Kementchedjhieva, Sebastian Ruder, Ryan Cotterell, Anders Søgaard

Most recent approaches to bilingual dictionary induction find a linear alignment between the word vector spaces of two languages.

Paper
Code

Hard Non-Monotonic Attention for Character-Level Transduction

2 code implementations • EMNLP 2018 • Shijie Wu, Pamela Shapiro, Ryan Cotterell

We compare soft and hard non-monotonic attention experimentally and find that the exact algorithm significantly improves performance over the stochastic approximation and outperforms soft attention.

Hard Attention Image Captioning

Paper
Code

A Discriminative Latent-Variable Model for Bilingual Lexicon Induction

1 code implementation • EMNLP 2018 • Sebastian Ruder, Ryan Cotterell, Yova Kementchedjhieva, Anders Søgaard

We introduce a novel discriminative latent variable model for bilingual lexicon induction.

Bilingual Lexicon Induction

Paper
Code

Recurrent Neural Networks in Linguistic Theory: Revisiting Pinker and Prince (1988) and the Past Tense Debate

3 code implementations • TACL 2018 • Christo Kirov, Ryan Cotterell

We suggest that the empirical performance of modern networks warrants a re-examination of their utility in linguistic and cognitive modeling.

Paper
Code

On the Complexity and Typology of Inflectional Morphological Systems

no code implementations • TACL 2019 • Ryan Cotterell, Christo Kirov, Mans Hulden, Jason Eisner

We quantify the linguistic complexity of different languages' morphological systems.

Paper
Add Code

A Deep Generative Model of Vowel Formant Typology

no code implementations • NAACL 2018 • Ryan Cotterell, Jason Eisner

What makes some types of languages more probable than others?

Paper
Add Code

Explaining and Generalizing Back-Translation through Wake-Sleep

no code implementations • 12 Jun 2018 • Ryan Cotterell, Julia Kreutzer

Back-translation has become a commonly employed heuristic for semi-supervised neural machine translation.

Machine Translation Translation

Paper
Add Code

A Structured Variational Autoencoder for Contextual Morphological Inflection

2 code implementations • ACL 2018 • Lawrence Wolf-Sonkin, Jason Naradowsky, Sabrina J. Mielke, Ryan Cotterell

Statistical morphological inflectors are typically trained on fully supervised, type-level data.

Morphological Inflection Variational Inference

Paper
Code

Are All Languages Equally Hard to Language-Model?

no code implementations • NAACL 2018 • Ryan Cotterell, Sabrina J. Mielke, Jason Eisner, Brian Roark

For general modeling methods applied to diverse languages, a natural question is: how well should we expect our models to work on languages with differing typological profiles?

Language Modelling

Paper
Add Code

Unsupervised Disambiguation of Syncretism in Inflected Lexicons

no code implementations • NAACL 2018 • Ryan Cotterell, Christo Kirov, Sabrina J. Mielke, Jason Eisner

Lexical ambiguity makes it difficult to compute various useful statistics of a corpus.

Paper
Add Code

On the Diachronic Stability of Irregularity in Inflectional Morphology

no code implementations • 23 Apr 2018 • Ryan Cotterell, Christo Kirov, Mans Hulden, Jason Eisner

Many languages' inflectional morphological systems are replete with irregulars, i. e., words that do not seem to follow standard inflectional rules.

Relation

Paper
Add Code

Cross-lingual Character-Level Neural Morphological Tagging

no code implementations • EMNLP 2017 • Ryan Cotterell, Georg Heigold

Even for common NLP tasks, sufficient supervision is not available in many languages {--} morphological tagging is no exception.

Language Modelling Morphological Tagging +2

Paper
Add Code

Paradigm Completion for Derivational Morphology

no code implementations • EMNLP 2017 • Ryan Cotterell, Ekaterina Vylomova, Huda Khayrallah, Christo Kirov, David Yarowsky

The generation of complex derived word forms has been an overlooked problem in NLP; we fill this gap by applying neural sequence-to-sequence models to the task.

Paper
Add Code

Cross-lingual, Character-Level Neural Morphological Tagging

no code implementations • 30 Aug 2017 • Ryan Cotterell, Georg Heigold

Even for common NLP tasks, sufficient supervision is not available in many languages -- morphological tagging is no exception.

Morphological Tagging Transfer Learning

Paper
Add Code

Frame-Based Continuous Lexical Semantics through Exponential Family Tensor Factorization and Semantic Proto-Roles

1 code implementation • SEMEVAL 2017 • Francis Ferraro, Adam Poliak, Ryan Cotterell, Benjamin Van Durme

We study how different frame annotations complement one another when learning continuous lexical semantics.

Paper
Code

CoNLL-SIGMORPHON 2017 Shared Task: Universal Morphological Reinflection in 52 Languages

no code implementations • CONLL 2017 • Ryan Cotterell, Christo Kirov, John Sylak-Glassman, Géraldine Walther, Ekaterina Vylomova, Patrick Xia, Manaal Faruqui, Sandra Kübler, David Yarowsky, Jason Eisner, Mans Hulden

In sub-task 2, systems were given a lemma and some of its specific inflected forms, and asked to complete the inflectional paradigm by predicting all of the remaining inflected forms.

Data Augmentation Inductive Bias +2

Paper
Add Code

Probabilistic Typology: Deep Generative Models of Vowel Inventories

no code implementations • ACL 2017 • Ryan Cotterell, Jason Eisner

Linguistic typology studies the range of structures present in human language.

Point Processes

Paper
Add Code

Morphological Analysis of the Dravidian Language Family

no code implementations • EACL 2017 • Arun Kumar, Ryan Cotterell, Llu{\'\i}s Padr{\'o}, Antoni Oliver

The Dravidian languages are one of the most widely spoken language families in the world, yet there are very few annotated resources available to NLP researchers.

Morphological Analysis Segmentation

Paper
Add Code

A Rich Morphological Tagger for English: Exploring the Cross-Linguistic Tradeoff Between Morphology and Syntax

no code implementations • EACL 2017 • Christo Kirov, John Sylak-Glassman, Rebecca Knowles, Ryan Cotterell, Matt Post

A traditional claim in linguistics is that all human languages are equally expressive{---}able to convey the same wide range of meanings.

Dependency Parsing Machine Translation +3

Paper
Add Code

Neural Graphical Models over Strings for Principal Parts Morphological Paradigm Completion

no code implementations • EACL 2017 • Ryan Cotterell, John Sylak-Glassman, Christo Kirov

Many of the world{'}s languages contain an abundance of inflected forms for each lexeme.

Morphological Analysis

Paper
Add Code

Explaining and Generalizing Skip-Gram through Exponential Family Principal Component Analysis

no code implementations • EACL 2017 • Ryan Cotterell, Adam Poliak, Benjamin Van Durme, Jason Eisner

The popular skip-gram model induces word embeddings by exploiting the signal from word-context coocurrence.

Word Embeddings

Paper
Add Code

One-Shot Neural Cross-Lingual Transfer for Paradigm Completion

no code implementations • ACL 2017 • Katharina Kann, Ryan Cotterell, Hinrich Schütze

We present a novel cross-lingual transfer method for paradigm completion, the task of mapping a lemma to its inflected forms, using a neural encoder-decoder model, the state of the art for the monolingual task.

Cross-Lingual Transfer LEMMA +1