Search Results for author: Shijie Wu

Found 23 papers, 14 papers with code

MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies

1 code implementation • 26 May 2023 • Shiyue Zhang, Shijie Wu, Ozan Irsoy, Steven Lu, Mohit Bansal, Mark Dredze, David Rosenberg

Autoregressive language models are trained by minimizing the cross-entropy of the model distribution Q relative to the data distribution P -- that is, minimizing the forward cross-entropy, which is equivalent to maximum likelihood estimation (MLE).
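To make the MixCE excerpt above concrete, the two objectives named in the title can be written out as below. This is a minimal sketch in LaTeX notation; the mixing weight \eta and the exact way the two terms are combined are assumptions for illustration, not the paper's stated formulation.

    % Forward cross-entropy (minimized by standard MLE training):
    \mathrm{CE}(P, Q) = -\mathbb{E}_{x \sim P}\,[\log Q(x)]
    % Reverse cross-entropy:
    \mathrm{CE}(Q, P) = -\mathbb{E}_{x \sim Q}\,[\log P(x)]
    % A mixed objective with an assumed mixing weight \eta \in [0, 1]:
    \mathcal{L}_{\mathrm{MixCE}} = \eta\,\mathrm{CE}(P, Q) + (1 - \eta)\,\mathrm{CE}(Q, P)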

Overcoming Catastrophic Forgetting in Massively Multilingual Continual Learning

no code implementations • 25 May 2023 • Genta Indra Winata, Lingjue Xie, Karthik Radhakrishnan, Shijie Wu, Xisen Jin, Pengxiang Cheng, Mayank Kulkarni, Daniel Preotiuc-Pietro

Real-life multilingual systems should be able to efficiently incorporate new languages as data distributions fed to the system evolve and shift over time.

Continual Learning • Scheduling

BloombergGPT: A Large Language Model for Finance

no code implementations • 30 Mar 2023 • Shijie Wu, Ozan Irsoy, Steven Lu, Vadim Dabravolski, Mark Dredze, Sebastian Gehrmann, Prabhanjan Kambadur, David Rosenberg, Gideon Mann

The use of NLP in the realm of financial technology is broad and complex, with applications ranging from sentiment analysis and named entity recognition to question answering.

Causal Judgment • Date Understanding +21

BoundaryFace: A mining framework with noise label self-correction for Face Recognition

1 code implementation • 10 Oct 2022 • Shijie Wu, Xun Gong

Specifically, a closed-set noise-label self-correction module is proposed, enabling the framework to perform well on datasets containing substantial label noise.

Face Recognition

How Do Multilingual Encoders Learn Cross-lingual Representation?

no code implementations • 12 Jul 2022 • Shijie Wu

We also look at how to inject different cross-lingual signals into multilingual encoders, and the optimization behavior of cross-lingual transfer with these models.

Cross-Lingual Transfer • Multilingual NLP +1

Zero-shot Cross-lingual Transfer is Under-specified Optimization

1 code implementation • RepL4NLP (ACL) 2022 • Shijie Wu, Benjamin Van Durme, Mark Dredze

Pretrained multilingual encoders enable zero-shot cross-lingual transfer, but often produce unreliable models that exhibit high performance variance on the target language.

Zero-Shot Cross-Lingual Transfer
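As a rough illustration of the zero-shot setup described above: a multilingual encoder is fine-tuned on labeled data in a source language (typically English) and then evaluated directly on a target language with no target-language labels. The sketch below assumes Hugging Face Transformers and XLM-R purely for illustration; it is not the paper's released code, and the fine-tuning loop is omitted.

    # Minimal sketch of zero-shot cross-lingual transfer (assumed setup, not the paper's code).
    import torch
    from transformers import AutoTokenizer, AutoModelForSequenceClassification

    tokenizer = AutoTokenizer.from_pretrained("xlm-roberta-base")
    model = AutoModelForSequenceClassification.from_pretrained("xlm-roberta-base", num_labels=3)

    # Step 1: fine-tune `model` on English task data only (training loop omitted).
    # Step 2: evaluate directly on target-language text, without target-language labels.
    batch = tokenizer(["Das Essen war ausgezeichnet."], return_tensors="pt", padding=True)
    with torch.no_grad():
        logits = model(**batch).logits
    prediction = logits.argmax(dim=-1)
    # The paper's point: repeating this with different fine-tuning seeds can yield
    # very different target-language accuracy, i.e. high performance variance.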

Differentiable Generative Phonology

1 code implementation • 10 Feb 2021 • Shijie Wu, Edoardo Maria Ponti, Ryan Cotterell

As the main contribution of our work, we implement the phonological generative system as an end-to-end differentiable neural model, rather than as a set of rules or constraints.

Do Explicit Alignments Robustly Improve Multilingual Encoders?

1 code implementation • EMNLP 2020 • Shijie Wu, Mark Dredze

Multilingual BERT (mBERT), XLM-RoBERTa (XLMR) and other unsupervised multilingual encoders can effectively learn cross-lingual representation.

The SIGMORPHON 2020 Shared Task on Multilingual Grapheme-to-Phoneme Conversion

no code implementations • WS 2020 • Kyle Gorman, Lucas F.E. Ashby, Aaron Goyzueta, Arya McCarthy, Shijie Wu, Daniel You

We describe the design and findings of the SIGMORPHON 2020 shared task on multilingual grapheme-to-phoneme conversion.

Applying the Transformer to Character-level Transduction

2 code implementations • EACL 2021 • Shijie Wu, Ryan Cotterell, Mans Hulden

The transformer has been shown to outperform recurrent neural network-based sequence-to-sequence models in various word-level NLP tasks.

Morphological Inflection • Transliteration
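For context on the task framing above: character-level transduction casts word-level problems such as morphological inflection or transliteration as sequence-to-sequence prediction over characters. A toy input/output pair is sketched below; the tag scheme is an assumption for illustration, not the paper's data format.

    # Toy example of morphological inflection as character-level transduction
    # (hypothetical tag scheme, for illustration only).
    src = ["g", "i", "v", "e", "<V>", "<PST>"]  # lemma characters plus morphological tags
    tgt = ["g", "a", "v", "e"]                  # inflected form, generated character by character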

Are All Languages Created Equal in Multilingual BERT?

1 code implementation • WS 2020 • Shijie Wu, Mark Dredze

Multilingual BERT (mBERT) trained on 104 languages has shown surprisingly good cross-lingual performance on several NLP tasks, even without explicit cross-lingual signals.

Cross-Lingual Transfer • Dependency Parsing +4

The Paradigm Discovery Problem

1 code implementation • ACL 2020 • Alexander Erdmann, Micha Elsner, Shijie Wu, Ryan Cotterell, Nizar Habash

Our benchmark system first makes use of word embeddings and string similarity to cluster forms by cell and by paradigm.

Clustering • Word Embeddings
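The excerpt above combines two signals, surface string similarity and word-embedding similarity, to group forms by cell and by paradigm. The sketch below is a toy illustration of that combination; the specific similarity functions, the blending weight, and the downstream clustering step are assumptions, not the benchmark system's actual implementation.

    # Toy blend of string similarity and embedding similarity between word forms
    # (assumed functions and weighting, for illustration only).
    from difflib import SequenceMatcher
    import numpy as np

    def string_sim(a: str, b: str) -> float:
        # Ratio of matching characters between the two surface forms.
        return SequenceMatcher(None, a, b).ratio()

    def embedding_sim(u: np.ndarray, v: np.ndarray) -> float:
        # Cosine similarity between the forms' word embeddings.
        return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v) + 1e-9))

    def form_affinity(a: str, b: str, emb: dict, alpha: float = 0.5) -> float:
        # Forms with high affinity would be grouped into the same paradigm
        # by a downstream clustering step (e.g. agglomerative clustering).
        return alpha * string_sim(a, b) + (1 - alpha) * embedding_sim(emb[a], emb[b])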

Emerging Cross-lingual Structure in Pretrained Language Models

no code implementations • ACL 2020 • Shijie Wu, Alexis Conneau, Haoran Li, Luke Zettlemoyer, Veselin Stoyanov

We study the problem of multilingual masked language modeling, i.e., the training of a single model on concatenated text from multiple languages, and present a detailed study of several factors that influence why these models are so effective for cross-lingual transfer.

Cross-Lingual Transfer • Language Modelling +2
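The study above concerns multilingual masked language modeling: a single model with a shared vocabulary trained on text drawn from many languages. The sketch below shows a simplified masking step; the model/tokenizer choice, the 15% masking rate, and the mask-only corruption (BERT-style training also uses random and unchanged replacements, and skips special tokens) are simplifications and assumptions for illustration.

    # Simplified multilingual masked language modeling: one tokenizer, one model,
    # text concatenated across languages (assumed setup, for illustration only).
    import random
    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("bert-base-multilingual-cased")

    corpus = [
        "The cat sat on the mat.",         # English
        "Le chat est assis sur le tapis.", # French
        "Die Katze sitzt auf der Matte.",  # German
    ]

    def mask_tokens(ids, mask_prob=0.15):
        # Replace a random subset of tokens with [MASK]; the MLM loss is computed
        # only at masked positions (label -100 is ignored by the loss).
        labels = list(ids)
        for i in range(len(ids)):
            if random.random() < mask_prob:
                ids[i] = tokenizer.mask_token_id
            else:
                labels[i] = -100
        return ids, labels

    input_ids = tokenizer(" ".join(corpus))["input_ids"]
    masked_ids, labels = mask_tokens(input_ids)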

The SIGMORPHON 2019 Shared Task: Morphological Analysis in Context and Cross-Lingual Transfer for Inflection

no code implementations • WS 2019 • Arya D. McCarthy, Ekaterina Vylomova, Shijie Wu, Chaitanya Malaviya, Lawrence Wolf-Sonkin, Garrett Nicolai, Christo Kirov, Miikka Silfverberg, Sabrina J. Mielke, Jeffrey Heinz, Ryan Cotterell, Mans Hulden

The SIGMORPHON 2019 shared task on cross-lingual transfer and contextual analysis in morphology examined transfer learning of inflection between 100 language pairs, as well as contextual lemmatization and morphosyntactic description in 66 languages.

Cross-Lingual Transfer • Lemmatization +3

Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT

2 code implementations • IJCNLP 2019 • Shijie Wu, Mark Dredze

Pretrained contextual representation models (Peters et al., 2018; Devlin et al., 2018) have pushed forward the state-of-the-art on many NLP tasks.

Cross-Lingual NER • Dependency Parsing +6

Hard Non-Monotonic Attention for Character-Level Transduction

2 code implementations • EMNLP 2018 • Shijie Wu, Pamela Shapiro, Ryan Cotterell

We compare soft and hard non-monotonic attention experimentally and find that the exact algorithm significantly improves performance over the stochastic approximation and outperforms soft attention.

Hard Attention • Image Captioning
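As a rough guide to the contrast drawn in the excerpt above (the notation below is simplified and mine, not the paper's exact factorization): soft attention feeds the decoder a deterministic weighted average of encoder states, whereas hard attention treats the alignment as a latent variable whose sum can be computed exactly, e.g. by dynamic programming, rather than approximated by sampling.

    % Soft attention: deterministic context vector at output step t,
    % built from encoder states h_i and attention weights \alpha_{t,i}.
    c_t = \sum_i \alpha_{t,i} \, h_i
    % Hard attention: the alignment a is latent; the likelihood marginalizes over it.
    % The "exact algorithm" in the abstract computes this sum exactly rather than
    % estimating it stochastically.
    p(y \mid x) = \sum_{a} p(a \mid x)\, p(y \mid x, a)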
