no code implementations • WMT (EMNLP) 2021 • Lukas Edman, Ahmet Üstün, Antonio Toral, Gertjan van Noord
This paper describes the methods behind the systems submitted by the University of Groningen for the WMT 2021 Unsupervised Machine Translation task for German–Lower Sorbian (DE–DSB): translation from a high-resource language to a low-resource one.
no code implementations • CL (ACL) 2022 • Ahmet Üstün, Arianna Bisazza, Gosse Bouma, Gertjan van Noord
To address this, we propose a novel language adaptation approach by introducing contextual language adapters to a multilingual parser.
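As a rough illustration of that idea, here is a minimal sketch of a bottleneck adapter whose weights are generated from a language embedding by a small hypernetwork, assuming PyTorch; the dimensions and the single linear generator are illustrative assumptions, not the paper's exact architecture.

```python
# A minimal sketch of a language adapter with contextually generated weights.
# Dimensions and the single-linear hypernetwork are illustrative assumptions.
import torch
import torch.nn as nn

class ContextualAdapter(nn.Module):
    def __init__(self, d_model: int, d_bottleneck: int, d_lang: int):
        super().__init__()
        self.d_model, self.d_bottleneck = d_model, d_bottleneck
        # Hypernetwork: maps a language embedding to this adapter's weights.
        n_params = 2 * d_model * d_bottleneck
        self.generator = nn.Linear(d_lang, n_params)

    def forward(self, x: torch.Tensor, lang_emb: torch.Tensor) -> torch.Tensor:
        w = self.generator(lang_emb)
        split = self.d_model * self.d_bottleneck
        w_down = w[:split].view(self.d_bottleneck, self.d_model)
        w_up = w[split:].view(self.d_model, self.d_bottleneck)
        # Bottleneck adapter with a residual connection.
        h = torch.relu(x @ w_down.T)
        return x + h @ w_up.T

adapter = ContextualAdapter(d_model=16, d_bottleneck=4, d_lang=8)
x, lang = torch.randn(10, 16), torch.randn(8)
print(adapter(x, lang).shape)  # torch.Size([10, 16])
```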
no code implementations • 22 Feb 2024 • Arash Ahmadian, Chris Cremer, Matthias Gallé, Marzieh Fadaee, Julia Kreutzer, Olivier Pietquin, Ahmet Üstün, Sara Hooker
AI alignment in the shape of Reinforcement Learning from Human Feedback (RLHF) is increasingly treated as a crucial ingredient for high-performance large language models.
no code implementations • 12 Feb 2024 • Ahmet Üstün, Viraat Aryabumi, Zheng-Xin Yong, Wei-Yin Ko, Daniel D'souza, Gbemileke Onilude, Neel Bhandari, Shivalika Singh, Hui-Lee Ooi, Amr Kayid, Freddie Vargus, Phil Blunsom, Shayne Longpre, Niklas Muennighoff, Marzieh Fadaee, Julia Kreutzer, Sara Hooker
Recent breakthroughs in large language models (LLMs) have centered around a handful of data-rich languages.
no code implementations • 9 Feb 2024 • Shivalika Singh, Freddie Vargus, Daniel Dsouza, Börje F. Karlsson, Abinaya Mahendiran, Wei-Yin Ko, Herumb Shandilya, Jay Patel, Deividas Mataciunas, Laura OMahony, Mike Zhang, Ramith Hettiarachchi, Joseph Wilson, Marina Machado, Luisa Souza Moura, Dominik Krzemiński, Hakimeh Fadaei, Irem Ergün, Ifeoma Okoh, Aisha Alaagib, Oshan Mudannayake, Zaid Alyafeai, Vu Minh Chien, Sebastian Ruder, Surya Guthikonda, Emad A. Alghamdi, Sebastian Gehrmann, Niklas Muennighoff, Max Bartolo, Julia Kreutzer, Ahmet Üstün, Marzieh Fadaee, Sara Hooker
The Aya initiative also serves as a valuable case study in participatory research, involving collaborators from 119 countries.
1 code implementation • 11 Sep 2023 • Ted Zadouri, Ahmet Üstün, Arash Ahmadian, Beyza Ermiş, Acyr Locatelli, Sara Hooker
The Mixture of Experts (MoE) is a widely known neural architecture where an ensemble of specialized sub-models optimizes overall performance with a constant computational cost.
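To make the architecture concrete, here is a minimal sketch of a top-1-routed MoE feed-forward layer, assuming PyTorch; the layer sizes and the simple softmax router are illustrative assumptions, not the configuration studied in the paper.

```python
# A minimal sketch of a Mixture-of-Experts feed-forward layer with top-1
# routing. Sizes and the softmax router are illustrative assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class MoELayer(nn.Module):
    def __init__(self, d_model: int, d_ff: int, n_experts: int):
        super().__init__()
        self.router = nn.Linear(d_model, n_experts)  # gating network
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.ReLU(),
                          nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (tokens, d_model); route each token to its top-1 expert, so
        # compute per token stays constant regardless of n_experts.
        gates = F.softmax(self.router(x), dim=-1)   # (tokens, n_experts)
        top_gate, top_idx = gates.max(dim=-1)       # winning expert per token
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):
            mask = top_idx == e
            if mask.any():
                # Scale by the gate value so routing stays differentiable.
                out[mask] = top_gate[mask].unsqueeze(-1) * expert(x[mask])
        return out

moe = MoELayer(d_model=16, d_ff=32, n_experts=4)
print(moe(torch.randn(8, 16)).shape)  # torch.Size([8, 16])
```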
no code implementations • 8 Sep 2023 • Max Marion, Ahmet Üstün, Luiza Pozzobon, Alex Wang, Marzieh Fadaee, Sara Hooker
In this work, we take a wider view and explore scalable estimates of data quality that can be used to systematically measure the quality of pretraining data.
1 code implementation • 24 May 2022 • Ahmet Üstün, Arianna Bisazza, Gosse Bouma, Gertjan van Noord, Sebastian Ruder
Massively multilingual models are promising for transfer learning across tasks and languages.
1 code implementation • 23 May 2022 • Ahmet Üstün, Asa Cooper Stickland
We find that using PEFTs with a larger pre-trained model outperforms full fine-tuning with a smaller model, and for smaller training data sizes, PEFTs outperform full fine-tuning for the same pre-trained model.
no code implementations • EMNLP 2021 • Ahmet Üstün, Alexandre Bérard, Laurent Besacier, Matthias Gallé
We consider the problem of multilingual unsupervised machine translation, translating to and from languages that only have monolingual data by using auxiliary parallel language pairs.
1 code implementation • 24 Sep 2021 • Lukas Edman, Ahmet Üstün, Antonio Toral, Gertjan van Noord
Lastly, we experiment with the order in which offline and online back-translation are used to train an unsupervised system, finding that using online back-translation first works better for DE→DSB by 2.76 BLEU.
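To make the offline/online distinction concrete, below is a minimal sketch of the two training schedules, with hypothetical back_translate and train placeholders standing in for a real NMT system; only the ordering logic reflects the paper, not its models or data.

```python
# A sketch contrasting offline and online back-translation schedules.
# back_translate() and train() are hypothetical placeholders, not real code.
def back_translate(model, target_sents):
    # Placeholder: translate target-side monolingual sentences back into
    # the source language with the current reverse model.
    return [f"<synthetic DE for: {s}>" for s in target_sents]

def train(model, pairs):
    # Placeholder: one training pass of the DE->DSB model on (src, tgt) pairs.
    print(f"{model}: training on {len(pairs)} synthetic pairs")
    return model

dsb_mono = ["dsb sentence 1", "dsb sentence 2"]  # monolingual Lower Sorbian

# Offline BT: generate one fixed synthetic corpus, then train on it.
synthetic = list(zip(back_translate("initial", dsb_mono), dsb_mono))
model = train("offline", synthetic)

# Online BT: regenerate the synthetic pairs with the current model each
# round, so synthetic data quality improves as the model improves.
model = "online"
for _ in range(3):
    synthetic = list(zip(back_translate(model, dsb_mono), dsb_mono))
    model = train(model, synthetic)
```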
2 code implementations • 13 Jul 2021 • Arianna Bisazza, Ahmet Üstün, Stephan Sportel
Identifying factors that make certain languages harder to model than others is essential to reach language equality in future Natural Language Processing technologies.
2 code implementations • NAACL 2021 • Rob van der Goot, Ibrahim Sharaf, Aizhan Imankulova, Ahmet Üstün, Marija Stepanović, Alan Ramponi, Siti Oryza Khairunnisa, Mamoru Komachi, Barbara Plank
To tackle the challenge, we propose a joint learning approach that combines English SLU training data with non-English auxiliary tasks drawn from raw text, syntax, and translation for transfer.
no code implementations • EACL (AdaptNLP) 2021 • Rob van der Goot, Ahmet Üstün, Barbara Plank
However, it remains unclear in which situations these dataset embeddings are most effective, because they are used in a large variety of settings, languages and tasks.
1 code implementation • SEMEVAL 2020 • Bertelt Braaksma, Richard Scholtens, Stan van Suijlekom, Remy Wang, Ahmet Üstün
In this paper, we present our approach for sentiment classification on Spanish-English code-mixed social media data in the SemEval-2020 Task 9.
2 code implementations • EACL 2021 • Rob van der Goot, Ahmet Üstün, Alan Ramponi, Ibrahim Sharaf, Barbara Plank
In this paper we present MaChAmp, a toolkit for easy fine-tuning of contextualized embeddings in multi-task settings.
1 code implementation • EMNLP 2020 • Ahmet Üstün, Arianna Bisazza, Gosse Bouma, Gertjan van Noord
The resulting parser, UDapter, outperforms strong monolingual and multilingual baselines on the majority of both high-resource and low-resource (zero-shot) languages, showing the success of the proposed adaptation approach.
no code implementations • 24 Apr 2017 • Murathan Kurfali, Ahmet Üstün, Burcu Can
Our results show that using different information sources such as neural word embeddings and letter successor variety as prior information improves morphological segmentation in a Bayesian model.
no code implementations • 9 Mar 2017 • Burcu Can, Ahmet Üstün, Murathan Kurfali
We learn inflectional and derivational morpheme tags in Turkish by using conditional random fields (CRF) and we employ the morpheme tags in part-of-speech (PoS) tagging by using hidden Markov models (HMMs) to mitigate sparsity.
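As a toy illustration of the HMM stage, the sketch below runs Viterbi decoding over morpheme-tag observations of the kind the CRF stage might emit; the states, tags, and probability tables are made up for illustration and are not estimates from Turkish data.

```python
# A toy Viterbi decoder for HMM PoS tagging over CRF-style morpheme tags.
# All states, tags, and probabilities are invented for illustration.
import math

states = ["Noun", "Verb"]
start_p = {"Noun": 0.6, "Verb": 0.4}
trans_p = {"Noun": {"Noun": 0.7, "Verb": 0.3},
           "Verb": {"Noun": 0.5, "Verb": 0.5}}
# Observations are morpheme tags predicted by the upstream CRF stage.
emit_p = {"Noun": {"STEM+PLU": 0.8, "STEM+PAST": 0.2},
          "Verb": {"STEM+PLU": 0.1, "STEM+PAST": 0.9}}

def viterbi(obs):
    # Log-space dynamic programming over PoS states.
    V = [{s: math.log(start_p[s]) + math.log(emit_p[s][obs[0]])
          for s in states}]
    path = {s: [s] for s in states}
    for o in obs[1:]:
        V.append({})
        new_path = {}
        for s in states:
            lp, prev = max(
                (V[-2][p] + math.log(trans_p[p][s]) + math.log(emit_p[s][o]), p)
                for p in states
            )
            V[-1][s] = lp
            new_path[s] = path[prev] + [s]
        path = new_path
    best = max(states, key=lambda s: V[-1][s])
    return path[best]

print(viterbi(["STEM+PLU", "STEM+PAST"]))  # ['Noun', 'Verb']
```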