Search Results for author: Nikolay Bogoychev

Found 29 papers, 11 papers with code

Terminology-Aware Translation with Constrained Decoding and Large Language Model Prompting

no code implementations9 Oct 2023 Nikolay Bogoychev, Pinzhen Chen

Terminology correctness is important in the downstream application of machine translation, and a prevalent way to ensure this is to inject terminology constraints into a translation system.

Language Modelling Large Language Model +3

Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca

1 code implementation16 Sep 2023 Pinzhen Chen, Shaoxiong Ji, Nikolay Bogoychev, Andrey Kutuzov, Barry Haddow, Kenneth Heafield

Foundational large language models (LLMs) can be instruction-tuned to perform open-domain question answering, facilitating applications like chat assistants.

Instruction Following Large Language Model +3

An Open Dataset and Model for Language Identification

1 code implementation23 May 2023 Laurie Burchell, Alexandra Birch, Nikolay Bogoychev, Kenneth Heafield

We achieve this by training on a curated dataset of monolingual data, the reliability of which we ensure by auditing a sample from each source and each language manually.

Language Identification

The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR

no code implementations31 Mar 2023 Ramon Sanabria, Nikolay Bogoychev, Nina Markl, Andrea Carmantini, Ondrej Klejch, Peter Bell

Although the great many advances in English automatic speech recognition (ASR) over the past decades, results are usually reported based on test datasets which fail to represent the diversity of English as spoken today around the globe.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Not all parameters are born equal: Attention is mostly what you need

1 code implementation EMNLP (BlackboxNLP) 2021 Nikolay Bogoychev

We show that the decision about whether a component is frozen or allowed to train is at least as important for the final model performance as its number of parameters.

Language Modelling Machine Translation +1

Character Mapping and Ad-hoc Adaptation: Edinburgh's IWSLT 2020 Open Domain Translation System

1 code implementation WS 2020 Pin-zhen Chen, Nikolay Bogoychev, Ulrich Germann

This paper describes the University of Edinburgh{'}s neural machine translation systems submitted to the IWSLT 2020 open domain Japanese$\leftrightarrow$Chinese translation task.

Machine Translation Translation

Domain, Translationese and Noise in Synthetic Data for Neural Machine Translation

no code implementations6 Nov 2019 Nikolay Bogoychev, Rico Sennrich

The quality of neural machine translation can be improved by leveraging additional monolingual resources to create synthetic training data.

Machine Translation Translation

From Research to Production and Back: Ludicrously Fast Neural Machine Translation

no code implementations WS 2019 Young Jin Kim, Marcin Junczys-Dowmunt, Hany Hassan, Alham Fikri Aji, Kenneth Heafield, Roman Grundkiewicz, Nikolay Bogoychev

Taking our dominating submissions to the previous edition of the shared task as a starting point, we develop improved teacher-student training via multi-agent dual-learning and noisy backward-forward translation for Transformer-based student models.

C++ code Machine Translation +1

Accelerating Asynchronous Stochastic Gradient Descent for Neural Machine Translation

no code implementations EMNLP 2018 Nikolay Bogoychev, Marcin Junczys-Dowmunt, Kenneth Heafield, Alham Fikri Aji

In order to extract the best possible performance from asynchronous stochastic gradient descent one must increase the mini-batch size and scale the learning rate accordingly.

Machine Translation Translation

Marian: Fast Neural Machine Translation in C++

2 code implementations ACL 2018 Marcin Junczys-Dowmunt, Roman Grundkiewicz, Tomasz Dwojak, Hieu Hoang, Kenneth Heafield, Tom Neckermann, Frank Seide, Ulrich Germann, Alham Fikri Aji, Nikolay Bogoychev, André F. T. Martins, Alexandra Birch

We present Marian, an efficient and self-contained Neural Machine Translation framework with an integrated automatic differentiation engine based on dynamic computation graphs.

Machine Translation Translation

Fast, Scalable Phrase-Based SMT Decoding

no code implementations AMTA 2016 Hieu Hoang, Nikolay Bogoychev, Lane Schwartz, Marcin Junczys-Dowmunt

The utilization of statistical machine translation (SMT) has grown enormously over the last decade, many using open-source software developed by the NLP community.

Machine Translation Translation

Cannot find the paper you are looking for? You can Submit a new open access paper.