Search Results for author: Toms Bergmanis

Found 13 papers, 4 papers with code

A Tale of Eight Countries or the EU Council Presidency Translator in Retrospect

no code implementations • AMTA 2020 • Mārcis Pinnis, Toms Bergmanis, Kristīne Metuzāle, Valters Šics, Artūrs Vasiļevskis, Andrejs Vasiļjevs

Paper
Add Code

MTee: Open Machine Translation Platform for Estonian Government

no code implementations • EAMT 2022 • Toms Bergmanis, Marcis Pinnis, Roberts Rozis, Jānis Šlapiņš, Valters Šics, Berta Bernāne, Guntars Pužulis, Endijs Titomers, Andre Tättar, Taido Purason, Hele-Andra Kuulmets, Agnes Luhtaru, Liisa Rätsep, Maali Tars, Annika Laumets-Tättar, Mark Fishel

We present the MTee project - a research initiative funded via an Estonian public procurement to develop machine translation technology that is open-source and free of charge.

Document Translation Grammatical Error Correction +2

Paper
Add Code

From Zero to Production: Baltic-Ukrainian Machine Translation Systems to Aid Refugees

no code implementations • 28 Sep 2022 • Toms Bergmanis, Mārcis Pinnis

In this paper, we examine the development and usage of six low-resource machine translation systems translating between the Ukrainian language and each of the official languages of the Baltic states.

Machine Translation Translation

Paper
Add Code

Open Terminology Management and Sharing Toolkit for Federation of Terminology Databases

no code implementations • LREC 2022 • Andis Lagzdiņš, Uldis Siliņš, Mārcis Pinnis, Toms Bergmanis, Artūrs Vasiļevskis, Andrejs Vasiļjevs

Consolidated access to current and reliable terms from different subject fields and languages is necessary for content creators and translators.

Machine Translation Management +3

Paper
Add Code

Dynamic Terminology Integration for COVID-19 and other Emerging Domains

no code implementations • WMT (EMNLP) 2021 • Toms Bergmanis, Mārcis Pinnis

The majority of language domains require prudent use of terminology to ensure clarity and adequacy of information conveyed.

Machine Translation Translation

Paper
Add Code

Facilitating Terminology Translation with Target Lemma Annotations

1 code implementation • EACL 2021 • Toms Bergmanis, Mārcis Pinnis

Most of the recent work on terminology integration in machine translation has assumed that terminology translations are given already inflected in forms that are suitable for the target language sentence.

Data Augmentation LEMMA +3

Paper
Code

Mitigating Gender Bias in Machine Translation with Target Gender Annotations

1 code implementation • WMT (EMNLP) 2020 • Artūrs Stafanovičs, Toms Bergmanis, Mārcis Pinnis

to a language with grammatical gender, it might be necessary to determine the gender of the subject "secretary".

Machine Translation Sentence +1

Paper
Code

Robust Neural Machine Translation: Modeling Orthographic and Interpunctual Variation

no code implementations • 11 Sep 2020 • Toms Bergmanis, Artūrs Stafanovičs, Mārcis Pinnis

Neural machine translation systems typically are trained on curated corpora and break when faced with non-standard orthography or punctuation.

Machine Translation Sentence +1

Paper
Add Code

Training Data Augmentation for Context-Sensitive Neural Lemmatizer Using Inflection Tables and Raw Text

1 code implementation • NAACL 2019 • Toms Bergmanis, Sharon Goldwater

Lemmatization aims to reduce the sparse data problem by relating the inflected forms of a word to its dictionary form.

Data Augmentation LEMMA +2

Paper
Code

Training Data Augmentation for Context-Sensitive Neural Lemmatization Using Inflection Tables and Raw Text

1 code implementation • 2 Apr 2019 • Toms Bergmanis, Sharon Goldwater

Lemmatization aims to reduce the sparse data problem by relating the inflected forms of a word to its dictionary form.

Data Augmentation LEMMA +2

Paper
Code

Context Sensitive Neural Lemmatization with Lematus

no code implementations • NAACL 2018 • Toms Bergmanis, Sharon Goldwater

The main motivation for developing contextsensitive lemmatizers is to improve performance on unseen and ambiguous words.

Decoder Lemmatization +3

Paper
Add Code

Training Data Augmentation for Low-Resource Morphological Inflection

no code implementations • CONLL 2017 • Toms Bergmanis, Katharina Kann, Hinrich Sch{\"u}tze, Sharon Goldwater

Data Augmentation Morphological Inflection +1

Paper
Add Code

From Segmentation to Analyses: a Probabilistic Model for Unsupervised Morphology Induction

no code implementations • EACL 2017 • Toms Bergmanis, Sharon Goldwater

A major motivation for unsupervised morphological analysis is to reduce the sparse data problem in under-resourced languages.

Morphological Analysis Segmentation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.