Search Results for author: Vésteinn Snæbjarnarson

Found 12 papers, 2 papers with code

Miðeind’s WMT 2021 Submission

no code implementations WMT (EMNLP) 2021 Haukur Barri Símonarson, Vésteinn Snæbjarnarson, Pétur Orri Ragnarson, Haukur Jónsson, Vilhjalmur THorsteinsson

We present Miðeind’s submission for the English→Icelandic and Icelandic→English subsets of the 2021 WMT news translation task.

Translation

Natural Questions in Icelandic

no code implementations LREC 2022 Vésteinn Snæbjarnarson, Hafsteinn Einarsson

The dataset is a valuable resource for Icelandic which we demonstrate by creating and evaluating a system capable of extractive QA in Icelandic.

Extractive Question-Answering Natural Questions +1

Context versus Prior Knowledge in Language Models

no code implementations6 Apr 2024 Kevin Du, Vésteinn Snæbjarnarson, Niklas Stoehr, Jennifer C. White, Aaron Schein, Ryan Cotterell

To answer a question, language models often need to integrate prior knowledge learned during pretraining and new information presented in context.

Byte-Level Grammatical Error Correction Using Synthetic and Curated Corpora

1 code implementation29 May 2023 Svanhvít Lilja Ingólfsdóttir, Pétur Orri Ragnarsson, Haukur Páll Jónsson, Haukur Barri Símonarson, Vilhjálmur Þorsteinsson, Vésteinn Snæbjarnarson

We show that a byte-level model enables higher correction quality than a subword approach, not only for simple spelling errors, but also for more complex semantic, stylistic and grammatical issues.

Grammatical Error Correction

Discriminative Class Tokens for Text-to-Image Diffusion Models

1 code implementation ICCV 2023 Idan Schwartz, Vésteinn Snæbjarnarson, Hila Chefer, Ryan Cotterell, Serge Belongie, Lior Wolf, Sagie Benaim

This approach has two disadvantages: (i) supervised datasets are generally small compared to large-scale scraped text-image datasets on which text-to-image models are trained, affecting the quality and diversity of the generated images, or (ii) the input is a hard-coded label, as opposed to free-form text, limiting the control over the generated images.

Assessing Neural Network Robustness via Adversarial Pivotal Tuning

no code implementations17 Nov 2022 Peter Ebert Christensen, Vésteinn Snæbjarnarson, Andrea Dittadi, Serge Belongie, Sagie Benaim

We demonstrate that APT is capable of a wide range of class-preserving semantic image manipulations that fool a variety of pretrained classifiers.

Attribute

Cross-Lingual QA as a Stepping Stone for Monolingual Open QA in Icelandic

no code implementations NAACL (MIA) 2022 Vésteinn Snæbjarnarson, Hafsteinn Einarsson

Our approach requires only limited QA resources in the given language, along with machine-translated data, and at least a bilingual language model.

Language Modelling Open-Ended Question Answering

A Warm Start and a Clean Crawled Corpus -- A Recipe for Good Language Models

no code implementations14 Jan 2022 Vésteinn Snæbjarnarson, Haukur Barri Símonarson, Pétur Orri Ragnarsson, Svanhvít Lilja Ingólfsdóttir, Haukur Páll Jónsson, Vilhjálmur Þorsteinsson, Hafsteinn Einarsson

To train the models we introduce a new corpus of Icelandic text, the Icelandic Common Crawl Corpus (IC3), a collection of high quality texts found online by targeting the Icelandic top-level-domain (TLD).

Constituency Parsing Grammatical Error Detection +4

Miðeind's WMT 2021 submission

no code implementations15 Sep 2021 Haukur Barri Símonarson, Vésteinn Snæbjarnarson, Pétur Orri Ragnarsson, Haukur Páll Jónsson, Vilhjálmur Þorsteinsson

We present Mi{\dh}eind's submission for the English$\to$Icelandic and Icelandic$\to$English subsets of the 2021 WMT news translation task.

Translation

Icelandic Parallel Abstracts Corpus

no code implementations11 Aug 2021 Haukur Barri Símonarson, Vésteinn Snæbjarnarson

We present a new Icelandic-English parallel corpus, the Icelandic Parallel Abstracts Corpus (IPAC), composed of abstracts from student theses and dissertations.

NMT Sentence +1

Cannot find the paper you are looking for? You can Submit a new open access paper.