Search Results for author: Vilhjálmur Þorsteinsson

Found 4 papers, 1 paper with code

Developing a Spell and Grammar Checker for Icelandic using an Error Corpus

no code implementations LREC 2022 Hulda Óladóttir, Þórunn Arnardóttir, Anton Ingason, Vilhjálmur Þorsteinsson

A lack of datasets for spelling and grammatical error correction in Icelandic, along with language-specific issues, has caused a dearth of spell and grammar checking systems for the language.

Grammatical Error Correction, Sentence

Byte-Level Grammatical Error Correction Using Synthetic and Curated Corpora

1 code implementation 29 May 2023 Svanhvít Lilja Ingólfsdóttir, Pétur Orri Ragnarsson, Haukur Páll Jónsson, Haukur Barri Símonarson, Vilhjálmur Þorsteinsson, Vésteinn Snæbjarnarson

We show that a byte-level model enables higher correction quality than a subword approach, not only for simple spelling errors, but also for more complex semantic, stylistic and grammatical issues.

Grammatical Error Correction

A Warm Start and a Clean Crawled Corpus -- A Recipe for Good Language Models

no code implementations 14 Jan 2022 Vésteinn Snæbjarnarson, Haukur Barri Símonarson, Pétur Orri Ragnarsson, Svanhvít Lilja Ingólfsdóttir, Haukur Páll Jónsson, Vilhjálmur Þorsteinsson, Hafsteinn Einarsson

To train the models we introduce a new corpus of Icelandic text, the Icelandic Common Crawl Corpus (IC3), a collection of high quality texts found online by targeting the Icelandic top-level-domain (TLD).

Constituency Parsing, Grammatical Error Detection, +4

Miðeind's WMT 2021 submission

no code implementations 15 Sep 2021 Haukur Barri Símonarson, Vésteinn Snæbjarnarson, Pétur Orri Ragnarsson, Haukur Páll Jónsson, Vilhjálmur Þorsteinsson

We present Miðeind's submission for the English→Icelandic and Icelandic→English subsets of the 2021 WMT news translation task.

Translation
