no code implementations • LREC 2022 • Teemu Vahtola, Eetu Sjöblom, Jörg Tiedemann, Mathias Creutz
Noisy labels in training data present a challenging issue in classification tasks, misleading a model towards incorrect decisions during training.
no code implementations • WNUT (ACL) 2021 • Teemu Vahtola, Mathias Creutz, Eetu Sjöblom, Sami Itkonen
We present new state-of-the-art benchmarks for paraphrase detection on all six languages in the Opusparcus sentential paraphrase corpus: English, Finnish, French, German, Russian, and Swedish.
no code implementations • NoDaLiDa 2021 • Eetu Sjöblom, Mathias Creutz, Teemu Vahtola
We perform neural machine translation of sentence fragments in order to create large amounts of training data for English grammatical error correction.
no code implementations • WS 2018 • Eetu Sjöblom, Mathias Creutz, Mikko Aulamo
We perform automatic paraphrase detection on subtitle data from the Opusparcus corpus comprising six European languages: German, English, Finnish, French, Russian, and Swedish.