no code implementations • LREC 2020 • Aleksi Sahala, Miikka Silfverberg, Antti Arppe, Krister Lind{\'e}n
Akkadian is a fairly well resourced extinct language that does not yet have a comprehensive morphological analyzer available.
no code implementations • LREC 2020 • Heidi Jauhiainen, Tommi Jauhiainen, Krister Lind{\'e}n
Web corpora creation for minority languages that do not have their own top-level Internet domain is no trivial matter.
no code implementations • LREC 2020 • Aleksi Sahala, Miikka Silfverberg, Antti Arppe, Krister Lind{\'e}n
Several Akkadian text corpora contain only the transliterated text.
no code implementations • WS 2019 • Tommi Jauhiainen, Krister Lind{\'e}n, Heidi Jauhiainen
This paper describes the language identification systems used by the SUKI team in the Discriminating between the Mainland and Taiwan variation of Mandarin Chinese (DMT) and the German Dialect Identification (GDI) shared tasks which were held as part of the third VarDial Evaluation Campaign.
no code implementations • COLING 2018 • Tommi Jauhiainen, Heidi Jauhiainen, Krister Lind{\'e}n
This paper presents the experiments and results obtained by the SUKI team in the Indo-Aryan Language Identification shared task of the VarDial 2018 Evaluation Campaign.
no code implementations • COLING 2018 • Tommi Jauhiainen, Heidi Jauhiainen, Krister Lind{\'e}n
This paper presents the experiments and results obtained by the SUKI team in the Discriminating between Dutch and Flemish in Subtitles shared task of the VarDial 2018 Evaluation Campaign.
no code implementations • COLING 2018 • Tommi Jauhiainen, Heidi Jauhiainen, Krister Lind{\'e}n
In this paper we present the experiments and results by the SUKI team in the German Dialect Identification shared task of the VarDial 2018 Evaluation Campaign.
no code implementations • WS 2017 • Tommi Jauhiainen, Krister Lind{\'e}n, Heidi Jauhiainen
In this paper we describe the non-linear mappings we used with the Helsinki language identification method, HeLI, in the 4th edition of the Discriminating between Similar Languages (DSL) shared task, which was organized as part of the VarDial 2017 workshop.
1 code implementation • WS 2016 • Tommi Jauhiainen, Krister Lind{\'e}n, Heidi Jauhiainen
The shared task comprised of a total of 8 tracks, of which we participated in 7.
no code implementations • LREC 2014 • Dimitrios Kokkinakis, Jyrki Niemi, Sam Hardwick, Krister Lind{\'e}n, Lars Borin
Named entity recognition (NER) is a knowledge-intensive information extraction task that is used for recognizing textual mentions of entities that belong to a predefined set of categories, such as locations, organizations and time expressions.
no code implementations • LREC 2014 • Senka Drobac, Krister Lind{\'e}n, Tommi Pirinen, Miikka Silfverberg
The most noticeable reduction in size we got with a morphological transducer for Greenlandic, whose original size is on average about 15 times larger than other morphologies.
no code implementations • LREC 2014 • Georg Rehm, Hans Uszkoreit, Sophia Ananiadou, N{\'u}ria Bel, Audron{\.e} Bielevi{\v{c}}ien{\.e}, Lars Borin, Ant{\'o}nio Branco, Gerhard Budin, Nicoletta Calzolari, Walter Daelemans, Radovan Garab{\'\i}k, Marko Grobelnik, Carmen Garc{\'\i}a-Mateo, Josef van Genabith, Jan Haji{\v{c}}, Inma Hern{\'a}ez, John Judge, Svetla Koeva, Simon Krek, Cvetana Krstev, Krister Lind{\'e}n, Bernardo Magnini, Joseph Mariani, John McNaught, Maite Melero, Monica Monachini, Asunci{\'o}n Moreno, Jan Odijk, Maciej Ogrodniczuk, Piotr P{\k{e}}zik, Stelios Piperidis, Adam Przepi{\'o}rkowski, Eir{\'\i}kur R{\"o}gnvaldsson, Michael Rosner, Bolette Pedersen, Inguna Skadi{\c{n}}a, Koenraad De Smedt, Marko Tadi{\'c}, Paul Thompson, Dan Tufi{\c{s}}, Tam{\'a}s V{\'a}radi, Andrejs Vasi{\c{l}}jevs, Kadri Vider, Jolanta Zabarskaite
This article provides an overview of the dissemination work carried out in META-NET from 2010 until early 2014; we describe its impact on the regional, national and international level, mainly with regard to politics and the situation of funding for LT topics.
no code implementations • LREC 2014 • Koenraad De Smedt, Erhard Hinrichs, Detmar Meurers, Inguna Skadi{\c{n}}a, Bolette Pedersen, Costanza Navarretta, N{\'u}ria Bel, Krister Lind{\'e}n, Mark{\'e}ta Lopatkov{\'a}, Jan Haji{\v{c}}, Gisle Andersen, Przemyslaw Lenkiewicz
CLARA (Common Language Resources and Their Applications) is a Marie Curie Initial Training Network which ran from 2009 until 2014 with the aim of providing researcher training in crucial areas related to language resources and infrastructure.
no code implementations • WS 2013 • S, Bolette ford Pedersen, Lars Borin, Markus Forsberg, Neeme Kahusk, Krister Lind{\'e}n, Jyrki Niemi, Niklas Nisbeth, Lars Nygaard, Heili Orav, Eirikur R{\"o}gnvaldsson, Mitchell Seaton, Kadri Vider, Kaarlo Voionmaa
no code implementations • LREC 2012 • Atro Voutilainen, Kristiina Muhonen, Tanja Purtonen, Krister Lind{\'e}n
We argue for use of large descriptive grammars and their sample sentences as a basis for specifying higher-coverage grammatical representations.
no code implementations • LREC 2012 • Andrejs Vasi{\c{l}}jevs, Markus Forsberg, Tatiana Gornostay, Dorte Haltrup Hansen, Krist{\'\i}n J{\'o}hannsd{\'o}ttir, Gunn Lyse, Krister Lind{\'e}n, Lene Offersgaard, Sussi Olsen, Bolette Pedersen, Eir{\'\i}kur R{\"o}gnvaldsson, Inguna Skadi{\c{n}}a, Koenraad De Smedt, Ville Oksanen, Roberts Rozis
The META-NORD project has contributed to an open infrastructure for language resources (data and tools) under the META-NET umbrella.
no code implementations • LREC 2012 • Jyrki Niemi, Krister Lind{\'e}n
FiWN was created by translating all the word senses of the Princeton WordNet (PWN) into Finnish and by joining the translations with the semantic and lexical relations of PWN extracted into a relational (database) format.