no code implementations • EAMT 2022 • Tamás Váradi, Marko Tadić, Svetla Koeva, Maciej Ogrodniczuk, Dan Tufiş, Radovan Garabík, Simon Krek, Andraž Repar
The work in progress on the CEF Action CURLICA T is presented.
no code implementations • EAMT 2022 • Artūrs Vasiļevskis, Jānis Ziediņš, Marko Tadić, None Željka Motika, Mark Fishel, Hrafn Loftsson, Jón Gu, Claudia Borg, Keith Cortis, Judie Attard, Donatienne Spiteri
The work in progress on the CEF Action National Language Technology Platform (NLTP) is presented.
no code implementations • TDLE (LREC) 2022 • Marko Tadić, Daša Farkaš, Matea Filko, Artūrs Vasiļevskis, Andrejs Vasiļjevs, Jānis Ziediņš, Željka Motika, Mark Fishel, Hrafn Loftsson, Jón Guðnason, Claudia Borg, Keith Cortis, Judie Attard, Donatienne Spiteri
This article presents the work in progress on the collaborative project of several European countries to develop National Language Technology Platform (NLTP).
no code implementations • LREC 2022 • Tamás Váradi, Bence Nyéki, Svetla Koeva, Marko Tadić, Vanja Štefanec, Maciej Ogrodniczuk, Bartłomiej Nitoń, Piotr Pęzik, Verginica Barbu Mititelu, Elena Irimia, Maria Mitrofan, Dan Tufiș, Radovan Garabík, Simon Krek, Andraž Repar
This article presents the current outcomes of the CURLICAT CEF Telecom project, which aims to collect and deeply annotate a set of large corpora from selected domains.
no code implementations • LREC (BUCC) 2022 • Diego Alves, Marko Tadić, Božo Bekavac
This article presents a comparative analysis of dependency parsing results for a set of 16 languages, coming from a large variety of linguistic families and genera, whose parallel corpora were used to train a deep-learning tool.
no code implementations • 2 Apr 2024 • Gaurish Thakkar, Sherzod Hakimov, Marko Tadić
In recent years, multimodal natural language processing, aimed at learning from diverse data types, has garnered significant attention.
no code implementations • 14 May 2023 • Gaurish Thakkar, Nives Mikelic Preradović, Marko Tadić
This article presents a sentence-level sentiment dataset for the Croatian news domain.
no code implementations • 14 May 2023 • Gaurish Thakkar, Nives Mikelic Preradovic, Marko Tadić
This paper introduces Cro-FiReDa, a sentiment- annotated dataset for Croatian in the domain of movie reviews.
no code implementations • 14 Dec 2022 • Diego Alves, Gaurish Thakkar, Marko Tadić
This article presents the application of the Universal Named Entity framework to generate automatically annotated corpora.
no code implementations • 14 Dec 2022 • Diego Alves, Gaurish Thakkar, Gabriel Amaral, Tin Kuculo, Marko Tadić
With the ever-growing popularity of the field of NLP, the demand for datasets in low resourced-languages follows suit.
no code implementations • LREC 2020 • Diego Alves, Gaurish Thakkar, Marko Tadić
Due to the differences in terms of availability of language resources for each language, we have built this strategy in three steps, starting with processing chains for the well-resourced languages and finishing with the development of new modules for the under-resourced ones.
no code implementations • LREC 2020 • Diego Alves, Gaurish Thakkar, Marko Tadić
We considered the difference between reported and our tested results within a single percentage point as being within the limits of acceptable tolerance and thus consider this result as reproducible.
no code implementations • LREC 2020 • Georg Rehm, Katrin Marheinecke, Stefanie Hegele, Stelios Piperidis, Kalina Bontcheva, Jan Hajič, Khalid Choukri, Andrejs Vasiļjevs, Gerhard Backfried, Christoph Prinz, José Manuel Gómez Pérez, Luc Meertens, Paul Lukowicz, Josef van Genabith, Andrea Lösch, Philipp Slusallek, Morten Irgens, Patrick Gatellier, Joachim köhler, Laure Le Bars, Dimitra Anastasiou, Albina Auksoriūtė, Núria Bel, António Branco, Gerhard Budin, Walter Daelemans, Koenraad De Smedt, Radovan Garabík, Maria Gavriilidou, Dagmar Gromann, Svetla Koeva, Simon Krek, Cvetana Krstev, Krister Lindén, Bernardo Magnini, Jan Odijk, Maciej Ogrodniczuk, Eiríkur Rögnvaldsson, Mike Rosner, Bolette Sandford Pedersen, Inguna Skadiņa, Marko Tadić, Dan Tufiş, Tamás Váradi, Kadri Vider, Andy Way, François Yvon
Multilingualism is a cultural cornerstone of Europe and firmly anchored in the European treaties including full language equality.