Search Results for author: Marko Tadić

Found 14 papers, 0 papers with code

Paper
Add Code

Curated Multilingual Language Resources for CEF AT (CURLICAT): overall view

no code implementations • EAMT 2022 • Tamás Váradi, Marko Tadić, Svetla Koeva, Maciej Ogrodniczuk, Dan Tufiş, Radovan Garabík, Simon Krek, Andraž Repar

The work in progress on the CEF Action CURLICA T is presented.

Paper
Add Code

National Language Technology Platform (NLTP): overall view

no code implementations • EAMT 2022 • Artūrs Vasiļevskis, Jānis Ziediņš, Marko Tadić, None Željka Motika, Mark Fishel, Hrafn Loftsson, Jón Gu, Claudia Borg, Keith Cortis, Judie Attard, Donatienne Spiteri

The work in progress on the CEF Action National Language Technology Platform (NLTP) is presented.

Paper
Add Code

National Language Technology Platform for Public Administration

no code implementations • TDLE (LREC) 2022 • Marko Tadić, Daša Farkaš, Matea Filko, Artūrs Vasiļevskis, Andrejs Vasiļjevs, Jānis Ziediņš, Željka Motika, Mark Fishel, Hrafn Loftsson, Jón Guðnason, Claudia Borg, Keith Cortis, Judie Attard, Donatienne Spiteri

This article presents the work in progress on the collaborative project of several European countries to develop National Language Technology Platform (NLTP).

Paper
Add Code

Introducing the CURLICAT Corpora: Seven-language Domain Specific Annotated Corpora from Curated Sources

no code implementations • LREC 2022 • Tamás Váradi, Bence Nyéki, Svetla Koeva, Marko Tadić, Vanja Štefanec, Maciej Ogrodniczuk, Bartłomiej Nitoń, Piotr Pęzik, Verginica Barbu Mititelu, Elena Irimia, Maria Mitrofan, Dan Tufiș, Radovan Garabík, Simon Krek, Andraž Repar

This article presents the current outcomes of the CURLICAT CEF Telecom project, which aims to collect and deeply annotate a set of large corpora from selected domains.

NMT

Paper
Add Code

Multilingual Comparative Analysis of Deep-Learning Dependency Parsing Results Using Parallel Corpora

no code implementations • LREC (BUCC) 2022 • Diego Alves, Marko Tadić, Božo Bekavac

This article presents a comparative analysis of dependency parsing results for a set of 16 languages, coming from a large variety of linguistic families and genera, whose parallel corpora were used to train a deep-learning tool.

Dependency Parsing Language Modelling

Paper
Add Code

M2SA: Multimodal and Multilingual Model for Sentiment Analysis of Tweets

no code implementations • 2 Apr 2024 • Gaurish Thakkar, Sherzod Hakimov, Marko Tadić

In recent years, multimodal natural language processing, aimed at learning from diverse data types, has garnered significant attention.

Language Modelling Large Language Model +1

Paper
Add Code

CroSentiNews 2.0: A Sentence-Level News Sentiment Corpus

no code implementations • 14 May 2023 • Gaurish Thakkar, Nives Mikelic Preradović, Marko Tadić

This article presents a sentence-level sentiment dataset for the Croatian news domain.

Sentence

Paper
Add Code

Croatian Film Review Dataset (Cro-FiReDa): A Sentiment Annotated Dataset of Film Reviews

no code implementations • 14 May 2023 • Gaurish Thakkar, Nives Mikelic Preradovic, Marko Tadić

This paper introduces Cro-FiReDa, a sentiment- annotated dataset for Croatian in the domain of movie reviews.

Sentence

Paper
Add Code

Building and Evaluating Universal Named-Entity Recognition English corpus

no code implementations • 14 Dec 2022 • Diego Alves, Gaurish Thakkar, Marko Tadić

This article presents the application of the Universal Named Entity framework to generate automatically annotated corpora.

named-entity-recognition Named Entity Recognition +1

Paper
Add Code

Building Multilingual Corpora for a Complex Named Entity Recognition and Classification Hierarchy using Wikipedia and DBpedia

no code implementations • 14 Dec 2022 • Diego Alves, Gaurish Thakkar, Gabriel Amaral, Tin Kuculo, Marko Tadić

With the ever-growing popularity of the field of NLP, the demand for datasets in low resourced-languages follows suit.

Named Entity Recognition Named Entity Recognition (NER)

Paper
Add Code

Natural Language Processing Chains Inside a Cross-lingual Event-Centric Knowledge Pipeline for European Union Under-resourced Languages

no code implementations • LREC 2020 • Diego Alves, Gaurish Thakkar, Marko Tadić

Due to the differences in terms of availability of language resources for each language, we have built this strategy in three steps, starting with processing chains for the well-resourced languages and finishing with the development of new modules for the under-resourced ones.

named-entity-recognition Named Entity Recognition +1

Paper
Add Code

Evaluating Language Tools for Fifteen EU-official Under-resourced Languages

no code implementations • LREC 2020 • Diego Alves, Gaurish Thakkar, Marko Tadić

We considered the difference between reported and our tested results within a single percentage point as being within the limits of acceptable tolerance and thus consider this result as reproducible.

Paper
Add Code

The European Language Technology Landscape in 2020: Language-Centric and Human-Centric AI for Cross-Cultural Communication in Multilingual Europe

no code implementations • LREC 2020 • Georg Rehm, Katrin Marheinecke, Stefanie Hegele, Stelios Piperidis, Kalina Bontcheva, Jan Hajič, Khalid Choukri, Andrejs Vasiļjevs, Gerhard Backfried, Christoph Prinz, José Manuel Gómez Pérez, Luc Meertens, Paul Lukowicz, Josef van Genabith, Andrea Lösch, Philipp Slusallek, Morten Irgens, Patrick Gatellier, Joachim köhler, Laure Le Bars, Dimitra Anastasiou, Albina Auksoriūtė, Núria Bel, António Branco, Gerhard Budin, Walter Daelemans, Koenraad De Smedt, Radovan Garabík, Maria Gavriilidou, Dagmar Gromann, Svetla Koeva, Simon Krek, Cvetana Krstev, Krister Lindén, Bernardo Magnini, Jan Odijk, Maciej Ogrodniczuk, Eiríkur Rögnvaldsson, Mike Rosner, Bolette Sandford Pedersen, Inguna Skadiņa, Marko Tadić, Dan Tufiş, Tamás Váradi, Kadri Vider, Andy Way, François Yvon

Multilingualism is a cultural cornerstone of Europe and firmly anchored in the European treaties including full language equality.

Misconceptions

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.