1 code implementation • FNP (COLING) 2020 • Marius Ionescu, Andrei-Marius Avram, George-Andrei Dima, Dumitru-Clementin Cercel, Mihai Dascalu
Financial causality detection is centered on identifying connections between different assets from financial news in order to improve trading strategies.
no code implementations • SMM4H (COLING) 2022 • Andrei-Marius Avram, Vasile Pais, Maria Mitrofan
This paper presents our system employed for the Social Media Mining for Health (SMM4H) 2022 competition Task 10 - SocialDisNER.
no code implementations • SMM4H (COLING) 2020 • George-Andrei Dima, Andrei-Marius Avram, Dumitru-Clementin Cercel
This paper describes our solutions submitted to the Social Media Mining for Health Applications (#SMM4H) Shared Task 2020.
no code implementations • CLIB 2020 • Andrei-Marius Avram, Verginica Barbu Mititelu
This paper presents an open-source wordnet editor that has been developed to ensure further expansion of the Romanian wordnet.
no code implementations • LDL (ACL) 2022 • Verginica Barbu Mititelu, Elena Irimia, Vasile Pais, Andrei-Marius Avram, Maria Mitrofan
In this paper, we report on (i) the conversion of Romanian language resources to the Linked Open Data specifications and requirements, on (ii) their publication and (iii) interlinking with other language resources (for Romanian or for other languages).
no code implementations • loresmt (COLING) 2022 • Vasile Pais, Maria Mitrofan, Andrei-Marius Avram
This paper presents the usage of the RELATE platform for translation tasks involving the Romanian language.
no code implementations • EACL (VarDial) 2021 • George-Eduard Zaharia, Andrei-Marius Avram, Dumitru-Clementin Cercel, Traian Rebedea
Dialect identification is a task with applicability in a vast array of domains, ranging from automatic speech recognition to opinion mining.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +4
no code implementations • VarDial (COLING) 2020 • George-Eduard Zaharia, Andrei-Marius Avram, Dumitru-Clementin Cercel, Traian Rebedea
Dialect identification represents a key aspect for improving a series of tasks, for example, opinion mining, considering that the location of the speaker can greatly influence the attitude towards a subject.
no code implementations • 30 Apr 2024 • Andrei-Marius Avram, Andreea Iuga, George-Vlad Manolache, Vlad-Cristian Matei, Răzvan-Gabriel Micliuş, Vlad-Andrei Muntean, Manuel-Petru Sorlescu, Dragoş-Andrei Şerban, Adrian-Dinu Urse, Vasile Păiş, Dumitru-Clementin Cercel
This work introduces HistNERo, the first Romanian corpus for Named Entity Recognition (NER) in historical newspapers.
no code implementations • 7 Oct 2023 • Emilian-Claudiu Mănescu, Răzvan-Alexandru Smădu, Andrei-Marius Avram, Dumitru-Clementin Cercel, Florin Pop
Lip reading or visual speech recognition has gained significant attention in recent years, particularly because of hardware development and innovations in computer vision.
no code implementations • 30 Jun 2023 • Andrei-Marius Avram, Răzvan-Alexandru Smădu, Vasile Păiş, Dumitru-Clementin Cercel, Radu Ion, Dan Tufiş
With the rise of bidirectional encoder representations from Transformer models in natural language processing, the speech community has adopted some of their development methodologies.
no code implementations • 17 Jun 2023 • Andrei-Marius Avram, Verginica Barbu Mititelu, Vasile Păiş, Dumitru-Clementin Cercel, Ştefan Trăuşan-Matu
Correctly identifying multiword expressions (MWEs) is an important task for most natural language processing systems since their misidentification can result in ambiguity and misunderstanding of the underlying text.
no code implementations • 13 Jun 2023 • Sebastian-Vasile Echim, Răzvan-Alexandru Smădu, Andrei-Marius Avram, Dumitru-Clementin Cercel, Florin Pop
Satire detection and sentiment analysis are intensively explored natural language processing (NLP) tasks that study the identification of the satirical tone from texts and extracting sentiments in relationship with their targets.
no code implementations • 11 Jun 2023 • Iulian-Marius Tăiatu, Andrei-Marius Avram, Dumitru-Clementin Cercel, Florin Pop
Developing natural language processing (NLP) systems for social media analysis remains an important topic in artificial intelligence research.
no code implementations • 22 Apr 2023 • Andrei-Marius Avram, Verginica Barbu Mititelu, Dumitru-Clementin Cercel
Multiword expressions are a key ingredient for developing large-scale and linguistically sound natural language processing technology.
no code implementations • 30 Dec 2022 • Răzvan-Alexandru Smădu, George-Eduard Zaharia, Andrei-Marius Avram, Dumitru-Clementin Cercel, Mihai Dascalu, Florin Pop
Keyphrase identification and classification is a Natural Language Processing and Information Retrieval task that involves extracting relevant groups of words from a given text related to the main topic.
no code implementations • CLIB 2022 • Radu Ion, Andrei-Marius Avram, Vasile Păiş, Maria Mitrofan, Verginica Barbu Mititelu, Elena Irimia, Valentin Badea
The paper will present the QA system and its integration with the Romanian language technologies portal RELATE, the COVID-19 data set and different evaluations of the QA performance.
1 code implementation • LREC 2022 • Andrei-Marius Avram, Darius Catrina, Dumitru-Clementin Cercel, Mihai Dascălu, Traian Rebedea, Vasile Păiş, Dan Tufiş
In this work, we introduce three light and fast versions of distilled BERT models for the Romanian language: Distil-BERT-base-ro, Distil-RoBERT-base, and DistilMulti-BERT-base-ro.
1 code implementation • 23 Nov 2021 • Andrei-Marius Avram, Vasile Păiş, Dan Tufiş
One of the fundamental functionalities for accepting a socially assistive robot is its communication capabilities with other agents in the environment.
no code implementations • 22 Nov 2021 • Vasile Păiş, Radu Ion, Andrei-Marius Avram, Elena Irimia, Verginica Barbu Mititelu, Maria Mitrofan
The paper contains a detailed description of the acquisition process, corpus statistics as well as an evaluation of the corpus influence on a low-latency ASR system as well as a dialogue component.
2 code implementations • RANLP 2021 • Andrei-Marius Avram, Vasile Pais, Dan Tufis
EuroVoc is a multilingual thesaurus that was built for organizing the legislative documentary of the European Union institutions.
no code implementations • SEMEVAL 2021 • Andrei-Marius Avram, George-Eduard Zaharia, Dumitru-Clementin Cercel, Mihai Dascalu
Extracting semantic information on measurements and counts is an important topic in terms of analyzing scientific discourses.
1 code implementation • Findings of the Association for Computational Linguistics 2020 • Stefan Daniel Dumitrescu, Andrei-Marius Avram, Sampo Pyysalo
Large-scale pretrained language models have become ubiquitous in Natural Language Processing.
3 code implementations • SEMEVAL 2020 • Andrei-Marius Avram, Dumitru-Clementin Cercel, Costin-Gabriel Chiru
This work presents our contribution in the context of the 6th task of SemEval-2020: Extracting Definitions from Free Text in Textbooks (DeftEval).
1 code implementation • LREC 2020 • Stefan Daniel Dumitrescu, Andrei-Marius Avram
We present RONEC - the Named Entity Corpus for the Romanian language.
2 code implementations • 3 Sep 2019 • Stefan Daniel Dumitrescu, Andrei-Marius Avram
We present RONEC - the Named Entity Corpus for the Romanian language.