Search Results for author: Andrei-Marius Avram

Found 26 papers, 8 papers with code

UPB at FinCausal-2020, Tasks 1 & 2: Causality Analysis in Financial Documents using Pretrained Language Models

1 code implementation • FNP (COLING) 2020 • Marius Ionescu, Andrei-Marius Avram, George-Andrei Dima, Dumitru-Clementin Cercel, Mihai Dascalu

Financial causality detection is centered on identifying connections between different assets from financial news in order to improve trading strategies.

Binary Classification

Paper
Code

RACAI@SMM4H’22: Tweets Disease Mention Detection Using a Neural Lateral Inhibitory Mechanism

no code implementations • SMM4H (COLING) 2022 • Andrei-Marius Avram, Vasile Pais, Maria Mitrofan

This paper presents our system employed for the Social Media Mining for Health (SMM4H) 2022 competition Task 10 - SocialDisNER.

Paper
Add Code

Approaching SMM4H 2020 with Ensembles of BERT Flavours

no code implementations • SMM4H (COLING) 2020 • George-Andrei Dima, Andrei-Marius Avram, Dumitru-Clementin Cercel

This paper describes our solutions submitted to the Social Media Mining for Health Applications (#SMM4H) Shared Task 2020.

Task 2

Paper
Add Code

A Customizable WordNet Editor

no code implementations • CLIB 2020 • Andrei-Marius Avram, Verginica Barbu Mititelu

This paper presents an open-source wordnet editor that has been developed to ensure further expansion of the Romanian wordnet.

Paper
Add Code

Use Case: Romanian Language Resources in the LOD Paradigm

no code implementations • LDL (ACL) 2022 • Verginica Barbu Mititelu, Elena Irimia, Vasile Pais, Andrei-Marius Avram, Maria Mitrofan

In this paper, we report on (i) the conversion of Romanian language resources to the Linked Open Data specifications and requirements, on (ii) their publication and (iii) interlinking with other language resources (for Romanian or for other languages).

Word Embeddings

Paper
Add Code

Romanian Language Translation in the RELATE Platform

no code implementations • loresmt (COLING) 2022 • Vasile Pais, Maria Mitrofan, Andrei-Marius Avram

This paper presents the usage of the RELATE platform for translation tasks involving the Romanian language.

Translation

Paper
Add Code

Dialect Identification through Adversarial Learning and Knowledge Distillation on Romanian BERT

no code implementations • EACL (VarDial) 2021 • George-Eduard Zaharia, Andrei-Marius Avram, Dumitru-Clementin Cercel, Traian Rebedea

Dialect identification is a task with applicability in a vast array of domains, ranging from automatic speech recognition to opinion mining.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Add Code

Exploring the Power of Romanian BERT for Dialect Identification

no code implementations • VarDial (COLING) 2020 • George-Eduard Zaharia, Andrei-Marius Avram, Dumitru-Clementin Cercel, Traian Rebedea

Dialect identification represents a key aspect for improving a series of tasks, for example, opinion mining, considering that the location of the speaker can greatly influence the attitude towards a subject.

Dialect Identification Opinion Mining

Paper
Add Code

HistNERo: Historical Named Entity Recognition for the Romanian Language

no code implementations • 30 Apr 2024 • Andrei-Marius Avram, Andreea Iuga, George-Vlad Manolache, Vlad-Cristian Matei, Răzvan-Gabriel Micliuş, Vlad-Andrei Muntean, Manuel-Petru Sorlescu, Dragoş-Andrei Şerban, Adrian-Dinu Urse, Vasile Păiş, Dumitru-Clementin Cercel

This work introduces HistNERo, the first Romanian corpus for Named Entity Recognition (NER) in historical newspapers.

Domain Adaptation named-entity-recognition +2

Paper
Add Code

End-to-End Lip Reading in Romanian with Cross-Lingual Domain Adaptation and Lateral Inhibition

no code implementations • 7 Oct 2023 • Emilian-Claudiu Mănescu, Răzvan-Alexandru Smădu, Andrei-Marius Avram, Dumitru-Clementin Cercel, Florin Pop

Lip reading or visual speech recognition has gained significant attention in recent years, particularly because of hardware development and innovations in computer vision.

Domain Adaptation Lip Reading +2

Paper
Add Code

Towards Improving the Performance of Pre-Trained Speech Models for Low-Resource Languages Through Lateral Inhibition

no code implementations • 30 Jun 2023 • Andrei-Marius Avram, Răzvan-Alexandru Smădu, Vasile Păiş, Dumitru-Clementin Cercel, Radu Ion, Dan Tufiş

With the rise of bidirectional encoder representations from Transformer models in natural language processing, the speech community has adopted some of their development methodologies.

Paper
Add Code

Multilingual Multiword Expression Identification Using Lateral Inhibition and Domain Adaptation

no code implementations • 17 Jun 2023 • Andrei-Marius Avram, Verginica Barbu Mititelu, Vasile Păiş, Dumitru-Clementin Cercel, Ştefan Trăuşan-Matu

Correctly identifying multiword expressions (MWEs) is an important task for most natural language processing systems since their misidentification can result in ambiguity and misunderstanding of the underlying text.

Domain Adaptation

Paper
Add Code

Adversarial Capsule Networks for Romanian Satire Detection and Sentiment Analysis

no code implementations • 13 Jun 2023 • Sebastian-Vasile Echim, Răzvan-Alexandru Smădu, Andrei-Marius Avram, Dumitru-Clementin Cercel, Florin Pop

Satire detection and sentiment analysis are intensively explored natural language processing (NLP) tasks that study the identification of the satirical tone from texts and extracting sentiments in relationship with their targets.

Satire Detection Sentiment Analysis

Paper
Add Code

RoBERTweet: A BERT Language Model for Romanian Tweets

no code implementations • 11 Jun 2023 • Iulian-Marius Tăiatu, Andrei-Marius Avram, Dumitru-Clementin Cercel, Florin Pop

Developing natural language processing (NLP) systems for social media analysis remains an important topic in artificial intelligence research.

Language Identification Language Modelling +2

Paper
Add Code

Romanian Multiword Expression Detection Using Multilingual Adversarial Training and Lateral Inhibition

no code implementations • 22 Apr 2023 • Andrei-Marius Avram, Verginica Barbu Mititelu, Dumitru-Clementin Cercel

Multiword expressions are a key ingredient for developing large-scale and linguistically sound natural language processing technology.

Paper
Add Code

TA-DA: Topic-Aware Domain Adaptation for Scientific Keyphrase Identification and Classification (Student Abstract)

no code implementations • 30 Dec 2022 • Răzvan-Alexandru Smădu, George-Eduard Zaharia, Andrei-Marius Avram, Dumitru-Clementin Cercel, Mihai Dascalu, Florin Pop

Keyphrase identification and classification is a Natural Language Processing and Information Retrieval task that involves extracting relevant groups of words from a given text related to the main topic.

Domain Adaptation Information Retrieval +3

Paper
Add Code

An Open-Domain QA System for e-Governance

no code implementations • CLIB 2022 • Radu Ion, Andrei-Marius Avram, Vasile Păiş, Maria Mitrofan, Verginica Barbu Mititelu, Elena Irimia, Valentin Badea

The paper will present the QA system and its integration with the Romanian language technologies portal RELATE, the COVID-19 data set and different evaluations of the QA performance.

Open-Domain Question Answering

Paper
Add Code

Distilling the Knowledge of Romanian BERTs Using Multiple Teachers

1 code implementation • LREC 2022 • Andrei-Marius Avram, Darius Catrina, Dumitru-Clementin Cercel, Mihai Dascălu, Traian Rebedea, Vasile Păiş, Dan Tufiş

In this work, we introduce three light and fast versions of distilled BERT models for the Romanian language: Distil-BERT-base-ro, Distil-RoBERT-base, and DistilMulti-BERT-base-ro.

Dialect Identification Knowledge Distillation +9

Paper
Code

Romanian Speech Recognition Experiments from the ROBIN Project

1 code implementation • 23 Nov 2021 • Andrei-Marius Avram, Vasile Păiş, Dan Tufiş

One of the fundamental functionalities for accepting a socially assistive robot is its communication capabilities with other agents in the environment.

Language Modelling speech-recognition +1

Paper
Code

Human-Machine Interaction Speech Corpus from the ROBIN project

no code implementations • 22 Nov 2021 • Vasile Păiş, Radu Ion, Andrei-Marius Avram, Elena Irimia, Verginica Barbu Mititelu, Maria Mitrofan

The paper contains a detailed description of the acquisition process, corpus statistics as well as an evaluation of the corpus influence on a low-latency ASR system as well as a dialogue component.