Search Results for author: Mihai Dascalu

Found 16 papers, 2 papers with code

Transformer-based Multi-Task Learning for Adverse Effect Mention Analysis in Tweets

no code implementations • NAACL (SMM4H) 2021 • George-Andrei Dima, Dumitru-Clementin Cercel, Mihai Dascalu

This paper presents our contribution to the Social Media Mining for Health Applications Shared Task 2021.

Paper
Add Code

Interpretable Identification of Cybersecurity Vulnerabilities from News Articles

no code implementations • RANLP 2021 • Pierre Frode de la Foret, Stefan Ruseti, Cristian Sandescu, Mihai Dascalu, Sebastien Travadel

Various automated processing pipelines grounded in Natural Language Processing techniques for text classification were introduced for the early identification of vulnerabilities starting from Open-Source Intelligence (OSINT) data, including news websites, blogs, and social media.

text-classification Text Classification

Paper
Add Code

UPB at FinCausal-2020, Tasks 1 & 2: Causality Analysis in Financial Documents using Pretrained Language Models

1 code implementation • FNP (COLING) 2020 • Marius Ionescu, Andrei-Marius Avram, George-Andrei Dima, Dumitru-Clementin Cercel, Mihai Dascalu

Financial causality detection is centered on identifying connections between different assets from financial news in order to improve trading strategies.

Binary Classification

Paper
Code

UPB @ ACTI: Detecting Conspiracies using fine tuned Sentence Transformers

no code implementations • 28 Sep 2023 • Andrei Paraschiv, Mihai Dascalu

Conspiracy theories have become a prominent and concerning aspect of online discourse, posing challenges to information integrity and societal trust.

Binary Classification Classification +3

Paper
Add Code

TA-DA: Topic-Aware Domain Adaptation for Scientific Keyphrase Identification and Classification (Student Abstract)

no code implementations • 30 Dec 2022 • Răzvan-Alexandru Smădu, George-Eduard Zaharia, Andrei-Marius Avram, Dumitru-Clementin Cercel, Mihai Dascalu, Florin Pop

Keyphrase identification and classification is a Natural Language Processing and Information Retrieval task that involves extracting relevant groups of words from a given text related to the main topic.

Domain Adaptation Information Retrieval +3

Paper
Add Code

UPB at SemEval-2022 Task 5: Enhancing UNITER with Image Sentiment and Graph Convolutional Networks for Multimedia Automatic Misogyny Identification

1 code implementation • SemEval (NAACL) 2022 • Andrei Paraschiv, Mihai Dascalu, Dumitru-Clementin Cercel

In recent times, the detection of hate-speech, offensive, or abusive language in online media has become an important topic in NLP research due to the exponential growth of social media and the propagation of such messages, as well as their impact.

Abusive Language Hate Speech Detection

Paper
Code

Domain Adaptation in Multilingual and Multi-Domain Monolingual Settings for Complex Word Identification

no code implementations • ACL 2022 • George-Eduard Zaharia, Răzvan-Alexandru Smădu, Dumitru-Clementin Cercel, Mihai Dascalu

Our model obtains a boost of up to 2. 42% in terms of Pearson Correlation Coefficients in contrast to vanilla training techniques, when considering the CompLex from the Lexical Complexity Prediction 2021 dataset.

Complex Word Identification Domain Adaptation +2

Paper
Add Code

UPB at SemEval-2021 Task 5: Virtual Adversarial Training for Toxic Spans Detection

no code implementations • SEMEVAL 2021 • Andrei Paraschiv, Dumitru-Clementin Cercel, Mihai Dascalu

The real-world impact of polarization and toxicity in the online sphere marked the end of 2020 and the beginning of this year in a negative way.

Toxic Spans Detection

Paper
Add Code

UPB at SemEval-2021 Task 1: Combining Deep Learning and Hand-Crafted Features for Lexical Complexity Prediction

no code implementations • SEMEVAL 2021 • George-Eduard Zaharia, Dumitru-Clementin Cercel, Mihai Dascalu

Our models are applicable on both subtasks and achieve good performance results, with a MAE below 0. 07 and a Person correlation of . 73 for single word identification, as well as a MAE below 0. 08 and a Person correlation of . 79 for multiple word targets.

Lexical Complexity Prediction Word Embeddings

Paper
Add Code

UPB at SemEval-2021 Task 7: Adversarial Multi-Task Learning for Detecting and Rating Humor and Offense

no code implementations • SEMEVAL 2021 • Răzvan-Alexandru Smădu, Dumitru-Clementin Cercel, Mihai Dascalu

Detecting humor is a challenging task since words might share multiple valences and, depending on the context, the same words can be even used in offensive expressions.

Multi-Task Learning text-classification +1

Paper
Add Code

UPB at SemEval-2021 Task 8: Extracting Semantic Information on Measurements as Multi-Turn Question Answering

no code implementations • SEMEVAL 2021 • Andrei-Marius Avram, George-Eduard Zaharia, Dumitru-Clementin Cercel, Mihai Dascalu

Extracting semantic information on measurements and counts is an important topic in terms of analyzing scientific discourses.

Question Answering Relation Extraction

Paper
Add Code

RoBERT -- A Romanian BERT Model

no code implementations • COLING 2020 • Mihai Masala, Stefan Ruseti, Mihai Dascalu

Deep pre-trained language models tend to become ubiquitous in the field of Natural Language Processing (NLP).

Sentiment Analysis Transfer Learning

Paper
Add Code

Cross-Lingual Transfer Learning for Complex Word Identification

no code implementations • 2 Oct 2020 • George-Eduard Zaharia, Dumitru-Clementin Cercel, Mihai Dascalu

Our aim is to provide evidence that the proposed models can learn the characteristics of complex words in a multilingual environment by relying on the CWI shared task 2018 dataset available for four different languages (i. e., English, German, Spanish, and also French).

Complex Word Identification Cross-Lingual Transfer +3

Paper
Add Code

UPB at SemEval-2020 Task 11: Propaganda Detection with Domain-Specific Trained BERT

no code implementations • SEMEVAL 2020 • Andrei Paraschiv, Dumitru-Clementin Cercel, Mihai Dascalu

Manipulative and misleading news have become a commodity for some online news outlets and these news have gained a significant impact on the global mindset of people.

Propaganda span identification

Paper
Add Code

Romanian Diacritics Restoration Using Recurrent Neural Networks

no code implementations • 6 Sep 2020 • Stefan Ruseti, Teodor-Mihai Cotet, Mihai Dascalu

Diacritics restoration is a mandatory step for adequately processing Romanian texts, and not a trivial one, as you generally need context in order to properly restore a character.

Paper
Add Code

Building a Comprehensive Romanian Knowledge Base for Drug Administration

no code implementations • RANLP 2019 • Bogdan Nicula, Mihai Dascalu, Maria-Dorinela S{\^\i}rbu, {\textcommabelow{S}}tefan Tr{\u{a}}u{\textcommabelow{s}}an-Matu, Alex Nu{\textcommabelow{t}}{\u{a}}, ru

Information on drug administration is obtained traditionally from doctors and pharmacists, as well as leaflets which provide in most cases cumbersome and hard-to-follow details.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.