no code implementations • 10 Feb 2024 • Nefeli Gkouti, Prodromos Malakasiotis, Stavros Toumpis, Ion Androutsopoulos
However, the choice of optimizer during training has not been explored as extensively.
no code implementations • 10 Nov 2023 • Lefteris Loukas, Ilias Stogiannidis, Odysseas Diamantopoulos, Prodromos Malakasiotis, Stavros Vassos
Standard Full-Data classifiers in NLP demand thousands of labeled examples, which is impractical in data-limited domains.
1 code implementation • 20 Oct 2023 • Ilias Stogiannidis, Stavros Vassos, Prodromos Malakasiotis, Ion Androutsopoulos
We propose a framework that allows reducing the calls to LLMs by caching previous LLM responses and using them to train a local inexpensive model on the SME side.
Ranked #2 on Intent Detection on BANKING77
no code implementations • 28 Aug 2023 • Lefteris Loukas, Ilias Stogiannidis, Prodromos Malakasiotis, Stavros Vassos
We propose the use of conversational GPT models for easy and quick few-shot text classification in the financial domain using the Banking77 dataset.
no code implementations • 24 Oct 2022 • Stelios Maroudas, Sotiris Legkas, Prodromos Malakasiotis, Ilias Chalkidis
In the era of billion-parameter-sized Language Models (LMs), start-ups have to follow trends and adapt their technology accordingly.
no code implementations • 11 Oct 2022 • Ilias Chalkidis, Xiang Dai, Manos Fergadiotis, Prodromos Malakasiotis, Desmond Elliott
Non-hierarchical sparse attention Transformer-based models, such as Longformer and Big Bird, are popular approaches to working with long documents.
1 code implementation • BioNLP (ACL) 2022 • Dimitris Pappas, Prodromos Malakasiotis, Ion Androutsopoulos
We study the effect of seven data augmentation (da) methods in factoid question answering, focusing on the biomedical domain, where obtaining training instances is particularly difficult.
1 code implementation • ACL 2022 • Lefteris Loukas, Manos Fergadiotis, Ilias Chalkidis, Eirini Spyropoulou, Prodromos Malakasiotis, Ion Androutsopoulos, Georgios Paliouras
We, therefore, introduce XBRL tagging as a new entity extraction task for the financial domain and release FiNER-139, a dataset of 1. 1M sentences with gold XBRL tags.
no code implementations • 29 Sep 2021 • Nikolaos Manginas, Prodromos Malakasiotis, Eirini Spyropoulou, Ion Androutsopoulos, Georgios Paliouras
Black-box decision models have been widely adopted both in industry and academia due to their excellent performance across many challenging tasks and domains.
1 code implementation • EMNLP (ECONLP) 2021 • Lefteris Loukas, Manos Fergadiotis, Ion Androutsopoulos, Prodromos Malakasiotis
We use EDGAR-CORPUS to train and release EDGAR-W2V, which are WORD2VEC embeddings for the financial domain.
no code implementations • NAACL 2021 • Ilias Chalkidis, Manos Fergadiotis, Dimitrios Tsarapatsanis, Nikolaos Aletras, Ion Androutsopoulos, Prodromos Malakasiotis
We also release a new dataset comprising European Court of Human Rights cases, including annotations for paragraph-level rationales.
no code implementations • EACL 2021 • Ilias Chalkidis, Manos Fergadiotis, Nikolaos Manginas, Eva Katakalou, Prodromos Malakasiotis
Major scandals in corporate history have urged the need for regulatory compliance, where organizations need to ensure that their controls (processes) comply with relevant laws, regulations, and policies.
no code implementations • 12 Jan 2021 • Ilias Chalkidis, Manos Fergadiotis, Prodromos Malakasiotis, Ion Androutsopoulos
Morpho-syntactic features in the form of POS tag and token shape embeddings, as well as context-aware ELMO embeddings do not improve performance.
no code implementations • EMNLP (spnlp) 2020 • Nikolaos Manginas, Ilias Chalkidis, Prodromos Malakasiotis
Although BERT is widely used by the NLP community, little is known about its inner workings.
no code implementations • Findings of the Association for Computational Linguistics 2020 • Ilias Chalkidis, Manos Fergadiotis, Prodromos Malakasiotis, Nikolaos Aletras, Ion Androutsopoulos
Thus we propose a systematic investigation of the available strategies when applying BERT in specialised domains.
1 code implementation • EMNLP 2020 • Ilias Chalkidis, Manos Fergadiotis, Sotiris Kotitsas, Prodromos Malakasiotis, Nikolaos Aletras, Ion Androutsopoulos
Furthermore, we show that Transformer-based approaches outperform the state-of-the-art in two of the datasets, and we propose a new state-of-the-art method which combines BERT with LWANs.
Multi-Label Classification Multi Label Text Classification +5
1 code implementation • 27 Aug 2020 • John Koutsikakis, Ilias Chalkidis, Prodromos Malakasiotis, Ion Androutsopoulos
We expect these resources to boost NLP research and applications for modern Greek.
no code implementations • IJCNLP 2019 • Stratos Xenouleas, Prodromos Malakasiotis, Marianna Apidianaki, Ion Androutsopoulos
We propose SUM-QE, a novel Quality Estimation model for summarization based on BERT.
no code implementations • NeurIPS Workshop Document_Intelligen 2019 • Ilias Chalkidis, Manos Fergadiotis, Prodromos Malakasiotis, Ion Androutsopoulos
We investigate contract element extraction.
1 code implementation • 2 Sep 2019 • Stratos Xenouleas, Prodromos Malakasiotis, Marianna Apidianaki, Ion Androutsopoulos
We propose SumQE, a novel Quality Estimation model for summarization based on BERT.
1 code implementation • ACL 2019 • Ilias Chalkidis, Manos Fergadiotis, Prodromos Malakasiotis, Ion Androutsopoulos
We consider Large-Scale Multi-Label Text Classification (LMTC) in the legal domain.
Ranked #1 on Multi-Label Text Classification on EUR-Lex
no code implementations • WS 2019 • Ilias Chalkidis, Manos Fergadiotis, Prodromos Malakasiotis, Nikolaos Aletras, Ion Androutsopoulos
We consider the task of Extreme Multi-Label Text Classification (XMTC) in the legal domain.
no code implementations • EMNLP 2017 • John Pavlopoulos, Prodromos Malakasiotis, Ion Androutsopoulos
Experimenting with a new dataset of 1. 6M user comments from a news portal and an existing dataset of 115K Wikipedia talk page comments, we show that an RNN operating on word embeddings outpeforms the previous state of the art in moderation, which used logistic regression or an MLP classifier with character or word n-grams.
no code implementations • WS 2017 • John Pavlopoulos, Prodromos Malakasiotis, Juli Bakagianni, Ion Androutsopoulos
Experimenting with a dataset of approximately 1. 6M user comments from a Greek news sports portal, we explore how a state of the art RNN-based moderation method can be improved by adding user embeddings, user type embeddings, user biases, or user type biases.
no code implementations • WS 2017 • John Pavlopoulos, Prodromos Malakasiotis, Ion Androutsopoulos
We also compare against a CNN and a word-list baseline, considering both fully automatic and semi-automatic moderation.
no code implementations • SEMEVAL 2016 • Dionysios Xenos, Panagiotis Theodorakakos, John Pavlopoulos, Prodromos Malakasiotis, Ion Androutsopoulos
Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +2
no code implementations • LREC 2014 • Axel-Cyrille Ngonga Ngomo, Norman Heino, Ren{\'e} Speck, Prodromos Malakasiotis
We introduce the BIOASQ suite, a set of open-source Web tools for the creation, assessment and community-driven improvement of question answering benchmarks.
no code implementations • 18 Dec 2009 • Ion Androutsopoulos, Prodromos Malakasiotis
Paraphrasing methods recognize, generate, or extract phrases, sentences, or longer natural language expressions that convey almost the same information.