1 code implementation • WASSA (ACL) 2022 • Leo Obadić, Martin Tutek, Jan Šnajder
Twitter has slowly but surely established itself as a forum for disseminating, analysing and promoting NLP research.
no code implementations • 18 Apr 2024 • Laura Majer, Jan Šnajder
The increasing threat of disinformation calls for automating parts of the fact-checking pipeline.
no code implementations • 31 Mar 2024 • Josip Jukić, Jan Šnajder
Enhancing generalization and uncertainty quantification in pre-trained language models (PLMs) is crucial for their effectiveness and reliability.
no code implementations • 1 Mar 2024 • Jana Juroš, Laura Majer, Jan Šnajder
Drawing parallels with annotation paradigms for subjective tasks, we explore the influence of prompt design on the performance of LLMs for TSA of news headlines.
no code implementations • 20 Feb 2024 • Ivan Rep, David Dukić, Jan Šnajder
While BERT produces high-quality sentence embeddings, its pre-training computational cost is a significant drawback.
no code implementations • 25 Jan 2024 • David Dukić, Jan Šnajder
While fine-tuned MLM-based encoders consistently outperform causal language modeling decoders of comparable size, recent decoder-only large language models (LLMs) perform on par with smaller MLM-based encoders.
1 code implementation • 4 Oct 2023 • Fran Jelenić, Josip Jukić, Martin Tutek, Mate Puljiz, Jan Šnajder
Effective out-of-distribution (OOD) detection is crucial for reliable machine learning models, yet most current methods are limited in practical use due to requirements like access to training data or intervention in training.
Out-of-Distribution Detection Out of Distribution (OOD) Detection +2
1 code implementation • 23 May 2023 • David Dukić, Kiril Gashteovski, Goran Glavaš, Jan Šnajder
We address the problem of negative transfer in TD by coupling triggers between domains using subject-object relations obtained from a rule-based open information extraction (OIE) system.
1 code implementation • 23 May 2023 • Josip Jukić, Jan Šnajder
Pre-trained language models (PLMs) have ignited a surge in demand for effective fine-tuning techniques, particularly in low-resource domains and languages.
no code implementations • 20 May 2023 • Zoran Medić, Jan Šnajder
Citation recommendation (CR) models may help authors find relevant articles at various stages of the paper writing process.
1 code implementation • 16 May 2023 • Fran Jelenić, Josip Jukić, Nina Drobac, Jan Šnajder
We link the AL dataset transferability to the similarity of instances queried by the different PLMs and show that AL methods with similar acquisition sequences produce highly transferable datasets regardless of the models used.
no code implementations • 22 Feb 2023 • Domagoj Pluščec, Jan Šnajder
Data scarcity is a problem that occurs in languages and tasks where we do not have large amounts of labeled data but want to use state-of-the-art models.
no code implementations • 1 Feb 2023 • Josip Jukić, Iva Vukojević, Jan Šnajder
Expressing attitude or stance toward entities and concepts is an integral part of human behavior and personality.
no code implementations • 20 Dec 2022 • Josip Jukić, Jan Šnajder
Developed to alleviate prohibitive labeling costs, active learning (AL) methods aim to reduce label complexity in supervised learning.
no code implementations • 15 Nov 2022 • Josip Jukić, Martin Tutek, Jan Šnajder
By connecting our findings to instance categories based on training dynamics, we show that the agreement of saliency method explanations is very low for easy-to-learn instances.
no code implementations • 11 Nov 2022 • Josip Jukić, Fran Jelenić, Miroslav Bićanić, Jan Šnajder
Supervised machine learning has become the cornerstone of today's data-driven society, increasing the need for labeled data.
1 code implementation • sdp (COLING) 2022 • Zoran Medić, Jan Šnajder
As a remedy for the limitations of the existing benchmarks, we propose a new benchmark dataset for evaluating scientific article representations: Multi-Domain Citation Recommendation dataset (MDCR), which covers different scientific fields and contains challenging candidate pools.
1 code implementation • 11 Dec 2020 • Damir Korenčić, Strahil Ristov, Jelena Repar, Jan Šnajder
When topic models are used for discovery of topics in text collections, a question that arises naturally is how well the model-induced topics correspond to topics of interest to the analyst.
no code implementations • WS 2020 • Martin Tutek, Jan Šnajder
The attention mechanism has quickly become ubiquitous in NLP.
no code implementations • NAACL (SocialNLP) 2021 • Matej Gjurković, Mladen Karan, Iva Vukojević, Mihaela Bošnjak, Jan Šnajder
Personality and demographics are important variables in social sciences, while in NLP they can aid in interpretability and removal of societal biases.
no code implementations • 12 Nov 2018 • Ivan Sekulić, Matej Gjurković, Jan Šnajder
Bipolar disorder, an illness characterized by manic and depressive episodes, affects more than 60 million people worldwide.
no code implementations • WS 2018 • Martin Tutek, Jan Šnajder
Natural language processing has greatly benefited from the introduction of the attention mechanism.
no code implementations • Thirty-Second AAAI Conference on Artificial Intelligence 2018 • Domagoj Alagić, Jan Šnajder, Sebastian Padó
Word sense induction is the most prominent unsupervised approach to lexical disambiguation.
no code implementations • 31 Dec 2016 • Jan Šnajder
Argumentation mining from social media content has attracted increasing attention.