no code implementations • NAACL 2022 • Eyup Yilmaz, Cagri Toraman
Supervised training with cross-entropy loss implicitly forces models to produce probability distributions that follow a discrete delta distribution.
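The claim above can be illustrated numerically: minimizing cross-entropy against a one-hot label pushes the softmax output toward a delta distribution on the target class. Below is a minimal, self-contained sketch (not the paper's method, just the standard cross-entropy/softmax setup) showing that plain gradient descent on the logits collapses the predicted distribution onto the labeled class.

```python
import math

def softmax(logits):
    # numerically stable softmax
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    s = sum(exps)
    return [e / s for e in exps]

def sgd_step(logits, target_idx, lr=1.0):
    # gradient of cross-entropy w.r.t. logits is (p - y),
    # where y is the one-hot target
    p = softmax(logits)
    return [z - lr * (pi - (1.0 if i == target_idx else 0.0))
            for i, (z, pi) in enumerate(zip(logits, p))]

logits = [0.0, 0.0, 0.0]  # start from a uniform prediction
for _ in range(200):
    logits = sgd_step(logits, target_idx=0)
p = softmax(logits)
# p concentrates nearly all mass on class 0: a near-delta distribution
```

This is the behavior the paper's motivation refers to: the loss is only minimized as the probability of the labeled class approaches 1, regardless of how similar the other classes are.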
1 code implementation • 13 May 2024 • Cagri Toraman
Despite advancements in English-dominant generative large language models, further development is needed for low-resource languages to enhance global accessibility.
no code implementations • 3 Apr 2024 • Arianna Muti, Federico Ruggeri, Cagri Toraman, Lorenzo Musetti, Samuel Algherini, Silvia Ronchi, Gianmarco Saretto, Caterina Zapparoli, Alberto Barrón-Cedeño
Disambiguating the meaning of such terms might help the detection of misogyny.
no code implementations • 27 Jul 2023 • Izzet Emre Kucukkaya, Umitcan Sahin, Cagri Toraman
For the easy and medium setups, we submit a transition-focused natural language inference model based on DeBERTa with warmup training; for the hard setup, we submit the same model without the transition component.
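A transition-focused formulation can be sketched as pairing each paragraph with its successor, so the model judges each transition point rather than whole documents. The sketch below is an assumed framing for illustration only (the pairing helper and example document are hypothetical, not taken from the paper); the resulting pairs would be fed to an NLI-style classifier such as DeBERTa.

```python
def transition_pairs(paragraphs):
    # pair each paragraph with its successor so a classifier can judge
    # whether authorship/style changes at that transition (assumed framing)
    return [(paragraphs[i], paragraphs[i + 1])
            for i in range(len(paragraphs) - 1)]

doc = ["First author's opening paragraph.",
       "A paragraph whose style may have shifted.",
       "A closing paragraph."]
pairs = transition_pairs(doc)  # one (premise, hypothesis) pair per transition
```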
no code implementations • 27 Jul 2023 • Umitcan Sahin, Izzet Emre Kucukkaya, Cagri Toraman
In this paper, we describe our approach for the Trigger Detection shared task at PAN CLEF 2023, where we aim to detect multiple types of triggering content in a given Fanfiction document.
no code implementations • 25 Jul 2023 • Umitcan Sahin, Izzet Emre Kucukkaya, Oguzhan Ozcelik, Cagri Toraman
Throughout the Russia-Ukraine war, both opposing factions heavily relied on text-embedded images as a vehicle for spreading propaganda and hate speech.
1 code implementation • 26 Feb 2023 • Cagri Toraman, Izzet Emre Kucukkaya, Oguzhan Ozcelik, Umitcan Sahin
The importance of social media was highlighted once again by the recent tragedy of the 2023 Turkey-Syria earthquake.
1 code implementation • 11 Oct 2022 • Cagri Toraman, Oguzhan Ozcelik, Furkan Şahinuç, Fazli Can
Misinformation spread in online social networks is an urgent problem with harmful consequences that threaten human health, public safety, the economy, and more.
no code implementations • 19 Apr 2022 • Cagri Toraman, Eyup Halit Yilmaz, Furkan Şahinuç, Oguzhan Ozcelik
Furthermore, we find that increasing the vocabulary size improves the performance of Morphological and Word-level tokenizers more than that of de facto tokenizers.
1 code implementation • LREC 2022 • Cagri Toraman, Furkan Şahinuç, Eyup Halit Yilmaz
The experimental results supported by statistical tests show that Transformer-based language models outperform conventional bag-of-words and neural models by at least 5% in English and 10% in Turkish for large-scale hate speech detection.
no code implementations • 2 Sep 2021 • Eyup Halit Yilmaz, Cagri Toraman
To provide additional information about the query and improve intent detection, we propose ConQX, a method for the semantic expansion of spoken queries that utilizes the text generation ability of an auto-regressive language model, GPT-2.
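The prompt-then-append flow described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: the prompt template is hypothetical, and a stub callable stands in for GPT-2 so the sketch stays self-contained.

```python
def build_prompt(query):
    # hypothetical prompt template; the actual prompting scheme used
    # by ConQX is an assumption here
    return f"The user says: {query}. The user means:"

def expand_query(query, generate):
    # `generate` stands in for an auto-regressive LM such as GPT-2;
    # any callable mapping prompt text -> continuation works here
    expansion = generate(build_prompt(query))
    # the expanded query would feed the downstream intent classifier
    return f"{query} {expansion.strip()}"

# stub generator for illustration; a real setup would sample from GPT-2
result = expand_query("play jazz", lambda prompt: " some smooth jazz music")
```

In a real pipeline, `generate` would wrap an LM call, and the expanded query string would replace the raw query as input to the intent-detection model.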