Search Results for author: Cagri Toraman

Found 11 papers, 4 papers with code

D2U: Distance-to-Uniform Learning for Out-of-Scope Detection

no code implementations NAACL 2022 Eyup Yilmaz, Cagri Toraman

Supervised training with cross-entropy loss implicitly forces models to produce probability distributions that follow a discrete delta distribution.

LlamaTurk: Adapting Open-Source Generative Large Language Models for Low-Resource Language

1 code implementation13 May 2024 Cagri Toraman

Despite advancements in English-dominant generative large language models, further development is needed for low-resource languages to enhance global accessibility.

ARC-NLP at PAN 2023: Transition-Focused Natural Language Inference for Writing Style Detection

no code implementations27 Jul 2023 Izzet Emre Kucukkaya, Umitcan Sahin, Cagri Toraman

For the easy and medium setups, we submit transition-focused natural language inference based on DeBERTa with warmup training, and the same model without transition for the hard setup.

Natural Language Inference

ARC-NLP at PAN 2023: Hierarchical Long Text Classification for Trigger Detection

no code implementations27 Jul 2023 Umitcan Sahin, Izzet Emre Kucukkaya, Cagri Toraman

In this paper, we describe our approach for the Trigger Detection shared task at PAN CLEF 2023, where we want to detect multiple triggering content in a given Fanfiction document.

text-classification Text Classification

Tweets Under the Rubble: Detection of Messages Calling for Help in Earthquake Disaster

1 code implementation26 Feb 2023 Cagri Toraman, Izzet Emre Kucukkaya, Oguzhan Ozcelik, Umitcan Sahin

The importance of social media is again exposed in the recent tragedy of the 2023 Turkey and Syria earthquake.

Not Good Times for Lies: Misinformation Detection on the Russia-Ukraine War, COVID-19, and Refugees

1 code implementation11 Oct 2022 Cagri Toraman, Oguzhan Ozcelik, Furkan Şahinuç, Fazli Can

Misinformation spread in online social networks is an urgent-to-solve problem having harmful consequences that threaten human health, public safety, economics, and so on.

Descriptive Misinformation

Impact of Tokenization on Language Models: An Analysis for Turkish

no code implementations19 Apr 2022 Cagri Toraman, Eyup Halit Yilmaz, Furkan Şahinuç, Oguzhan Ozcelik

Furthermore, we find that increasing the vocabulary size improves the performance of Morphological and Word-level tokenizers more than that of de facto tokenizers.

Large-Scale Hate Speech Detection with Cross-Domain Transfer

1 code implementation LREC 2022 Cagri Toraman, Furkan Şahinuç, Eyup Halit Yilmaz

The experimental results supported by statistical tests show that Transformer-based language models outperform conventional bag-of-words and neural models by at least 5% in English and 10% in Turkish for large-scale hate speech detection.

Hate Speech Detection Transfer Learning

ConQX: Semantic Expansion of Spoken Queries for Intent Detection based on Conditioned Text Generation

no code implementations2 Sep 2021 Eyup Halit Yilmaz, Cagri Toraman

To provide additional information regarding the query and enhance the performance of intent detection, we propose a method for semantic expansion of spoken queries, called ConQX, which utilizes the text generation ability of an auto-regressive language model, GPT-2.

Few-Shot Learning Intent Detection +2

Cannot find the paper you are looking for? You can Submit a new open access paper.