Search Results for author: Samia Touileb

Found 24 papers, 13 papers with code

LTG-ST at NADI Shared Task 1: Arabic Dialect Identification using a Stacking Classifier

no code implementations • COLING (WANLP) 2020 • Samia Touileb

This paper presents our results for the Nuanced Arabic Dialect Identification (NADI) shared task of the Fifth Workshop for Arabic Natural Language Processing (WANLP 2020).

Dialect Identification regression

Paper
Add Code

Using Gender- and Polarity-Informed Models to Investigate Bias

no code implementations • ACL (GeBNLP) 2021 • Samia Touileb, Lilja Øvrelid, Erik Velldal

More specifically, we add information about the gender of critics and book authors when classifying the polarity of book reviews, and the polarity of the reviews when classifying the genders of authors and critics.

Language Modelling

Paper
Add Code

Gender and sentiment, critics and authors: a dataset of Norwegian book reviews

1 code implementation • GeBNLP (COLING) 2020 • Samia Touileb, Lilja Øvrelid, Erik Velldal

We also explore the differences in how this is done by male and female critics.

Paper
Code

Occupational Biases in Norwegian and Multilingual Language Models

1 code implementation • NAACL (GeBNLP) 2022 • Samia Touileb, Lilja Øvrelid, Erik Velldal

In this paper we explore how a demographic distribution of occupations, along gender dimensions, is reflected in pre-trained language models.

Descriptive

Paper
Code

Lexicon information in neural sentiment analysis: a multi-task learning approach

2 code implementations • WS (NoDaLiDa) 2019 • Jeremy Barnes, Samia Touileb, Lilja Øvrelid, Erik Velldal

This paper explores the use of multi-task learning (MTL) for incorporating external knowledge in neural models.

Multi-Task Learning Sentence +1

Paper
Code

JSEEGraph: Joint Structured Event Extraction as Graph Parsing

1 code implementation • 26 Jun 2023 • Huiling You, Samia Touileb, Lilja Øvrelid

We propose a graph-based event extraction framework JSEEGraph that approaches the task of event extraction as general graph parsing in the tradition of Meaning Representation Parsing.

Event Argument Extraction Event Extraction

Paper
Code

Learning Horn Envelopes via Queries from Large Language Models

no code implementations • 20 May 2023 • Sophie Blum, Raoul Koudijs, Ana Ozaki, Samia Touileb

We propose a new algorithm that aims at extracting the "tightest Horn approximation" of the target theory and that is guaranteed to terminate in exponential time (in the worst case) and in polynomial time if the target has polynomially many non-Horn examples.

Paper
Add Code

NorBench -- A Benchmark for Norwegian Language Models

1 code implementation • 6 May 2023 • David Samuel, Andrey Kutuzov, Samia Touileb, Erik Velldal, Lilja Øvrelid, Egil Rønningstad, Elina Sigdel, Anna Palatkina

We present NorBench: a streamlined suite of NLP tasks and probes for evaluating Norwegian language models (LMs) on standardized data splits and evaluation metrics.

Paper
Code

Measuring Normative and Descriptive Biases in Language Models Using Census Data

no code implementations • 12 Apr 2023 • Samia Touileb, Lilja Øvrelid, Erik Velldal

We investigate in this paper how distributions of occupations with respect to gender is reflected in pre-trained language models.

Descriptive

Paper
Add Code

Measuring Harmful Representations in Scandinavian Language Models

1 code implementation • 21 Nov 2022 • Samia Touileb, Debora Nozza

Scandinavian countries are perceived as role-models when it comes to gender equality.

Paper
Code

EventGraph at CASE 2021 Task 1: A General Graph-based Approach to Protest Event Extraction

1 code implementation • 18 Oct 2022 • Huiling You, David Samuel, Samia Touileb, Lilja Øvrelid

This paper presents our submission to the 2022 edition of the CASE 2021 shared task 1, subtask 4.

Event Extraction

Paper
Code

EventGraph: Event Extraction as Semantic Graph Parsing

1 code implementation • 16 Oct 2022 • Huiling You, David Samuel, Samia Touileb, Lilja Øvrelid

Event extraction therefore becomes a graph parsing problem, which provides the following advantages: 1) performing event detection and argument extraction jointly; 2) detecting and extracting multiple events from a piece of text; and 3) capturing the complicated interaction between event arguments and triggers.

Event Detection Event Extraction

Paper
Code

Annotating Norwegian Language Varieties on Twitter for Part-of-Speech

no code implementations • VarDial (COLING) 2022 • Petter Mæhlum, Andre Kåsen, Samia Touileb, Jeremy Barnes

We show that models trained on Universal Dependency (UD) data perform worse when evaluated against this dataset, and that models trained on Bokm{\aa}l generally perform better than those trained on Nynorsk.

POS

Paper
Add Code

NorDiaChange: Diachronic Semantic Change Dataset for Norwegian

1 code implementation • LREC 2022 • Andrey Kutuzov, Samia Touileb, Petter Mæhlum, Tita Ranveig Enstad, Alexandra Wittemann

We describe NorDiaChange: the first diachronic semantic change dataset for Norwegian.

Paper
Code

The interplay between language similarity and script on a novel multi-layer Algerian dialect corpus

1 code implementation • Findings (ACL) 2021 • Samia Touileb, Jeremy Barnes

However, the interplay between language similarity and difference in script on cross-lingual transfer is a less studied problem.

Cross-Lingual Transfer Part-Of-Speech Tagging +1

Paper
Code

NorDial: A Preliminary Corpus of Written Norwegian Dialect Use

1 code implementation • NoDaLiDa 2021 • Jeremy Barnes, Petter Mæhlum, Samia Touileb

Norway has a large amount of dialectal variation, as well as a general tolerance to its use in the public sphere.

Paper
Code

Identifying Sentiments in Algerian Code-switched User-generated Comments

no code implementations • LREC 2020 • Wafia Adouane, Samia Touileb, Jean-Philippe Bernardy

We present in this paper our work on Algerian language, an under-resourced North African colloquial Arabic variety, for which we built a comparably large corpus of more than 36, 000 code-switched user-generated comments annotated for sentiments.

Sentiment Analysis

Paper
Add Code

Named Entity Recognition without Labelled Data: A Weak Supervision Approach

1 code implementation • ACL 2020 • Pierre Lison, Aliaksandr Hubin, Jeremy Barnes, Samia Touileb

When in-domain labelled data is available, transfer learning techniques can be used to adapt existing NER models to the target domain.

named-entity-recognition Named Entity Recognition +2

124

Paper
Code

Measuring Diachronic Evolution of Evaluative Adjectives with Word Embeddings: the Case for English, Norwegian, and Russian

no code implementations • WS 2019 • Julia Rodina, Baksh, Daria aeva, Vadim Fomin, Andrey Kutuzov, Samia Touileb, Erik Velldal

We measure the intensity of diachronic semantic shifts in adjectives in English, Norwegian and Russian across 5 decades.

Word Embeddings

Paper
Add Code

Automatic identification of unknown names with specific roles

no code implementations • COLING 2018 • Samia Touileb, Truls Pedersen, Helle Sj{\o}vaag

Automatically identifying persons in a particular role within a large corpus can be a difficult task, especially if you don{'}t know who you are actually looking for.

Paper
Add Code

NoReC: The Norwegian Review Corpus

1 code implementation • LREC 2018 • Erik Velldal, Lilja Øvrelid, Eivind Alexander Bergem, Cathrine Stadsnes, Samia Touileb, Fredrik Jørgensen

As resources for sentiment analysis have so far been unavailable for Norwegian, NoReC represents a highly valuable and sought-after addition to Norwegian language technology.

Opinion Mining Sentiment Analysis

Paper
Code

Constructions: a New Unit of Analysis for Corpus-based Discourse Analysis

no code implementations • PACLIC 2014 • Samia Touileb, Andrew Salway

Paper
Add Code

Applying Grammar Induction to Text Mining

no code implementations • ACL 2014 • Andrew Salway, Samia Touileb

Paper
Add Code

Inducing Information Structures for Data-driven Text Analysis

no code implementations • WS 2014 • Andrew Salway, Samia Touileb, Endre Tvinnereim

Open Information Extraction

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.