no code implementations • NAACL 2022 • Pieter Delobelle, Ewoenam Tokpo, Toon Calders, Bettina Berendt
We survey the literature on fairness metrics for pre-trained language models and experimentally evaluate compatibility, including both biases in language models and in their downstream tasks.
1 code implementation • 2 May 2024 • Dimitri Staufer, Frank Pallas, Bettina Berendt
Such an approach, however, is inadequate for the whistleblowing scenario since it neglects further re-identification potential in textual features, including writing style.
no code implementations • 26 Feb 2024 • David Hartmann, José Renato Laranjeira de Pereira, Chiara Streitbörger, Bettina Berendt
By considering the value of third-party audits and third-party data access in an audit ecosystem, we identify a regulatory gap in that the Artificial Intelligence Act does not provide access to data for researchers and civil society.
no code implementations • 5 Oct 2023 • François Remy, Pieter Delobelle, Bettina Berendt, Kris Demuynck, Thomas Demeester
This one-to-many token mapping greatly improves the initialization of the embedding table for the target language.
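The idea above can be sketched as follows: each target-language token's embedding is initialized as the mean of the source-language embeddings it maps to. This is a minimal illustration under assumed data structures, not the paper's actual implementation or API.

```python
def init_target_embeddings(source_emb, token_map):
    """Sketch of one-to-many token-mapping initialization (assumed
    structures, not the paper's code): `source_emb` is a list of
    source-token vectors (lists of floats); `token_map` maps each
    target token id to the list of source token ids it maps to."""
    dim = len(source_emb[0])
    target_emb = []
    for tgt_id in range(len(token_map)):
        src_ids = token_map[tgt_id]
        # component-wise mean of all mapped source vectors
        mean = [sum(source_emb[s][d] for s in src_ids) / len(src_ids)
                for d in range(dim)]
        target_emb.append(mean)
    return target_emb
```

A target token that maps to a single source token simply copies that vector; tokens mapping to several source tokens start from their average rather than from a random vector.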
no code implementations • 7 Mar 2023 • Bettina Berendt, Özgür Karadeniz, Sercan Kiyak, Stefan Mertens, Leen d'Haenens
Libraries are increasingly relying on computational methods, including methods from Artificial Intelligence (AI).
1 code implementation • 27 Feb 2023 • Jose M. Alvarez, Kristen M. Scott, Salvatore Ruggieri, Bettina Berendt
It is a known issue that, when a pre-trained machine learning model is deployed, the target population may not have been reflected in the source population on which the model was trained.
1 code implementation • 30 Jan 2023 • Ewoenam Tokpo, Pieter Delobelle, Bettina Berendt, Toon Calders
Since the end use of these language models is downstream tasks like text classification, it is important to understand how, and to what extent, these intrinsic bias mitigation strategies actually translate to fairness in downstream tasks.
no code implementations • 29 Dec 2022 • Ozgur Karadeniz, Bettina Berendt, Sercan Kiyak, Stefan Mertens, Leen d'Haenens
We describe a case study of the relative over- or under-representation of Belgian political parties between 1990 and 2020 in the English-language DBpedia, the Dutch-language DBpedia, and Wikidata. We highlight the many decisions needed in the design of this data analysis and the assumptions behind it, as well as the implications of the results.
no code implementations • 15 Nov 2022 • Pieter Delobelle, Thomas Winters, Bettina Berendt
To evaluate whether our new model is a plug-in replacement for RobBERT, we introduce two additional criteria based on concept drift of existing tokens and alignment for novel tokens. We found that this update results in a significant performance increase for certain language tasks.
1 code implementation • 10 Jul 2022 • Pieter Delobelle, Bettina Berendt
Large pre-trained language models are successfully being used in a variety of tasks, across many languages.
no code implementations • 28 Apr 2022 • Pieter Delobelle, Thomas Winters, Bettina Berendt
We found that the performance of the models using the shuffled versus non-shuffled datasets is similar for most tasks and that randomly merging subsequent sentences in a corpus creates models that train faster and perform better on tasks with long sequences.
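The corpus-preparation idea described above, randomly merging runs of subsequent sentences into longer training examples, can be sketched roughly like this. The merge probability, seed, and function name are assumptions for illustration, not the paper's settings.

```python
import random

def merge_subsequent(sentences, merge_prob=0.5, seed=0):
    """Sketch (assumed parameters): randomly merge runs of subsequent
    sentences from a corpus into longer training examples."""
    rng = random.Random(seed)
    merged, buf = [], []
    for sent in sentences:
        buf.append(sent)
        # with probability (1 - merge_prob), close the current run
        if rng.random() >= merge_prob:
            merged.append(" ".join(buf))
            buf = []
    if buf:
        merged.append(" ".join(buf))
    return merged
```

Because sentences are only merged with their immediate successors, the corpus order and content are preserved; only the example boundaries change.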
1 code implementation • 14 Dec 2021 • Pieter Delobelle, Ewoenam Kwaku Tokpo, Toon Calders, Bettina Berendt
We survey the existing literature on fairness metrics for pretrained language models and experimentally evaluate compatibility, including both biases in language models and in their downstream tasks.
no code implementations • 4 Nov 2021 • Bettina Berendt, Stefan Schiffner
When technology enters applications and processes with a long tradition of controversial societal debate, multi-faceted new ethical and legal questions arise.
1 code implementation • 20 Apr 2021 • Kristen Scott, Pieter Delobelle, Bettina Berendt
We classify seven months' worth of Belgian COVID-related Tweets using multilingual BERT and relate them to their governments' COVID measures.
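Once each tweet has a classifier label, relating the stream to dated government measures amounts to aggregating labels over time. A minimal sketch, assuming a list of (date, label) pairs as input (the classifier itself, e.g. a fine-tuned multilingual BERT, is not shown):

```python
from collections import Counter, defaultdict
from datetime import date

def weekly_label_shares(classified_tweets):
    """Sketch (assumed input format): aggregate per-tweet labels into
    weekly label proportions, so the time series can be compared with
    the dates of government COVID measures."""
    by_week = defaultdict(Counter)
    for day, label in classified_tweets:
        iso = day.isocalendar()  # (year, week, weekday)
        by_week[(iso[0], iso[1])][label] += 1
    return {week: {lab: n / sum(counts.values())
                   for lab, n in counts.items()}
            for week, counts in by_week.items()}
```

Shares per ISO week sum to one, so shifts in label proportions around a measure's announcement date stand out directly.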
1 code implementation • 14 May 2020 • Pieter Delobelle, Paul Temple, Gilles Perrouin, Benoît Frénay, Patrick Heymans, Bettina Berendt
These new examples are then used to retrain and improve the model in the first step.
1 code implementation • Findings of the Association for Computational Linguistics 2020 • Pieter Delobelle, Thomas Winters, Bettina Berendt
Training a Dutch BERT model thus has a lot of potential for a wide range of Dutch NLP tasks.
Ranked #1 on Sentiment Analysis on DBRD
no code implementations • 30 Oct 2019 • Pieter Delobelle, Bettina Berendt
Graphical emoji are ubiquitous in modern-day online conversations.
1 code implementation • ACL 2019 • Pieter Delobelle, Murilo Cunha, Eric Massip Cano, Jeroen Peperkamp, Bettina Berendt
Fallacies like the personal attack (also known as the ad hominem attack) are introduced in debates as an easy win, even though they provide no rhetorical contribution.
no code implementations • 30 Oct 2018 • Bettina Berendt
But what is the Common Good, and is it enough to want to be good?