Search Results for author: Alex Mourachko

Found 3 papers, 3 papers with code

MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector

1 code implementation10 Jan 2024 Marta R. Costa-jussà, Mariano Coria Meglioli, Pierre Andrews, David Dale, Prangthip Hansanti, Elahe Kalbassi, Alex Mourachko, Christophe Ropers, Carleigh Wood

Research in toxicity detection in natural language processing for the speech modality (audio-based) is quite limited, particularly for languages other than English.

xSIM++: An Improved Proxy to Bitext Mining Performance for Low-Resource Languages

1 code implementation22 Jun 2023 Mingda Chen, Kevin Heffernan, Onur Çelebi, Alex Mourachko, Holger Schwenk

In comparison to xSIM, we show that xSIM++ is better correlated with the downstream BLEU scores of translation systems trained on mined bitexts, providing a reliable proxy of bitext mining performance without needing to run expensive bitext mining pipelines.

NMT

Cannot find the paper you are looking for? You can Submit a new open access paper.