no code implementations • 29 Apr 2024 • Wondimagegnhue Tsegaye Tufa, Ilia Markov, Piek Vossen
We conduct a series of experiments to determine the effect of the script and tokenizer used in the pre-trained model on the performance of the downstream task.
1 code implementation • 29 Apr 2024 • Wondimagegnhue Tsegaye Tufa, Ilia Markov, Piek Vossen
Toxic language remains an ongoing challenge on social media platforms, presenting significant issues for users and communities.
no code implementations • WS 2019 • Solomon Teferra Abate, Michael Melese, Martha Yifiru Tachbelie, Million Meshesha, Solomon Atinafu, Wondwossen Mulugeta, Yaregal Assabie, Hafte Abera, Biniyam Ephrem, Tewodros Gebreselassie, Wondimagegnhue Tsegaye Tufa, Amanuel Lemma, Tsegaye Andargie, Seifedin Shifaw
In this paper, we describe an attempt towards the development of parallel corpora for English and Ethiopian Languages, such as Amharic, Tigrigna, Afan-Oromo, Wolaytta and Ge'ez.