no code implementations • 17 Mar 2024 • Mohamed Taher Alrefaie, Nour Eldin Morsy, Nada Samir
This paper presents a comprehensive examination of the impact of tokenization strategies and vocabulary sizes on the performance of Arabic language models in downstream natural language processing tasks.