no code implementations • PAIL (ICON) 2021 • Dineskumar Murugesapillai, Anankan Ravinthirarasa, Gihan Dias, Kengatharaiyer Sarveswaran
This paper describes an ongoing development of a grammar error checker for the Tamil language using a state-of-the-art deep neural-based approach.
no code implementations • 16 Jan 2024 • Kengatharaiyer Sarveswaran
The paper also highlights the complexity and richness of Tamil in terms of its morphological and syntactic features, which will be useful for linguists analysing the language and conducting comparative studies.
2 code implementations • 12 Sep 2023 • Wei Qi Leong, Jian Gang Ngui, Yosephine Susanto, Hamsawardhini Rengarajan, Kengatharaiyer Sarveswaran, William Chandra Tjhi
As GPT-4 is purportedly one of the best-performing multilingual LLMs at the moment, we use it as a yardstick to gauge the capabilities of LLMs in the context of SEA languages.
2 code implementations • ICON 2020 • Kengatharaiyer Sarveswaran, Gihan Dias
ThamizhiUDp uses Stanza for tokenisation and lemmatisation, ThamizhiPOSt and ThamizhiMorph for generating Part of Speech (POS) and Morphological annotations, and uuparser with multilingual training for dependency parsing.
no code implementations • WS 2019 • Kengatharaiyer Sarveswaran, Gihan Dias, Miriam Butt
This paper describes a new and larger coverage Finite-State Morphological Analyser (FSM) and Generator for the Dravidian language Tamil.