no code implementations • 2 May 2024 • Aleksei Dorkin, Kairit Sirts
This paper presents the TartuNLP team submission to EvaLatin 2024 shared task of the emotion polarity detection for historical Latin texts.
no code implementations • 30 Apr 2024 • Aleksei Dorkin, Kairit Sirts
We present an information retrieval based reverse dictionary system using modern pre-trained language models and approximate nearest neighbors search algorithms.
no code implementations • 23 Apr 2024 • Aleksei Dorkin, Kairit Sirts
This study evaluates three different lemmatization approaches to Estonian -- Generative character-level models, Pattern-based word-level classification models, and rule-based morphological analysis.
no code implementations • 19 Apr 2024 • Aleksei Dorkin, Kairit Sirts
We present our submission to the unconstrained subtask of the SIGTYP 2024 Shared Task on Word Embedding Evaluation for Ancient and Historical Languages for morphological annotation, POS-tagging, lemmatization, character- and word-level gap-filling.