no code implementations • LREC 2022 • Francesco Mambrini, Marco Passarotti, Giovanni Moretti, Matteo Pellegrini
Although the Universal Dependencies initiative today allows for cross-linguistically consistent annotation of morphology and syntax in treebanks for several languages, syntactically annotated corpora are not yet interoperable with many lexical resources that describe properties of the words that occur therein.
no code implementations • LDL (ACL) 2022 • Margherita Fantoli, Marco Passarotti, Francesco Mambrini, Giovanni Moretti, Paolo Ruffolo
This paper describes the process of interlinking the 130 Classical Latin texts provided by an annotated corpus developed at the LASLA laboratory with the LiLa Knowledge Base, which makes linguistic resources for Latin interoperable by following the principles of the Linked Data paradigm and making reference to classes and properties of widely adopted ontologies to model the relevant information.
no code implementations • LT4HALA (LREC) 2022 • Rachele Sprugnoli, Marco Passarotti, Flavio Massimiliano Cecchini, Margherita Fantoli, Giovanni Moretti
This paper describes the organization and the results of the second edition of EvaLatin, the campaign for the evaluation of Natural Language Processing tools for Latin.
no code implementations • LDL (ACL) 2022 • Christian Chiarcos, Katerina Gkirtzou, Fahad Khan, Penny Labropoulou, Marco Passarotti, Matteo Pellegrini
This paper describes the current status of the emerging OntoLex module for linguistic morphology.
no code implementations • LREC 2020 • Francesco Mambrini, Marco Passarotti
In this paper we describe the process of inclusion of etymological information in a knowledge base of interoperable Latin linguistic resources developed in the context of the LiLa: Linking Latin project.
no code implementations • LREC 2020 • Flavio Massimiliano Cecchini, Timo Korkiakangas, Marco Passarotti
The present work introduces a new Latin treebank that follows the Universal Dependencies (UD) annotation standard.
no code implementations • LREC 2020 • Rachele Sprugnoli, Marco Passarotti, Flavio Massimiliano Cecchini, Matteo Pellegrini
This also allows us to propose the Cross-genre and Cross-time subtasks for each task, in order to evaluate the portability of NLP tools for Latin across different genres and time periods.
no code implementations • LREC 2020 • Rachele Sprugnoli, Marco Passarotti, Daniela Corbetta, Andrea Peverelli
Sentiment lexicons are essential for developing automatic sentiment analysis systems, but the resources currently available mostly cover modern languages.
Cultural Vocal Bursts Intensity Prediction Sentiment Analysis
no code implementations • WS 2019 • Francesco Mambrini, Marco Passarotti
The interoperability between lemmatized corpora of Latin and other resources that use the lemma as indexing key is hampered by the multiple lemmatization strategies that different projects adopt.
no code implementations • WS 2018 • Flavio Massimiliano Cecchini, Marco Passarotti, Paola Marongiu, Daniel Zeman
The changes are made both to harmonise the Universal Dependencies version of the \textit{Index Thomisticus} Treebank with the two other available Latin treebanks and to fix errors and inconsistencies resulting from the original process.
no code implementations • LREC 2016 • Marco Passarotti, Berta Gonz{\'a}lez Saavedra, Christophe Onambele
This situation is strictly related to the still quite limited amount of linguistically annotated textual data for Latin, which can help the building of new lexical resources by supporting them with empirical evidence.
no code implementations • LREC 2016 • Edoardo Maria Ponti, Marco Passarotti
The Index Thomisticus Treebank is the largest available treebank for Latin; it contains Medieval Latin texts by Thomas Aquinas.
no code implementations • LREC 2014 • Scott Martens, Marco Passarotti
This paper describes the integration of the Index Thomisticus Treebank (IT-TB) into the web-based treebank search and visualization application TueNDRA (Tuebingen aNnotated Data Retrieval {\&} Analysis).
no code implementations • LREC 2014 • Chris Culy, Marco Passarotti, Ulla K{\"o}nig-Cardanobile
Assuming that language is a system, we describe a tool (using the DoubleTreeJS visualization) to visualize the results of querying dependency treebanks by forming a node from a single item type, and building a network in which the heads and the dependents of the central node are respectively the left and the right vertices of the tree, which are connected to the central node by dependency relations.
no code implementations • LREC 2012 • Marco Passarotti, Francesco Mambrini
Although lexicography of Latin has a long tradition dating back to ancient grammarians, and almost all Latin grammars devote to wordformation at least one part of the section(s) concerning morphology, none of the today available lexical resources and NLP tools of Latin feature a wordformation-based organization of the Latin lexicon.