no code implementations • LREC 2022 • Francesco Mambrini, Marco Passarotti, Giovanni Moretti, Matteo Pellegrini
Although the Universal Dependencies initiative today allows for cross-linguistically consistent annotation of morphology and syntax in treebanks for several languages, syntactically annotated corpora are not yet interoperable with many lexical resources that describe properties of the words that occur therein.
no code implementations • LDL (ACL) 2022 • Margherita Fantoli, Marco Passarotti, Francesco Mambrini, Giovanni Moretti, Paolo Ruffolo
This paper describes the process of interlinking the 130 Classical Latin texts provided by an annotated corpus developed at the LASLA laboratory with the LiLa Knowledge Base, which makes linguistic resources for Latin interoperable by following the principles of the Linked Data paradigm and making reference to classes and properties of widely adopted ontologies to model the relevant information.
no code implementations • LREC 2020 • Francesco Mambrini, Marco Passarotti
In this paper we describe the process of inclusion of etymological information in a knowledge base of interoperable Latin linguistic resources developed in the context of the LiLa: Linking Latin project.
no code implementations • WS 2019 • Francesco Mambrini, Marco Passarotti
The interoperability between lemmatized corpora of Latin and other resources that use the lemma as indexing key is hampered by the multiple lemmatization strategies that different projects adopt.
no code implementations • LREC 2012 • Marco Passarotti, Francesco Mambrini
Although lexicography of Latin has a long tradition dating back to ancient grammarians, and almost all Latin grammars devote to wordformation at least one part of the section(s) concerning morphology, none of the today available lexical resources and NLP tools of Latin feature a wordformation-based organization of the Latin lexicon.