no code implementations • LREC 2022 • Aleksandra Miletic, Christophe Benzitoun, Georgeta Cislaru, Santiago Herrera-Yanez
Pro-TEXT is a corpus of keystroke logs written in French.
no code implementations • VarDial (COLING) 2022 • Aleksandra Miletic, Yves Scherrer
This paper presents OcWikiDisc, a new freely available corpus in Occitan, as well as language identification experiments on Occitan done as part of the corpus building process.
no code implementations • VarDial (COLING) 2020 • Aleksandra Miletic, Myriam Bras, Marianne Vergez-Couret, Louise Esher, Clamença Poujade, Jean Sibille
Occitan is a Romance language spoken mainly in the south of France.
no code implementations • 29 Apr 2024 • Dana Roemling, Yves Scherrer, Aleksandra Miletic
Forensic authorship profiling uses linguistic markers to infer characteristics about an author of a text.