no code implementations • JEP/TALN/RECITAL 2022 • Nicolas Hiebel, Karën Fort, Aurélie Névéol, Olivier Ferret
NLP relies on the availability of annotated corpora for training and evaluating models.
no code implementations • LREC 2022 • Nicolas Hiebel, Olivier Ferret, Karën Fort, Aurélie Névéol
We introduce a definition of similarity that is guided by clinical facts and apply it to the development of a new French corpus of 1,000 sentence pairs manually annotated according to similarity scores.
no code implementations • LREC 2020 • Anaëlle Baledent, Nicolas Hiebel, Gaël Lejeune
The experiments presented in this article focused on documents written in French, but we believe that the ability of character-level models to handle noise properly would help to achieve comparable results on other languages, and on more ancient languages in particular.