no code implementations • WS 2017 • Jirka Hana, Barbora Hladk{\'a}
We present a pilot study on parsing non-native texts written by learners of Czech.
no code implementations • LREC 2014 • Adriane Boyd, Jirka Hana, Lionel Nicolas, Detmar Meurers, Katrin Wisniewski, Andrea Abel, Karin Sch{\"o}ne, Barbora {\v{S}}tindlov{\'a}, Chiara Vettori
The MERLIN corpus is a written learner corpus for Czech, German, and Italian that has been designed to illustrate the Common European Framework of Reference for Languages (CEFR) with authentic learner data.
no code implementations • LREC 2012 • Jirka Hana, Alex Rosen, R, Barbora {\v{S}}tindlov{\'a}, Petr J{\"a}ger
The paper describes a corpus of texts produced by non-native speakers of Czech.
no code implementations • LREC 2012 • Jirka Hana, Barbora Hladk{\'a}
We present a new way to get more morphologically and syntactically annotated data.