Search Results for author: Jana Straková

Found 13 papers, 7 papers with code

Czech Grammar Error Correction with a Large and Diverse Corpus

no code implementations14 Jan 2022 Jakub Náplava, Milan Straka, Jana Straková, Alexandr Rosen

We introduce a large and diverse Czech corpus annotated for grammatical error correction (GEC) with the aim to contribute to the still scarce data resources in this domain for languages other than English.

Grammatical Error Correction

Character Transformations for Non-Autoregressive GEC Tagging

1 code implementation WNUT (ACL) 2021 Milan Straka, Jakub Náplava, Jana Straková

We propose a character-based nonautoregressive GEC approach, with automatically generated character transformations.

Understanding Model Robustness to User-generated Noisy Texts

1 code implementation WNUT (ACL) 2021 Jakub Náplava, Martin Popel, Milan Straka, Jana Straková

We also compare two approaches to address the performance drop: a) training the NLP models with noised data generated by our framework; and b) reducing the input noise with external system for natural language correction.

Grammatical Error Correction Machine Translation +5

UDPipe at EvaLatin 2020: Contextualized Embeddings and Treebank Embeddings

no code implementations LREC 2020 Milan Straka, Jana Straková

We present our contribution to the EvaLatin shared task, which is the first evaluation campaign devoted to the evaluation of NLP tools for Latin.

Lemmatization POS +1

ÚFAL MRPipe at MRP 2019: UDPipe Goes Semantic in the Meaning Representation Parsing Shared Task

1 code implementation24 Oct 2019 Milan Straka, Jana Straková

We present a system description of our contribution to the CoNLL 2019 shared task, Cross-Framework Meaning Representation Parsing (MRP 2019).

Dependency Parsing Lemmatization +3

Czech Text Processing with Contextual Embeddings: POS Tagging, Lemmatization, Parsing and NER

no code implementations8 Sep 2019 Milan Straka, Jana Straková, Jan Hajič

We evaluate two meth ods for precomputing such embeddings, BERT and Flair, on four Czech text processing tasks: part-of-speech (POS) tagging, lemmatization, dependency pars ing and named entity recognition (NER).

Dependency Parsing Lemmatization +6

Evaluating Contextualized Embeddings on 54 Languages in POS Tagging, Lemmatization and Dependency Parsing

no code implementations20 Aug 2019 Milan Straka, Jana Straková, Jan Hajič

We present an extensive evaluation of three recently proposed methods for contextualized embeddings on 89 corpora in 54 languages of the Universal Dependencies 2. 3 in three tasks: POS tagging, lemmatization, and dependency parsing.

Dependency Parsing Lemmatization +3

Neural Architectures for Nested NER through Linearization

1 code implementation ACL 2019 Jana Straková, Milan Straka, Jan Hajič

We propose two neural network architectures for nested named entity recognition (NER), a setting in which named entities may overlap and also be labeled with more than one label.

Hard Attention named-entity-recognition +4

Cannot find the paper you are looking for? You can Submit a new open access paper.