2 code implementations • 24 Aug 2023 • György Orosz, Gergő Szabó, Péter Berkecz, Zsolt Szántó, Richárd Farkas
This paper presents a set of industrial-grade text processing models for Hungarian that achieve near state-of-the-art performance while balancing resource efficiency and accuracy.
1 code implementation • 13 Jun 2023 • Péter Berkecz, György Orosz, Zsolt Szántó, Gergő Szabó, Richárd Farkas
Lemmatization is still not a trivial task for morphologically rich languages.
1 code implementation • 6 Jan 2022 • György Orosz, Zsolt Szántó, Péter Berkecz, Gergő Szabó, Richárd Farkas
Although there are a couple of open-source language processing pipelines available for Hungarian, none of them satisfies the requirements of today's NLP applications.
no code implementations • Association for Computational Linguistics 2008 • György Szarvas, Veronika Vincze, Richárd Farkas, János Csirik
This article reports on a corpus annotation project that has produced a freely available resource for research on handling negation and uncertainty in biomedical texts (we call this corpus the BioScope corpus).