no code implementations • 28 Mar 2023 • Jan Idziak, Artjoms Šeļa, Michał Woźniak, Albert Leśniak, Joanna Byszuk, Maciej Eder
Our study provides a working solution that reads the cards, and links their lemmas to a searchable list of dictionary entries, for a large historical dictionary entitled the Dictionary of the 17th- and 18th-century Polish, which comprizes 2. 8 million index cards.
no code implementations • 13 Jan 2023 • Artjoms Šeļa, Ben Nagy, Joanna Byszuk, Laura Hernández-Lorenzo, Botond Szemes, Maciej Eder
Stylometry is mostly applied to authorial style.
no code implementations • 2 Nov 2022 • Maciej Eder
In this paper, I introduce a simple method of computing relative word frequencies for authorship attribution and similar stylometric tasks.
1 code implementation • 5 Jun 2022 • Maciej Eder, Rafał. L. Górski
In inflected languages, word endings play a prominent role, and hence different word forms cannot be recognized using generic text tokenization.
1 code implementation • 13 Apr 2021 • Rafał L. Górski, Maciej Eder
In our study, we apply logistic regression models to 9 changes which occurred between 15th and 18th century in the Polish language.
no code implementations • LREC 2020 • Joanna Byszuk, Micha{\l} Wo{\'z}niak, Mike Kestemont, Albert Le{\'s}niak, Wojciech {\L}ukasik, Artjoms {\v{S}}e{\c{l}}a, Maciej Eder
Fictional prose can be broadly divided into narrative and discursive forms with direct speech being central to any discourse representation (alongside indirect reported speech and free indirect discourse).