Search Results for author: Juuso Eronen

Found 8 papers, 0 papers with code

Cyberbullying Detection for Low-resource Languages and Dialects: Review of the State of the Art

no code implementations • 30 Aug 2023 • Tanjim Mahmud, Michal Ptaszynski, Juuso Eronen, Fumito Masui

Based on recognizing those research gaps, we provide some suggestions for improving the general research conduct in cyberbullying detection, with a primary focus on low-resource languages.

Abusive Language

Improving Polish to English Neural Machine Translation with Transfer Learning: Effects of Data Volume and Language Similarity

no code implementations • 1 Jun 2023 • Juuso Eronen, Michal Ptaszynski, Karol Nowakowski, Zheng Lin Chia, Fumito Masui

This paper investigates the impact of data volume and the use of similar languages on transfer learning in a machine translation task.

Machine Translation, Transfer Learning +1

Zero-shot cross-lingual transfer language selection using linguistic similarity

no code implementations • 31 Jan 2023 • Juuso Eronen, Michal Ptaszynski, Fumito Masui

This allows us to select a more suitable transfer language which can be used to better leverage knowledge from high-resource languages in order to improve the performance of language applications lacking data.

Dependency Parsing, Named Entity Recognition +4

Comparing Performance of Different Linguistically-Backed Word Embeddings for Cyberbullying Detection

no code implementations • 4 Jun 2022 • Juuso Eronen, Michal Ptaszynski, Fumito Masui

In most cases, word embeddings are learned only from raw tokens or, in some cases, lemmas.

Word Embeddings

Exploring the Potential of Feature Density in Estimating Machine Learning Classifier Performance with Application to Cyberbullying Detection

no code implementations • 4 Jun 2022 • Juuso Eronen, Michal Ptaszynski, Fumito Masui, Gniewosz Leliwa, Michal Wroczynski

In this research.

Initial Study into Application of Feature Density and Linguistically-backed Embedding to Improve Machine Learning-based Cyberbullying Detection

no code implementations • 4 Jun 2022 • Juuso Eronen, Michal Ptaszynski, Fumito Masui, Gniewosz Leliwa, Michal Wroczynski, Mateusz Piech, Aleksander Smywinski-Pohl

In this research, we study the change in the performance of machine learning (ML) classifiers when various linguistic preprocessing methods of a dataset were used, with the specific focus on linguistically-backed embeddings in Convolutional Neural Networks (CNN).

Transfer Language Selection for Zero-Shot Cross-Lingual Abusive Language Detection

no code implementations • 2 Jun 2022 • Juuso Eronen, Michal Ptaszynski, Fumito Masui, Masaki Arata, Gniewosz Leliwa, Michal Wroczynski

We study the selection of transfer languages for automatic abusive language detection.

Abusive Language, Cross-Lingual Transfer +1

Improving Classifier Training Efficiency for Automatic Cyberbullying Detection with Feature Density

no code implementations • 2 Nov 2021 • Juuso Eronen, Michal Ptaszynski, Fumito Masui, Aleksander Smywiński-Pohl, Gniewosz Leliwa, Michal Wroczynski

We study the effectiveness of Feature Density (FD) using different linguistically-backed feature preprocessing methods in order to estimate dataset complexity, which in turn is used to comparatively estimate the potential performance of machine learning (ML) classifiers prior to any training.

Sentiment Analysis
