Search Results for author: Sin-En Lu

Found 3 papers, 1 papers with code

BRCC and SentiBahasaRojak: The First Bahasa Rojak Corpus for Pretraining and Sentiment Analysis Dataset

no code implementations COLING 2022 Nanda Putri Romadhona, Sin-En Lu, Bo-Han Lu, Richard Tzong-Han Tsai

Finally, to test the effectiveness of the Mixed XLM model pre-trained on BRCC for social media scenarios where code-mixing is found frequently, we compile a new Bahasa Rojak sentiment analysis dataset, SentiBahasaRojak, with a Kappa value of 0. 77.

Data Augmentation Sentiment Analysis +1

A Survey of Approaches to Automatic Question Generation:from 2019 to Early 2021

no code implementations ROCLING 2021 Chao-Yi Lu, Sin-En Lu

To provide analysis of recent researches of automatic question generation from text, we surveyed 9 papers between 2019 to early 2021, retrieved from Paper with Code(PwC).

Question Generation Question-Generation

Exploring Methods for Building Dialects-Mandarin Code-Mixing Corpora: A Case Study in Taiwanese Hokkien

1 code implementation21 Jan 2023 Sin-En Lu, Bo-Han Lu, Chao-Yi Lu, Richard Tzong-Han Tsai

In natural language processing (NLP), code-mixing (CM) is a challenging task, especially when the mixed languages include dialects.

Language Modelling Transfer Learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.