Search Results for author: Charibeth Cheng

Found 11 papers, 5 papers with code

Improving Large-scale Language Models and Resources for Filipino

no code implementations LREC 2022 Jan Christian Blaise Cruz, Charibeth Cheng

In this paper, we improve on existing language resources for the low-resource Filipino language in two ways.

Exploiting News Article Structure for Automatic Corpus Generation of Entailment Datasets

1 code implementation 22 Oct 2020 Jan Christian Blaise Cruz, Jose Kristian Resabal, James Lin, Dan John Velasco, Charibeth Cheng

Lastly, we perform analyses on transfer learning techniques to shed light on their true performance when operating in low-data domains through the use of degradation tests.

Benchmarking, Natural Language Inference

Establishing Baselines for Text Classification in Low-Resource Languages

1 code implementation 5 May 2020 Jan Christian Blaise Cruz, Charibeth Cheng

We analyze our pretrained model's degradation speeds and look towards the use of this method for comparing models aimed at operating within the low-resource setting.

General Classification, Multilabel Text Classification

Simplifying Paragraph-level Question Generation via Transformer Language Models

4 code implementations 3 May 2020 Luis Enrico Lopez, Diane Kathryn Cruz, Jan Christian Blaise Cruz, Charibeth Cheng

Question generation (QG) is a natural language generation task where a model is trained to ask questions corresponding to some input text.

Language Modelling, Question Generation

Localization of Fake News Detection via Multitask Transfer Learning

1 code implementation LREC 2020 Jan Christian Blaise Cruz, Julianne Agatha Tan, Charibeth Cheng

Second, we benchmark Transfer Learning (TL) techniques and show that they can be used to train robust fake news classifiers from little data, achieving 91% accuracy on our fake news dataset, reducing the error by 14% compared to established few-shot baselines.

Fake News Detection, Language Modelling

Evaluating Language Model Finetuning Techniques for Low-resource Languages

2 code implementations 30 Jun 2019 Jan Christian Blaise Cruz, Charibeth Cheng

Unlike mainstream languages (such as English and French), low-resource languages often suffer from a lack of expert-annotated corpora and benchmark resources that make it hard to apply state-of-the-art techniques directly.

Language Modelling

Modeling Personality Traits of Filipino Twitter Users

no code implementations WS 2018 Edward Tighe, Charibeth Cheng

Recent studies in the field of text-based personality recognition experiment with different languages, feature extraction techniques, and machine learning algorithms to create better and more accurate models; however, little focus is placed on exploring the language use of a group of individuals defined by nationality.

General Classification
