no code implementations • loresmt (AACL) 2020 • Renz Iver Baliber, Charibeth Cheng, Kristine Mae Adlaon, Virgion Mamonong
The Philippines is home to more than 150 languages that is considered to be low-resourced even on its major languages.
no code implementations • 7 Apr 2022 • Dan John Velasco, Axel Alba, Trisha Gail Pelagio, Bryce Anthony Ramirez, Unisse Chua, Briane Paul Samson, Jan Christian Blaise Cruz, Charibeth Cheng
The resulting sense inventory and synonym sets can be used in automatically creating a wordnet.
no code implementations • 6 Apr 2022 • Gabriel Louis Tan, Adrian Paule Ty, Schuyler Ng, Denzel Adrian Co, Jan Christian Blaise Cruz, Charibeth Cheng
Lastly, we published the first Filipino conversational response generator capable of generating responses related to the previous 3 responses.
no code implementations • LREC 2022 • Jan Christian Blaise Cruz, Charibeth Cheng
In this paper, we improve on existing language resources for the low-resource Filipino language in two ways.
1 code implementation • 22 Oct 2020 • Jan Christian Blaise Cruz, Jose Kristian Resabal, James Lin, Dan John Velasco, Charibeth Cheng
Lastly, we perform analyses on transfer learning techniques to shed light on their true performance when operating in low-data domains through the use of degradation tests.
1 code implementation • 5 May 2020 • Jan Christian Blaise Cruz, Charibeth Cheng
We analyze our pretrained model's degradation speeds and look towards the use of this method for comparing models aimed at operating within the low-resource setting.
4 code implementations • 3 May 2020 • Luis Enrico Lopez, Diane Kathryn Cruz, Jan Christian Blaise Cruz, Charibeth Cheng
Question generation (QG) is a natural language generation task where a model is trained to ask questions corresponding to some input text.
no code implementations • WS 2019 • Jared Rivera, Jan Caleb Oliver Pensica, Jolene Valenzuela, Alfonso Secuya, Charibeth Cheng
The SWBD-DAMSL tagset for DA classification was modified to 28 tags fitting the categories applicable to e-commerce conversations.
1 code implementation • LREC 2020 • Jan Christian Blaise Cruz, Julianne Agatha Tan, Charibeth Cheng
Second, we benchmark Transfer Learning (TL) techniques and show that they can be used to train robust fake news classifiers from little data, achieving 91% accuracy on our fake news dataset, reducing the error by 14% compared to established few-shot baselines.
2 code implementations • 30 Jun 2019 • Jan Christian Blaise Cruz, Charibeth Cheng
Unlike mainstream languages (such as English and French), low-resource languages often suffer from a lack of expert-annotated corpora and benchmark resources that make it hard to apply state-of-the-art techniques directly.
no code implementations • WS 2018 • Edward Tighe, Charibeth Cheng
Recent studies in the field of text-based personality recognition experiment with different languages, feature extraction techniques, and machine learning algorithms to create better and more accurate models; however, little focus is placed on exploring the language use of a group of individuals defined by nationality.