Search Results for author: Rahmad Mahendra

Found 21 papers, 7 papers with code

The Framework of Multiword Expression in Indonesian Language

no code implementations • PACLIC 2020 • Totok Suhardijanto, Rahmad Mahendra, Zahroh Nuriah, Adi Budiwiyanto

Paper
Add Code

Semantic Role Labeling in Conversational Chat using Deep Bi-Directional Long Short-Term Memory Networks with Attention Mechanism

no code implementations • PACLIC 2018 • Valdi Rachman, Rahmad Mahendra, Alfan Farizki Wicaksono, Ahmad Rizqi Meydiarso, Fariz Ikhwantri

Semantic Role Labeling

Paper
Add Code

ISWARA at WNUT-2020 Task 2: Identification of Informative COVID-19 English Tweets using BERT and FastText Embeddings

no code implementations • EMNLP (WNUT) 2020 • Wava Carissa Putri, Rani Aulia Hidayat, Isnaini Nurul Khasanah, Rahmad Mahendra

This paper presents Iswara’s participation in the WNUT-2020 Task 2 “Identification of Informative COVID-19 English Tweets using BERT and FastText Embeddings”, which tries to classify whether a certain tweet is considered informative or not.

Task 2 Word Embeddings

Paper
Add Code

A Multi-Pass Sieve Coreference Resolution for Indonesian

no code implementations • RANLP 2021 • Valentina Kania Prameswara Artari, Rahmad Mahendra, Meganingrum Arista Jiwanggi, Adityo Anggraito, Indra Budi

Coreference resolution is an NLP task to find out whether the set of referring expressions belong to the same concept in discourse.

coreference-resolution

Paper
Add Code

MultiLexNorm: A Shared Task on Multilingual Lexical Normalization

1 code implementation • EMNLP (WNUT) 2021 • Rob van der Goot, Alan Ramponi, Arkaitz Zubiaga, Barbara Plank, Benjamin Muller, Iñaki San Vicente Roncal, Nikola Ljubešić, Özlem Çetinoğlu, Rahmad Mahendra, Talha Çolakoğlu, Timothy Baldwin, Tommaso Caselli, Wladimir Sidorenko

This task is beneficial for downstream analysis, as it provides a way to harmonize (often spontaneous) linguistic variation.

Dependency Parsing Lexical Normalization +2

Paper
Code

Cross-Lingual and Supervised Learning Approach for Indonesian Word Sense Disambiguation Task

no code implementations • GWC 2018 • Rahmad Mahendra, Heninggar Septiantri, Haryo Akbarianto Wibowo, Ruli Manurung, Mirna Adriani

Ambiguity is a problem we frequently face in Natural Language Processing.

Word Sense Disambiguation

Paper
Add Code

IndoCulture: Exploring Geographically-Influenced Cultural Commonsense Reasoning Across Eleven Indonesian Provinces

no code implementations • 2 Apr 2024 • Fajri Koto, Rahmad Mahendra, Nurul Aisyah, Timothy Baldwin

Although commonsense reasoning is greatly shaped by cultural and geographical factors, previous studies on language models have predominantly centered on English cultures, potentially resulting in an Anglocentric bias.

Language Modelling

Paper
Add Code

NusaCrowd: Open Source Initiative for Indonesian NLP Resources

1 code implementation • 19 Dec 2022 • Samuel Cahyawijaya, Holy Lovenia, Alham Fikri Aji, Genta Indra Winata, Bryan Wilie, Rahmad Mahendra, Christian Wibisono, Ade Romadhony, Karissa Vincentio, Fajri Koto, JENNIFER SANTOSO, David Moeljadi, Cahya Wirawan, Frederikus Hudi, Ivan Halim Parmonangan, Ika Alfina, Muhammad Satrio Wicaksono, Ilham Firdausi Putra, Samsul Rahmadani, Yulianti Oenang, Ali Akbar Septiandri, James Jaya, Kaustubh D. Dhole, Arie Ardiyanti Suryani, Rifki Afina Putri, Dan Su, Keith Stevens, Made Nindyatama Nityasya, Muhammad Farid Adilazuarda, Ryan Ignatius, Ryandito Diandaru, Tiezheng Yu, Vito Ghifari, Wenliang Dai, Yan Xu, Dyah Damapuspita, Cuk Tho, Ichwanul Muslim Karo Karo, Tirana Noor Fatyanosa, Ziwei Ji, Pascale Fung, Graham Neubig, Timothy Baldwin, Sebastian Ruder, Herry Sujaini, Sakriani Sakti, Ayu Purwarianti

We present NusaCrowd, a collaborative initiative to collect and unify existing resources for Indonesian languages, including opening access to previously non-public resources.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

252

Paper
Code

NusaCrowd: A Call for Open and Reproducible NLP Research in Indonesian Languages

no code implementations • 21 Jul 2022 • Samuel Cahyawijaya, Alham Fikri Aji, Holy Lovenia, Genta Indra Winata, Bryan Wilie, Rahmad Mahendra, Fajri Koto, David Moeljadi, Karissa Vincentio, Ade Romadhony, Ayu Purwarianti

At the center of the underlying issues that halt Indonesian natural language processing (NLP) research advancement, we find data scarcity.

Paper
Add Code

Two-Stage Classifier for COVID-19 Misinformation Detection Using BERT: a Study on Indonesian Tweets

2 code implementations • 30 Jun 2022 • Douglas Raevan Faisal, Rahmad Mahendra

Although there were already several studies related to the detection of misinformation in social media data, most studies focused on the English dataset.

Language Modelling Misinformation +2

Paper
Code

NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages

2 code implementations • 31 May 2022 • Genta Indra Winata, Alham Fikri Aji, Samuel Cahyawijaya, Rahmad Mahendra, Fajri Koto, Ade Romadhony, Kemal Kurniawan, David Moeljadi, Radityo Eko Prasojo, Pascale Fung, Timothy Baldwin, Jey Han Lau, Rico Sennrich, Sebastian Ruder

In this work, we focus on developing resources for languages in Indonesia.

Machine Translation Translation

Paper
Code

One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia

no code implementations • ACL 2022 • Alham Fikri Aji, Genta Indra Winata, Fajri Koto, Samuel Cahyawijaya, Ade Romadhony, Rahmad Mahendra, Kemal Kurniawan, David Moeljadi, Radityo Eko Prasojo, Timothy Baldwin, Jey Han Lau, Sebastian Ruder

NLP research is impeded by a lack of resources and awareness of the challenges presented by underrepresented languages and dialects.

Paper
Add Code

ITTC @ TREC 2021 Clinical Trials Track

no code implementations • 16 Feb 2022 • Thinh Hung Truong, Yulia Otmakhova, Rahmad Mahendra, Timothy Baldwin, Jey Han Lau, Trevor Cohn, Lawrence Cavedon, Damiano Spina, Karin Verspoor

This paper describes the submissions of the Natural Language Processing (NLP) team from the Australian Research Council Industrial Transformation Training Centre (ITTC) for Cognitive Computing in Medical Technologies to the TREC 2021 Clinical Trials Track.

Retrieval

Paper
Add Code

IndoNLI: A Natural Language Inference Dataset for Indonesian

1 code implementation • EMNLP 2021 • Rahmad Mahendra, Alham Fikri Aji, Samuel Louvan, Fahrurrozi Rahman, Clara Vania

The expert-annotated data is used exclusively as a test set.

Natural Language Inference Sentence +1

Paper
Code

UI at SemEval-2020 Task 4: Commonsense Validation and Explanation by Exploiting Contradiction

no code implementations • SEMEVAL 2020 • Kerenza Doxolodeo, Rahmad Mahendra

This paper describes our submissions into the ComVe challenge, the SemEval 2020 Task 4.

Sentence

Paper
Add Code

Semi-Supervised Low-Resource Style Transfer of Indonesian Informal to Formal Language with Iterative Forward-Translation

1 code implementation • 6 Nov 2020 • Haryo Akbarianto Wibowo, Tatag Aziz Prawiro, Muhammad Ihsan, Alham Fikri Aji, Radityo Eko Prasojo, Rahmad Mahendra, Suci Fitriany

In this work, we address a style-transfer from informal to formal Indonesian as a low-resource machine translation problem.

Machine Translation Style Transfer +1

114

Paper
Code

IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding

3 code implementations • Asian Chapter of the Association for Computational Linguistics 2020 • Bryan Wilie, Karissa Vincentio, Genta Indra Winata, Samuel Cahyawijaya, Xiaohong Li, Zhi Yuan Lim, Sidik Soleman, Rahmad Mahendra, Pascale Fung, Syafri Bahar, Ayu Purwarianti

Although Indonesian is known to be the fourth most frequently used language over the internet, the research progress on this language in the natural language processing (NLP) is slow-moving due to a lack of available resources.

Benchmarking Natural Language Understanding +2

503

Paper
Code

Normalization of Indonesian-English Code-Mixed Twitter Data

no code implementations • WS 2019 • Anab Maulana Barik, Rahmad Mahendra, Mirna Adriani

Twitter is an excellent source of data for NLP researches as it offers tremendous amount of textual data.

Language Identification Lexical Normalization +1

Paper
Add Code

Keyphrases Extraction from User-Generated Contents in Healthcare Domain Using Long Short-Term Memory Networks

no code implementations • WS 2018 • Ilham Fathy Saputra, Rahmad Mahendra, Alfan Farizki Wicaksono

We propose keyphrases extraction technique to extract important terms from the healthcare user-generated contents.

Question Answering Text Classification +2

Paper
Add Code

Multi-Task Active Learning for Neural Semantic Role Labeling on Low Resource Conversational Corpus

no code implementations • WS 2018 • Fariz Ikhwantri, Samuel Louvan, Kemal Kurniawan, Bagas Abisena, Valdi Rachman, Alfan Farizki Wicaksono, Rahmad Mahendra

In this paper, we propose a Multi-Task Active Learning framework for Semantic Role Labeling with Entity Recognition (ER) as the auxiliary task to alleviate the need for extensive data and use additional information from ER to help SRL.

Active Learning Multi-Task Learning +1

Paper
Add Code

KOI at SemEval-2018 Task 5: Building Knowledge Graph of Incidents

no code implementations • SEMEVAL 2018 • Paramita Mirza, Fariz Darari, Rahmad Mahendra

We present KOI (Knowledge of Incidents), a system that given news articles as input, builds a knowledge graph (KOI-KG) of incidental events.

Clustering coreference-resolution +6

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.