Search Results for author: Fahim Faisal

Found 20 papers, 9 papers with code

Findings of the VarDial Evaluation Campaign 2022

1 code implementation • VarDial (COLING) 2022 • Noëmi Aepli, Antonios Anastasopoulos, Adrian-Gabriel Chifu, William Domingues, Fahim Faisal, Mihaela Gaman, Radu Tudor Ionescu, Yves Scherrer

This report presents the results of the shared tasks organized as part of the VarDial Evaluation Campaign 2022.

Dialect Identification Extractive Question-Answering +1

Data-Augmentation-Based Dialectal Adaptation for LLMs

2 code implementations • 11 Apr 2024 • Fahim Faisal, Antonios Anastasopoulos

We propose an approach that combines the strengths of different types of language models and leverages data augmentation techniques to improve task performance on three South Slavic dialects: Chakavian, Cherkano, and Torlak.

Data Augmentation Natural Language Understanding

An Efficient Approach for Studying Cross-Lingual Transfer in Multilingual Language Models

1 code implementation • 29 Mar 2024 • Fahim Faisal, Antonios Anastasopoulos

The capacity and effectiveness of pre-trained multilingual models (MLMs) for zero-shot cross-lingual transfer is well established.

Zero-Shot Cross-Lingual Transfer

DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages

1 code implementation • 16 Mar 2024 • Fahim Faisal, Orevaoghene Ahia, Aarohi Srivastava, Kabir Ahuja, David Chiang, Yulia Tsvetkov, Antonios Anastasopoulos

This allows for a comprehensive evaluation of NLP system performance on different language varieties.

To token or not to token: A Comparative Study of Text Representations for Cross-Lingual Transfer

1 code implementation • 12 Oct 2023 • Md Mushfiqur Rahman, Fardin Ahsan Sakib, Fahim Faisal, Antonios Anastasopoulos

To understand the downstream implications of text representation choices, we perform a comparative analysis on language models having diverse text representation modalities including 2 segmentation-based models (BERT, mBERT), 1 image-based model (PIXEL), and 1 character-level model (CANINE).

Cross-Lingual Transfer Dependency Parsing +4

Multilingual Text Representation

no code implementations • 2 Sep 2023 • Fahim Faisal

Modern NLP breakthroughs include large multilingual models capable of performing tasks across more than 100 languages.

Common Sense Reasoning Natural Language Understanding +1

Investigation on Machine Learning Based Approaches for Estimating the Critical Temperature of Superconductors

no code implementations • 2 Aug 2023 • Fatin Abrar Shams, Rashed Hasan Ratul, Ahnaf Islam Naf, Syed Shaek Hossain Samir, Mirza Muntasir Nishat, Fahim Faisal, Md. Ashraful Hoque

To bridge the gap, numerous machine learning techniques have been established to estimate critical temperatures, which are extremely challenging to determine.

Hyperparameter Optimization

GlobalBench: A Benchmark for Global Progress in Natural Language Processing

no code implementations • 24 May 2023 • Yueqi Song, Catherine Cui, Simran Khanuja, PengFei Liu, Fahim Faisal, Alissa Ostapenko, Genta Indra Winata, Alham Fikri Aji, Samuel Cahyawijaya, Yulia Tsvetkov, Antonios Anastasopoulos, Graham Neubig

Despite the major advances in NLP, significant disparities in NLP system performance across languages still exist.

GMNLP at SemEval-2023 Task 12: Sentiment Analysis with Phylogeny-Based Adapters

no code implementations • 25 Apr 2023 • Md Mahfuz ibn Alam, Ruoyu Xie, Fahim Faisal, Antonios Anastasopoulos

This report describes GMU's sentiment analysis system for the SemEval-2023 shared task AfriSenti-SemEval.

Language Modelling Sentiment Analysis

Geographic and Geopolitical Biases of Language Models

no code implementations • 20 Dec 2022 • Fahim Faisal, Antonios Anastasopoulos

Pretrained language models (PLMs) often fail to fairly represent target users from certain world regions because of the under-representation of those regions in training datasets.

Phylogeny-Inspired Adaptation of Multilingual Models to New Languages

1 code implementation • 19 May 2022 • Fahim Faisal, Antonios Anastasopoulos

Large pretrained multilingual models, trained on dozens of languages, have delivered promising results due to cross-lingual learning capabilities on a variety of language tasks.

Cross-Lingual Transfer

Survival Prediction of Children Undergoing Hematopoietic Stem Cell Transplantation Using Different Machine Learning Classifiers by Performing Chi-squared Test and Hyper-parameter Optimization: A Retrospective Analysis

no code implementations • 22 Jan 2022 • Ishrak Jahan Ratul, Ummay Habiba Wani, Mirza Muntasir Nishat, Abdullah Al-Monsur, Abrar Mohammad Ar-Rafi, Fahim Faisal, Mohammad Ridwan Kabir

A synthetic dataset is generated by imputing the missing values, transforming the data using dummy variable encoding, and compressing the dataset from 59 features to the 11 most correlated features using Chi-squared feature selection.

feature selection Survival Prediction

Dataset Geography: Mapping Language Data to Language Users

no code implementations • ACL 2022 • Fahim Faisal, Yinkai Wang, Antonios Anastasopoulos

As language technologies become more ubiquitous, there are increasing efforts towards expanding the language diversity and coverage of natural language processing (NLP) systems.

Learning Cooperation and Online Planning Through Simulation and Graph Convolutional Network

no code implementations • 16 Oct 2021 • Rafid Ameer Mahmud, Fahim Faisal, Saaduddin Mahmud, Md. Mosaddek Khan

Against this background, we introduce a simulation-based online planning algorithm, which we call SiCLOP, for multi-agent cooperative environments.

Behavioural cloning Decision Making +1

SD-QA: Spoken Dialectal Question Answering for the Real World

1 code implementation • Findings (EMNLP) 2021 • Fahim Faisal, Sharlina Keshava, Md Mahfuz ibn Alam, Antonios Anastasopoulos

Question answering (QA) systems are now available through numerous commercial applications for a wide variety of domains, serving millions of users that interact with them via speech interfaces.

Fairness Question Answering +2

Investigating Post-pretraining Representation Alignment for Cross-Lingual Question Answering

no code implementations • EMNLP (MRQA) 2021 • Fahim Faisal, Antonios Anastasopoulos

Human knowledge is collectively encoded in the roughly 6500 languages spoken around the world, but it is not distributed equally across languages.

Cross-Lingual Question Answering

Design, Simulation and Feasibility Analysis of Bifacial Solar PV System in Marine Drive Road, Cox's Bazar

no code implementations • 20 Sep 2021 • Abdullah Al Mehadi, Mirza Muntasir Nishat, Fahim Faisal, Ahmed Raza Hasan Bhuiyan, Mohyeu Hussain, Md Ashraful Hoque

A model road of 200 meters is reconnoitered for energy harvesting by solar power using three prominent software tools, namely PVSOL, PVsyst, and SAM, where a promising mean annual yield of 70492.9 kWh is obtained, and the bifacial gain is calculated to be 12.26%.

Code to Comment Translation: A Comparative Study on Model Effectiveness & Errors

1 code implementation • ACL (NLP4Prog) 2021 • Junayed Mahmud, Fahim Faisal, Raihan Islam Arnob, Antonios Anastasopoulos, Kevin Moran

Automated source code summarization is a popular software engineering research topic wherein machine translation models are employed to "translate" code snippets into relevant natural language descriptions.

Code Summarization Machine Translation +2

Mining Temporal Evolution of Knowledge Graph and Genealogical Features for Literature-based Discovery Prediction

1 code implementation • 22 Jul 2019 • Nazim Choudhury, Fahim Faisal, Matloob Khushi

Existing techniques from Information Retrieval and Natural Language Processing attempt to identify the hidden or unpublished connections between information concepts within published literature; however, these techniques undermine the concept of predicting the future and emerging relations among scientific knowledge components encapsulated within the literature.

Implicit Relations Information Retrieval +2

Disease Identification From Unstructured User Input

no code implementations • 1 May 2019 • Fahim Faisal, Shafkat Ahmed Bhuiyan, Dr. Abu Raihan Mostofa Kamal

A method to identify probable diseases from unstructured textual input (e.g., health forum posts) by incorporating a lexicographic and semantic feature-based two-phase text classification module and a symptom-disease correlation-based similarity measurement module.

General Classification Question Answering +2
