Search Results for author: Fahim Faisal

Found 20 papers, 9 papers with code

Data-Augmentation-Based Dialectal Adaptation for LLMs

2 code implementations 11 Apr 2024 Fahim Faisal, Antonios Anastasopoulos

We propose an approach that combines the strengths of different types of language models and leverages data augmentation techniques to improve task performance on three South Slavic dialects: Chakavian, Cherkano, and Torlak.

Data Augmentation Natural Language Understanding

An Efficient Approach for Studying Cross-Lingual Transfer in Multilingual Language Models

1 code implementation 29 Mar 2024 Fahim Faisal, Antonios Anastasopoulos

The capacity and effectiveness of pre-trained multilingual models (MLMs) for zero-shot cross-lingual transfer are well established.

Zero-Shot Cross-Lingual Transfer

To token or not to token: A Comparative Study of Text Representations for Cross-Lingual Transfer

1 code implementation 12 Oct 2023 Md Mushfiqur Rahman, Fardin Ahsan Sakib, Fahim Faisal, Antonios Anastasopoulos

To understand the downstream implications of text representation choices, we perform a comparative analysis on language models having diverse text representation modalities including 2 segmentation-based models (BERT, mBERT), 1 image-based model (PIXEL), and 1 character-level model (CANINE).

Cross-Lingual Transfer Dependency Parsing +4

Multilingual Text Representation

no code implementations 2 Sep 2023 Fahim Faisal

Modern NLP breakthroughs include large multilingual models capable of performing tasks across more than 100 languages.

Common Sense Reasoning Natural Language Understanding +1

Geographic and Geopolitical Biases of Language Models

no code implementations 20 Dec 2022 Fahim Faisal, Antonios Anastasopoulos

Pretrained language models (PLMs) often fail to fairly represent target users from certain world regions because of the under-representation of those regions in training datasets.

Phylogeny-Inspired Adaptation of Multilingual Models to New Languages

1 code implementation 19 May 2022 Fahim Faisal, Antonios Anastasopoulos

Large pretrained multilingual models, trained on dozens of languages, have delivered promising results due to cross-lingual learning capabilities on a variety of language tasks.

Cross-Lingual Transfer

Dataset Geography: Mapping Language Data to Language Users

no code implementations ACL 2022 Fahim Faisal, Yinkai Wang, Antonios Anastasopoulos

As language technologies become more ubiquitous, there are increasing efforts towards expanding the language diversity and coverage of natural language processing (NLP) systems.

Learning Cooperation and Online Planning Through Simulation and Graph Convolutional Network

no code implementations 16 Oct 2021 Rafid Ameer Mahmud, Fahim Faisal, Saaduddin Mahmud, Md. Mosaddek Khan

Against this background, we introduce a simulation-based online planning algorithm, which we call SiCLOP, for multi-agent cooperative environments.

Behavioural cloning Decision Making +1

SD-QA: Spoken Dialectal Question Answering for the Real World

1 code implementation Findings (EMNLP) 2021 Fahim Faisal, Sharlina Keshava, Md Mahfuz ibn Alam, Antonios Anastasopoulos

Question answering (QA) systems are now available through numerous commercial applications for a wide variety of domains, serving millions of users that interact with them via speech interfaces.

Fairness Question Answering +2

Investigating Post-pretraining Representation Alignment for Cross-Lingual Question Answering

no code implementations EMNLP (MRQA) 2021 Fahim Faisal, Antonios Anastasopoulos

Human knowledge is collectively encoded in the roughly 6500 languages spoken around the world, but it is not distributed equally across languages.

Cross-Lingual Question Answering

Design, Simulation and Feasibility Analysis of Bifacial Solar PV System in Marine Drive Road, Cox's Bazar

no code implementations 20 Sep 2021 Abdullah Al Mehadi, Mirza Muntasir Nishat, Fahim Faisal, Ahmed Raza Hasan Bhuiyan, Mohyeu Hussain, Md Ashraful Hoque

A model road of 200 meters is reconnoitered for energy harvesting by solar power using three prominent software tools, namely PVSOL, PVsyst, and SAM, where a promising mean annual yield of 70492.9 kWh is obtained, and the bifacial gain is calculated to be 12.26%.

Code to Comment Translation: A Comparative Study on Model Effectiveness & Errors

1 code implementation ACL (NLP4Prog) 2021 Junayed Mahmud, Fahim Faisal, Raihan Islam Arnob, Antonios Anastasopoulos, Kevin Moran

Automated source code summarization is a popular software engineering research topic wherein machine translation models are employed to "translate" code snippets into relevant natural language descriptions.

Code Summarization Machine Translation +2

Mining Temporal Evolution of Knowledge Graph and Genealogical Features for Literature-based Discovery Prediction

1 code implementation 22 Jul 2019 Nazim Choudhury, Fahim Faisal, Matloob Khushi

Existing techniques from Information Retrieval and Natural Language Processing attempt to identify the hidden or unpublished connections between information concepts within published literature; however, these techniques undermine the concept of predicting the future and emerging relations among scientific knowledge components encapsulated within the literature.

Implicit Relations Information Retrieval +2

Disease Identification From Unstructured User Input

no code implementations 1 May 2019 Fahim Faisal, Shafkat Ahmed Bhuiyan, Dr. Abu Raihan Mostofa Kamal

A method to identify probable diseases from unstructured textual input (e.g., health forum posts) by incorporating a lexicographic and semantic feature-based two-phase text classification module and a symptom-disease correlation-based similarity measurement module.

General Classification Question Answering +2
