Search Results for author: Md Rizwan Parvez

Found 10 papers, 5 papers with code

Improving Language Models Trained with Translated Data via Continual Pre-Training and Dictionary Learning Analysis

no code implementations • 23 May 2024 • Sabri Boughorbel, Md Rizwan Parvez, Majd Hawasly

We translate TinyStories, a dataset of 2.2M short stories for 3-4 year old children, from English to Arabic using the free NLLB-3B MT model.

Data Augmentation Dictionary Learning +3

MapCoder: Multi-Agent Code Generation for Competitive Problem Solving

1 code implementation • 18 May 2024 • Md. Ashraful Islam, Mohammed Eunus Ali, Md Rizwan Parvez

In this paper, we introduce a new approach to code generation tasks leveraging multi-agent prompting that uniquely replicates the full cycle of program synthesis as observed in human developers.

Code Generation Program Synthesis

ChartInstruct: Instruction Tuning for Chart Comprehension and Reasoning

no code implementations • 14 Mar 2024 • Ahmed Masry, Mehrad Shahmohammadi, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty

Further evaluation shows that our instruction-tuning approach supports a wide array of real-world chart comprehension and reasoning scenarios, thereby expanding the scope and applicability of our models to new kinds of tasks.

Instruction Following Question Answering

Evidence to Generate (E2G): A Single-agent Two-step Prompting for Context Grounded and Retrieval Augmented Reasoning

no code implementations • 11 Jan 2024 • Md Rizwan Parvez

Instead of unverified reasoning claims, this innovative approach leverages the power of "evidence for decision making" by first focusing exclusively on the thought sequences (the series of intermediate steps) explicitly mentioned in the context, which then serve as extracted evidence, guiding the LLM's output generation process with greater precision and efficiency.

Decision Making Hallucination +1

DelucionQA: Detecting Hallucinations in Domain-specific Question Answering

no code implementations • 8 Dec 2023 • Mobashir Sadat, Zhengyu Zhou, Lukas Lange, Jun Araki, Arsalan Gundroo, Bingqing Wang, Rakesh R Menon, Md Rizwan Parvez, Zhe Feng

Hallucination is a well-known phenomenon in text generated by large language models (LLMs).

Hallucination Information Retrieval +2

Learning to Filter Context for Retrieval-Augmented Generation

1 code implementation • 14 Nov 2023 • Zhiruo Wang, Jun Araki, Zhengbao Jiang, Md Rizwan Parvez, Graham Neubig

To alleviate these problems, we propose FILCO, a method that improves the quality of the context provided to the generator by (1) identifying useful context based on lexical and information-theoretic approaches, and (2) training context filtering models that can filter retrieved contexts at test time.

Extractive Question-Answering Fact Verification +2

xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval

3 code implementations • 6 Mar 2023 • Mohammad Abdullah Matin Khan, M Saiful Bari, Xuan Long Do, Weishi Wang, Md Rizwan Parvez, Shafiq Joty

Recently, pre-trained large language models (LLMs) have shown impressive abilities in generating code from natural language descriptions, repairing buggy code, translating code between languages, and retrieving relevant code segments.

Program Repair Retrieval

Retrieval Enhanced Data Augmentation for Question Answering on Privacy Policies

no code implementations • 19 Apr 2022 • Md Rizwan Parvez, Jianfeng Chi, Wasi Uddin Ahmad, Yuan Tian, Kai-Wei Chang

Prior studies in privacy policies frame the question answering (QA) task as identifying the most relevant text segment or a list of sentences from a policy document given a user query.

Data Augmentation Question Answering +1

Retrieval Augmented Code Generation and Summarization

1 code implementation • Findings (EMNLP) 2021 • Md Rizwan Parvez, Wasi Uddin Ahmad, Saikat Chakraborty, Baishakhi Ray, Kai-Wei Chang

To mimic developers' code or summary generation behavior, we propose a retrieval augmented framework, REDCODER, that retrieves relevant code or summaries from a retrieval database and provides them as a supplement to code generation or summarization models.

Ranked #1 on Code Generation on CodeXGLUE - CodeSearchNet (using extra training data)

Code Generation Code Summarization +1

Evaluating the Values of Sources in Transfer Learning

1 code implementation • NAACL 2021 • Md Rizwan Parvez, Kai-Wei Chang

Transfer learning that adapts a model trained on data-rich sources to low-resource targets has been widely applied in natural language processing (NLP).

Ranked #1 on Cross-Lingual POS Tagging on Universal Dependency Treebank

Cross-Lingual POS Tagging Transfer Learning
