Search Results for author: Robik Shrestha

Found 11 papers, 6 papers with code

FairRAG: Fair Human Generation via Fair Retrieval Augmentation

no code implementations • 29 Mar 2024 • Robik Shrestha, Yang Zou, Qiuyu Chen, Zhiheng Li, Yusheng Xie, Siqi Deng

In this work, we introduce Fair Retrieval Augmented Generation (FairRAG), a novel framework that conditions pre-trained generative models on reference images retrieved from an external image database to improve fairness in human generation.

Fairness Image Generation +1

Paper
Add Code

BloomVQA: Assessing Hierarchical Multi-modal Comprehension

no code implementations • 20 Dec 2023 • Yunye Gong, Robik Shrestha, Jared Claypoole, Michael Cogswell, Arijit Ray, Christopher Kanan, Ajay Divakaran

We propose a novel VQA dataset, BloomVQA, to facilitate comprehensive evaluation of large vision-language models on comprehension tasks.

Data Augmentation Memorization +2

Paper
Add Code

OccamNets: Mitigating Dataset Bias by Favoring Simpler Hypotheses

1 code implementation • 5 Apr 2022 • Robik Shrestha, Kushal Kafle, Christopher Kanan

We propose a new direction: modifying the network architecture to impose inductive biases that make the network robust to dataset bias.

Ranked #3 on Action Recognition on BAR

Action Recognition

Paper
Code

Are Bias Mitigation Techniques for Deep Learning Effective?

1 code implementation • 1 Apr 2021 • Robik Shrestha, Kushal Kafle, Christopher Kanan

We introduce a new dataset called Biased MNIST that enables assessment of robustness to multiple bias sources.

Question Answering Visual Question Answering

Paper
Code

Detecting Spurious Correlations with Sanity Tests for Artificial Intelligence Guided Radiology Systems

no code implementations • 4 Mar 2021 • Usman Mahmood, Robik Shrestha, David D. B. Bates, Lorenzo Mannelli, Giuseppe Corrias, Yusuf Erdi, Christopher Kanan

Artificial intelligence (AI) has been successful at solving numerous problems in machine perception.

Paper
Add Code

On the Value of Out-of-Distribution Testing: An Example of Goodhart's Law

no code implementations • NeurIPS 2020 • Damien Teney, Kushal Kafle, Robik Shrestha, Ehsan Abbasnejad, Christopher Kanan, Anton Van Den Hengel

Out-of-distribution (OOD) testing is increasingly popular for evaluating a machine learning system's ability to generalize beyond the biases of a training set.

Model Selection Question Answering +1

Paper
Add Code

Visual Grounding Methods for VQA are Working for the Wrong Reasons!

1 code implementation • ACL 2020 • Robik Shrestha, Kushal Kafle, Christopher Kanan

Existing Visual Question Answering (VQA) methods tend to exploit dataset biases and spurious statistical correlations, instead of producing right answers for the right reasons.

Question Answering Visual Grounding +1

Paper
Code

REMIND Your Neural Network to Prevent Catastrophic Forgetting

1 code implementation • ECCV 2020 • Tyler L. Hayes, Kushal Kafle, Robik Shrestha, Manoj Acharya, Christopher Kanan

While there is neuroscientific evidence that the brain replays compressed memories, existing methods for convolutional networks replay raw images.

Quantization Question Answering +1

Paper
Code

Answering Questions about Data Visualizations using Efficient Bimodal Fusion

1 code implementation • 5 Aug 2019 • Kushal Kafle, Robik Shrestha, Brian Price, Scott Cohen, Christopher Kanan

Chart question answering (CQA) is a newly proposed visual question answering (VQA) task where an algorithm must answer questions about data visualizations, e. g. bar charts, pie charts, and line graphs.

Ranked #1 on Visual Question Answering (VQA) on DVQA test-familiar

Chart Question Answering Optical Character Recognition +3

Paper
Code

Challenges and Prospects in Vision and Language Research

no code implementations • 19 Apr 2019 • Kushal Kafle, Robik Shrestha, Christopher Kanan

Language grounded image understanding tasks have often been proposed as a method for evaluating progress in artificial intelligence.

Natural Language Understanding

Paper
Add Code

Answer Them All! Toward Universal Visual Question Answering Models

2 code implementations • CVPR 2019 • Robik Shrestha, Kushal Kafle, Christopher Kanan

Visual Question Answering (VQA) research is split into two camps: the first focuses on VQA datasets that require natural image understanding and the second focuses on synthetic datasets that test reasoning.

Question Answering Visual Question Answering

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.