no code implementations • 29 Mar 2024 • Robik Shrestha, Yang Zou, Qiuyu Chen, Zhiheng Li, Yusheng Xie, Siqi Deng
In this work, we introduce Fair Retrieval Augmented Generation (FairRAG), a novel framework that conditions pre-trained generative models on reference images retrieved from an external image database to improve fairness in human generation.
no code implementations • 20 Dec 2023 • Yunye Gong, Robik Shrestha, Jared Claypoole, Michael Cogswell, Arijit Ray, Christopher Kanan, Ajay Divakaran
We propose a novel VQA dataset, BloomVQA, to facilitate comprehensive evaluation of large vision-language models on comprehension tasks.
1 code implementation • 5 Apr 2022 • Robik Shrestha, Kushal Kafle, Christopher Kanan
We propose a new direction: modifying the network architecture to impose inductive biases that make the network robust to dataset bias.
Ranked #3 on Action Recognition on BAR
1 code implementation • 1 Apr 2021 • Robik Shrestha, Kushal Kafle, Christopher Kanan
We introduce a new dataset called Biased MNIST that enables assessment of robustness to multiple bias sources.
no code implementations • 4 Mar 2021 • Usman Mahmood, Robik Shrestha, David D. B. Bates, Lorenzo Mannelli, Giuseppe Corrias, Yusuf Erdi, Christopher Kanan
Artificial intelligence (AI) has been successful at solving numerous problems in machine perception.
no code implementations • NeurIPS 2020 • Damien Teney, Kushal Kafle, Robik Shrestha, Ehsan Abbasnejad, Christopher Kanan, Anton Van Den Hengel
Out-of-distribution (OOD) testing is increasingly popular for evaluating a machine learning system's ability to generalize beyond the biases of a training set.
1 code implementation • ACL 2020 • Robik Shrestha, Kushal Kafle, Christopher Kanan
Existing Visual Question Answering (VQA) methods tend to exploit dataset biases and spurious statistical correlations, instead of producing right answers for the right reasons.
1 code implementation • ECCV 2020 • Tyler L. Hayes, Kushal Kafle, Robik Shrestha, Manoj Acharya, Christopher Kanan
While there is neuroscientific evidence that the brain replays compressed memories, existing methods for convolutional networks replay raw images.
1 code implementation • 5 Aug 2019 • Kushal Kafle, Robik Shrestha, Brian Price, Scott Cohen, Christopher Kanan
Chart question answering (CQA) is a newly proposed visual question answering (VQA) task where an algorithm must answer questions about data visualizations, e. g. bar charts, pie charts, and line graphs.
no code implementations • 19 Apr 2019 • Kushal Kafle, Robik Shrestha, Christopher Kanan
Language grounded image understanding tasks have often been proposed as a method for evaluating progress in artificial intelligence.
2 code implementations • CVPR 2019 • Robik Shrestha, Kushal Kafle, Christopher Kanan
Visual Question Answering (VQA) research is split into two camps: the first focuses on VQA datasets that require natural image understanding and the second focuses on synthetic datasets that test reasoning.