Search Results for author: Prithvijit Chattopadhyay

Found 19 papers, 11 papers with code

We're Not Using Videos Effectively: An Updated Domain Adaptive Video Segmentation Baseline

1 code implementation • 1 Feb 2024 • Simar Kareer, Vivek Vijaykumar, Harsh Maheshwari, Prithvijit Chattopadhyay, Judy Hoffman, Viraj Prabhu

While the vast majority of prior work has studied this as a frame-level Image-DAS problem, a few Video-DAS works have sought to additionally leverage the temporal signal present in adjacent frames.

Benchmarking Semantic Segmentation +3

Paper
Code

SkyScenes: A Synthetic Dataset for Aerial Scene Understanding

no code implementations • 11 Dec 2023 • Sahil Khose, Anisha Pal, Aayushi Agarwal, Deepanshi, Judy Hoffman, Prithvijit Chattopadhyay

Real-world aerial scene understanding is limited by a lack of datasets that contain densely annotated images curated under a diverse set of conditions.

Scene Understanding

Paper
Add Code

AUGCAL: Improving Sim2Real Adaptation by Uncertainty Calibration on Augmented Synthetic Images

no code implementations • 11 Dec 2023 • Prithvijit Chattopadhyay, Bharat Goyal, Boglarka Ecsedi, Viraj Prabhu, Judy Hoffman

Synthetic data (SIM) drawn from simulators have emerged as a popular alternative for training models where acquiring annotated real-world images is difficult.

Unsupervised Domain Adaptation

Paper
Add Code

Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks

2 code implementations • NeurIPS 2023 • Micah Goldblum, Hossein Souri, Renkun Ni, Manli Shu, Viraj Prabhu, Gowthami Somepalli, Prithvijit Chattopadhyay, Mark Ibrahim, Adrien Bardes, Judy Hoffman, Rama Chellappa, Andrew Gordon Wilson, Tom Goldstein

Battle of the Backbones (BoB) makes this choice easier by benchmarking a diverse suite of pretrained models, including vision-language models, those trained via self-supervised learning, and the Stable Diffusion backbone, across a diverse set of computer vision tasks ranging from classification to object detection to OOD generalization and more.

Benchmarking object-detection +2

179

Paper
Code

LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images

2 code implementations • NeurIPS 2023 • Viraj Prabhu, Sriram Yenamandra, Prithvijit Chattopadhyay, Judy Hoffman

We propose an automated algorithm to stress-test a trained visual model by generating language-guided counterfactual test images (LANCE).

counterfactual Language Modelling +1

Paper
Code

Benchmarking Low-Shot Robustness to Natural Distribution Shifts

1 code implementation • ICCV 2023 • Aaditya Singh, Kartik Sarangmath, Prithvijit Chattopadhyay, Judy Hoffman

Robustness to natural distribution shifts has seen remarkable progress thanks to recent pre-training strategies combined with better fine-tuning methods.

Benchmarking

Paper
Code

PASTA: Proportional Amplitude Spectrum Training Augmentation for Syn-to-Real Domain Generalization

1 code implementation • ICCV 2023 • Prithvijit Chattopadhyay, Kartik Sarangmath, Vivek Vijaykumar, Judy Hoffman

Synthetic data offers the promise of cheap and bountiful training data for settings where labeled real-world data is scarce.

Domain Generalization object-detection +3

Paper
Code

RobustNav: Towards Benchmarking Robustness in Embodied Navigation

1 code implementation • ICCV 2021 • Prithvijit Chattopadhyay, Judy Hoffman, Roozbeh Mottaghi, Aniruddha Kembhavi

As an attempt towards assessing the robustness of embodied navigation agents, we propose RobustNav, a framework to quantify the performance of embodied navigation agents when exposed to a wide variety of visual - affecting RGB inputs - and dynamics - affecting transition dynamics - corruptions.

Benchmarking Data Augmentation +1

Paper
Code

Learning to Balance Specificity and Invariance for In and Out of Domain Generalization

1 code implementation • ECCV 2020 • Prithvijit Chattopadhyay, Yogesh Balaji, Judy Hoffman

For domain generalization, the goal is to learn from a set of source domains to produce a single model that will best generalize to an unseen target domain.

Ranked #24 on Domain Generalization on DomainNet

Domain Generalization Specificity

Paper
Code

Likelihood Landscapes: A Unifying Principle Behind Many Adversarial Defenses

no code implementations • 25 Aug 2020 • Fu Lin, Rohit Mittapalli, Prithvijit Chattopadhyay, Daniel Bolya, Judy Hoffman

Convolutional Neural Networks have been shown to be vulnerable to adversarial examples, which are known to locate in subspaces close to where normal data lies but are not naturally occurring and of low probability.

Adversarial Defense Adversarial Robustness

Paper
Add Code

DS-VIC: Unsupervised Discovery of Decision States for Transfer in RL

no code implementations • 25 Sep 2019 • Nirbhay Modhe, Prithvijit Chattopadhyay, Mohit Sharma, Abhishek Das, Devi Parikh, Dhruv Batra, Ramakrishna Vedantam

We learn to identify decision states, namely the parsimonious set of states where decisions meaningfully affect the future states an agent can reach in an environment.

Paper
Add Code

Improving Generative Visual Dialog by Answering Diverse Questions

1 code implementation • IJCNLP 2019 • Vishvak Murahari, Prithvijit Chattopadhyay, Dhruv Batra, Devi Parikh, Abhishek Das

Prior work on training generative Visual Dialog models with reinforcement learning(Das et al.) has explored a Qbot-Abot image-guessing game and shown that this 'self-talk' approach can lead to improved performance at the downstream dialog-conditioned image-guessing task.

Representation Learning Visual Dialog

Paper
Code

IR-VIC: Unsupervised Discovery of Sub-goals for Transfer in RL

no code implementations • 24 Jul 2019 • Nirbhay Modhe, Prithvijit Chattopadhyay, Mohit Sharma, Abhishek Das, Devi Parikh, Dhruv Batra, Ramakrishna Vedantam

We propose a novel framework to identify sub-goals useful for exploration in sequential decision making tasks under partial observability.

Decision Making Hierarchical Reinforcement Learning

Paper
Add Code

EvalAI: Towards Better Evaluation Systems for AI Agents

3 code implementations • 10 Feb 2019 • Deshraj Yadav, Rishabh Jain, Harsh Agrawal, Prithvijit Chattopadhyay, Taranjeet Singh, Akash Jain, Shiv Baran Singh, Stefan Lee, Dhruv Batra

We introduce EvalAI, an open source platform for evaluating and comparing machine learning (ML) and artificial intelligence algorithms (AI) at scale.

Benchmarking BIG-bench Machine Learning

1,685

Paper
Code

Do Explanations make VQA Models more Predictable to a Human?

no code implementations • EMNLP 2018 • Arjun Chandrasekaran, Viraj Prabhu, Deshraj Yadav, Prithvijit Chattopadhyay, Devi Parikh

A rich line of research attempts to make deep neural networks more transparent by generating human-interpretable 'explanations' of their decision process, especially for interactive tasks like Visual Question Answering (VQA).

Question Answering Visual Question Answering

Paper
Add Code

Choose Your Neuron: Incorporating Domain Knowledge through Neuron-Importance

1 code implementation • ECCV 2018 • Ramprasaath R. Selvaraju, Prithvijit Chattopadhyay, Mohamed Elhoseiny, Tilak Sharma, Dhruv Batra, Devi Parikh, Stefan Lee

Our approach, which we call Neuron Importance-AwareWeight Transfer (NIWT), learns to map domain knowledge about novel "unseen" classes onto this dictionary of learned concepts and then optimizes for network parameters that can effectively combine these concepts - essentially learning classifiers by discovering and composing learned semantic concepts in deep networks.

Generalized Zero-Shot Learning

Paper
Code

Evaluating Visual Conversational Agents via Cooperative Human-AI Games

no code implementations • 17 Aug 2017 • Prithvijit Chattopadhyay, Deshraj Yadav, Viraj Prabhu, Arjun Chandrasekaran, Abhishek Das, Stefan Lee, Dhruv Batra, Devi Parikh

This suggests a mismatch between benchmarking of AI in isolation and in the context of human-AI teams.

Benchmarking

Paper
Add Code

It Takes Two to Tango: Towards Theory of AI's Mind

no code implementations • 3 Apr 2017 • Arjun Chandrasekaran, Deshraj Yadav, Prithvijit Chattopadhyay, Viraj Prabhu, Devi Parikh

Surprisingly, we find that having access to the model's internal states - its confidence in its top-k predictions, explicit or implicit attention maps which highlight regions in the image (and words in the question) the model is looking at (and listening to) while answering a question about an image - do not help people better predict its behavior.

Attribute Question Answering +2

Paper
Add Code

Counting Everyday Objects in Everyday Scenes

1 code implementation • CVPR 2017 • Prithvijit Chattopadhyay, Ramakrishna Vedantam, Ramprasaath R. Selvaraju, Dhruv Batra, Devi Parikh

In this work, we build dedicated models for counting designed to tackle the large variance in counts, appearances, and scales of objects found in natural scenes.

Ranked #1 on Object Counting on COCO count-test

Object Object Counting +4

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.