Search Results for author: Karan Desai

Found 9 papers, 7 papers with code

Hyperbolic Image-Text Representations

1 code implementation18 Apr 2023 Karan Desai, Maximilian Nickel, Tanmay Rajpurohit, Justin Johnson, Ramakrishna Vedantam

Visual and linguistic concepts naturally organize themselves in a hierarchy, where a textual concept "dog" entails all images that contain dogs.

Image Classification Retrieval +1

Learning Visual Representations via Language-Guided Sampling

1 code implementation CVPR 2023 Mohamed El Banani, Karan Desai, Justin Johnson

Our approach diverges from image-based contrastive learning by sampling view pairs using language similarity instead of hand-crafted augmentations or learned clusters.

Contrastive Learning Representation Learning

RedCaps: web-curated image-text data created by the people, for the people

1 code implementation22 Nov 2021 Karan Desai, Gaurav Kaul, Zubin Aysola, Justin Johnson

We introduce RedCaps -- a large-scale dataset of 12M image-text pairs collected from Reddit.

VirTex: Learning Visual Representations from Textual Annotations

3 code implementations CVPR 2021 Karan Desai, Justin Johnson

The de-facto approach to many vision tasks is to start from pretrained visual representations, typically learned via supervised training on ImageNet.

 Ranked #1 on Object Detection on COCO test-dev (Hardware Burden metric)

General Classification Image Captioning +5

Continual Reinforcement Learning in 3D Non-stationary Environments

1 code implementation24 May 2019 Vincenzo Lomonaco, Karan Desai, Eugenio Culurciello, Davide Maltoni

High-dimensional always-changing environments constitute a hard challenge for current reinforcement learning techniques.

reinforcement-learning Reinforcement Learning (RL)

nocaps: novel object captioning at scale

2 code implementations ICCV 2019 Harsh Agrawal, Karan Desai, YuFei Wang, Xinlei Chen, Rishabh Jain, Mark Johnson, Dhruv Batra, Devi Parikh, Stefan Lee, Peter Anderson

To encourage the development of image captioning models that can learn visual concepts from alternative data sources, such as object detection datasets, we present the first large-scale benchmark for this task.

Image Captioning Object +2

Cannot find the paper you are looking for? You can Submit a new open access paper.