no code implementations • 4 Apr 2024 • Anwesa Choudhuri, Girish Chowdhary, Alexander G. Schwing
To address these issues, we propose Open-World Video Instance Segmentation and Captioning (OW-VISCap), an approach to jointly segment, track, and caption previously seen or unseen objects in a video.
1 code implementation • CVPR 2023 • Anwesa Choudhuri, Girish Chowdhary, Alexander G. Schwing
We evaluate the proposed approach across three challenging tasks: video instance segmentation, multi-object tracking and segmentation, and video panoptic segmentation.
5 code implementations • 20 Dec 2021 • Bowen Cheng, Anwesa Choudhuri, Ishan Misra, Alexander Kirillov, Rohit Girdhar, Alexander G. Schwing
We find Mask2Former also achieves state-of-the-art performance on video instance segmentation without modifying the architecture, the loss or even the training pipeline.
Ranked #14 on Video Instance Segmentation on YouTube-VIS validation
1 code implementation • ICCV 2021 • Anwesa Choudhuri, Girish Chowdhary, Alexander G. Schwing
In contrast, we formulate a global method for MOTS over the space of assignments rather than detections: First, we find all top-k assignments of objects detected and segmented between any two consecutive frames and develop a structured prediction formulation to score assignment sequences across any number of consecutive frames.
Multi-Object Tracking Multi-Object Tracking and Segmentation +4
no code implementations • 25 Sep 2019 • Anwesa Choudhuri, Ashok Vardhan Makkuva, Ranvir Rana, Sewoong Oh, Girish Chowdhary, Alexander Schwing
%In fact, contrastive disentanglement and unsupervised recovery are often combined in that we seek additional variations that exhibit salient factors/properties.