Search Results for author: Yogesh S. Rawat

Found 8 papers, 2 papers with code

Robustness Analysis on Foundational Segmentation Models

no code implementations15 Jun 2023 Madeline Chantry Schiappa, Shehreen Azad, Sachidanand VS, Yunhao Ge, Ondrej Miksik, Yogesh S. Rawat, Vibhav Vineet

In this work, we perform a robustness analysis of Visual Foundation Models (VFMs) for segmentation tasks and focus on robustness against real-world distribution shift inspired perturbations.

object-detection Object Detection +1

Hybrid Active Learning via Deep Clustering for Video Action Detection

no code implementations CVPR 2023 Aayush J. Rana, Yogesh S. Rawat

This hybrid strategy reduces the annotation cost from two different aspects leading to significant labeling cost reduction.

Action Detection Active Learning +3

SVGraph: Learning Semantic Graphs from Instructional Videos

no code implementations16 Jul 2022 Madeline C. Schiappa, Yogesh S. Rawat

In this work, we focus on generating graphical representations of noisy, instructional videos for video understanding.

Graph Learning Video Understanding

Robustness Analysis of Video-Language Models Against Visual and Language Perturbations

1 code implementation5 Jul 2022 Madeline C. Schiappa, Shruti Vyas, Hamid Palangi, Yogesh S. Rawat, Vibhav Vineet

Joint visual and language modeling on large-scale datasets has recently shown good progress in multi-modal tasks when compared to single modal learning.

Language Modelling Retrieval +2

Self-Supervised Learning for Videos: A Survey

1 code implementation18 Jun 2022 Madeline C. Schiappa, Yogesh S. Rawat, Mubarak Shah

In this survey, we provide a review of existing approaches on self-supervised learning focusing on the video domain.

Contrastive Learning Domain Generalization +2

Novel View Video Prediction Using a Dual Representation

no code implementations7 Jun 2021 Sarah Shiraz, Krishna Regmi, Shruti Vyas, Yogesh S. Rawat, Mubarak Shah

We address the problem of novel view video prediction; given a set of input video clips from a single/multiple views, our network is able to predict the video from a novel view.

SSIM Video Prediction

PLM: Partial Label Masking for Imbalanced Multi-label Classification

no code implementations22 May 2021 Kevin Duarte, Yogesh S. Rawat, Mubarak Shah

By stochastically masking labels during loss computation, the method balances this ratio for each class, leading to improved recall on minority classes and improved precision on frequent classes.

Classification Image Classification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.