Search Results for author: Yogesh S. Rawat

Found 8 papers, 2 papers with code

Robustness Analysis on Foundational Segmentation Models

no code implementations • 15 Jun 2023 • Madeline Chantry Schiappa, Shehreen Azad, Sachidanand VS, Yunhao Ge, Ondrej Miksik, Yogesh S. Rawat, Vibhav Vineet

In this work, we perform a robustness analysis of Visual Foundation Models (VFMs) for segmentation tasks and focus on robustness against real-world distribution shift inspired perturbations.

object-detection Object Detection +1

Paper
Add Code

A Large-Scale Robustness Analysis of Video Action Recognition Models

no code implementations • CVPR 2023 • Madeline Chantry Schiappa, Naman Biyani, Prudvi Kamtam, Shruti Vyas, Hamid Palangi, Vibhav Vineet, Yogesh S. Rawat

In this work, we perform a large-scale robustness analysis of these existing models for video action recognition.

Action Recognition Temporal Action Localization

Paper
Add Code

Hybrid Active Learning via Deep Clustering for Video Action Detection

no code implementations • CVPR 2023 • Aayush J. Rana, Yogesh S. Rawat

This hybrid strategy reduces the annotation cost from two different aspects leading to significant labeling cost reduction.

Action Detection Active Learning +3

Paper
Add Code

SVGraph: Learning Semantic Graphs from Instructional Videos

no code implementations • 16 Jul 2022 • Madeline C. Schiappa, Yogesh S. Rawat

In this work, we focus on generating graphical representations of noisy, instructional videos for video understanding.

Graph Learning Video Understanding

Paper
Add Code

Robustness Analysis of Video-Language Models Against Visual and Language Perturbations

1 code implementation • 5 Jul 2022 • Madeline C. Schiappa, Shruti Vyas, Hamid Palangi, Yogesh S. Rawat, Vibhav Vineet

Joint visual and language modeling on large-scale datasets has recently shown good progress in multi-modal tasks when compared to single modal learning.

Language Modelling Retrieval +2

Paper
Code

Self-Supervised Learning for Videos: A Survey

1 code implementation • 18 Jun 2022 • Madeline C. Schiappa, Yogesh S. Rawat, Mubarak Shah

In this survey, we provide a review of existing approaches on self-supervised learning focusing on the video domain.

Contrastive Learning Domain Generalization +2

Paper
Code

Novel View Video Prediction Using a Dual Representation

no code implementations • 7 Jun 2021 • Sarah Shiraz, Krishna Regmi, Shruti Vyas, Yogesh S. Rawat, Mubarak Shah

We address the problem of novel view video prediction; given a set of input video clips from a single/multiple views, our network is able to predict the video from a novel view.

SSIM Video Prediction

Paper
Add Code

PLM: Partial Label Masking for Imbalanced Multi-label Classification

no code implementations • 22 May 2021 • Kevin Duarte, Yogesh S. Rawat, Mubarak Shah

By stochastically masking labels during loss computation, the method balances this ratio for each class, leading to improved recall on minority classes and improved precision on frequent classes.

Classification Image Classification +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.