Search Results for author: Gaurav Mittal

Found 16 papers, 5 papers with code

Rule By Example: Harnessing Logical Rules for Explainable Hate Speech Detection

1 code implementation 24 Jul 2023 Christopher Clarke, Matthew Hall, Gaurav Mittal, Ye Yu, Sandra Sajeev, Jason Mars, Mei Chen

In this paper, we present Rule By Example (RBE): a novel exemplar-based contrastive learning approach for learning from logical rules for the task of textual content moderation.

Contrastive Learning, Hate Speech Detection

Rethinking Multimodal Content Moderation from an Asymmetric Angle with Mixed-modality

no code implementations 17 May 2023 Jialin Yuan, Ye Yu, Gaurav Mittal, Matthew Hall, Sandra Sajeev, Mei Chen

There is a rapidly growing need for multimodal content moderation (CM) as more and more content on social media is multimodal in nature.

PivoTAL: Prior-Driven Supervision for Weakly-Supervised Temporal Action Localization

no code implementations CVPR 2023 Mamshad Nayeem Rizve, Gaurav Mittal, Ye Yu, Matthew Hall, Sandra Sajeev, Mubarak Shah, Mei Chen

To address this, we present PivoTAL, Prior-driven Supervision for Weakly-supervised Temporal Action Localization, to approach WTAL from a localization-by-localization perspective by learning to localize the action snippets directly.

Weakly Supervised Action Localization, Weakly Supervised Temporal Action Localization

ProTeGe: Untrimmed Pretraining for Video Temporal Grounding by Video Temporal Grounding

no code implementations CVPR 2023 Lan Wang, Gaurav Mittal, Sandra Sajeev, Ye Yu, Matthew Hall, Vishnu Naresh Boddeti, Mei Chen

We present ProTeGe as the first method to perform VTG-based untrimmed pretraining to bridge the gap between trimmed pretrained backbones and downstream VTG tasks.

text similarity

BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation

no code implementations 1 Aug 2022 Ye Yu, Jialin Yuan, Gaurav Mittal, Li Fuxin, Mei Chen

It captures object motion in the video via a novel optical flow calibration module that fuses the segmentation mask with optical flow estimation to improve within-object optical flow smoothness and reduce noise at object boundaries.

Ranked #1 on Video Object Segmentation on DAVIS 2017 (test-dev) (using extra training data)

Object, Optical Flow Estimation +6

GateHUB: Gated History Unit with Background Suppression for Online Action Detection

no code implementations CVPR 2022 Junwen Chen, Gaurav Mittal, Ye Yu, Yu Kong, Mei Chen

We present GateHUB, Gated History Unit with Background Suppression, that comprises a novel position-guided gated cross-attention mechanism to enhance or suppress parts of the history as per how informative they are for current frame prediction.

Online Action Detection, Optical Flow Estimation

MUSE: Feature Self-Distillation with Mutual Information and Self-Information

no code implementations 25 Oct 2021 Yu Gong, Ye Yu, Gaurav Mittal, Greg Mori, Mei Chen

Importantly, we argue and empirically demonstrate that MUSE, compared to other feature discrepancy functions, is a more functional proxy to introduce dependency and effectively improve the expressivity of all features in the knowledge distillation framework.

Image Classification, Knowledge Distillation +2

On Adversarial Robustness: A Neural Architecture Search perspective

1 code implementation 16 Jul 2020 Chaitanya Devaguptapu, Devansh Agarwal, Gaurav Mittal, Pulkit Gopalani, Vineeth N Balasubramanian

We show that NAS, which is popular for achieving SoTA accuracy, can provide adversarial accuracy as a free add-on without any form of adversarial training.

Adversarial Robustness, Neural Architecture Search

HyperSTAR: Task-Aware Hyperparameters for Deep Networks

no code implementations CVPR 2020 Gaurav Mittal, Chang Liu, Nikolaos Karianakis, Victor Fragoso, Mei Chen, Yun Fu

To reduce HPO time, we present HyperSTAR (System for Task Aware Hyperparameter Recommendation), a task-aware method to warm-start HPO for deep neural networks.

Hyperparameter Optimization, Image Classification

Animating Face using Disentangled Audio Representations

no code implementations 2 Oct 2019 Gaurav Mittal, Baoyuan Wang

All previous methods for audio-driven talking head generation assume the input audio to be clean with a neutral tone.

Representation Learning, Talking Head Generation

Sync-DRAW: Automatic Video Generation using Deep Recurrent Attentive Architectures

1 code implementation 30 Nov 2016 Gaurav Mittal, Tanya Marwah, Vineeth N. Balasubramanian

This paper introduces a novel approach for generating videos called Synchronized Deep Recurrent Attentive Writer (Sync-DRAW).

Text-to-Video Generation, Video Generation
