Search Results for author: Dipika Singhania

Found 7 papers, 5 papers with code

C2F-TCN: A Framework for Semi and Fully Supervised Temporal Action Segmentation

no code implementations • 20 Dec 2022 • Dipika Singhania, Rahul Rahaman, Angela Yao

For the task of temporal action segmentation, we propose an encoder-decoder-style architecture named C2F-TCN featuring a "coarse-to-fine" ensemble of decoder outputs.

Action Segmentation Decoder +2

A Generalized & Robust Framework For Timestamp Supervision in Temporal Action Segmentation

no code implementations • 20 Jul 2022 • Rahul Rahaman, Dipika Singhania, Alexandre Thiery, Angela Yao

In temporal action segmentation, timestamp supervision requires only a handful of labelled frames per video sequence.

Action Segmentation TAG

Assembly101: A Large-Scale Multi-View Video Dataset for Understanding Procedural Activities

1 code implementation • CVPR 2022 • Fadime Sener, Dibyadip Chatterjee, Daniel Shelepov, Kun He, Dipika Singhania, Robert Wang, Angela Yao

Assembly101 is a new procedural activity dataset featuring 4321 videos of people assembling and disassembling 101 "take-apart" toy vehicles.

3D Action Recognition Action Anticipation +2

Iterative Contrast-Classify For Semi-supervised Temporal Action Segmentation

1 code implementation • 2 Dec 2021 • Dipika Singhania, Rahul Rahaman, Angela Yao

Our method hinges on unsupervised representation learning, which, for temporal action segmentation, poses unique challenges.

Action Segmentation Representation Learning +2

Coarse to Fine Multi-Resolution Temporal Convolutional Network

1 code implementation • 23 May 2021 • Dipika Singhania, Rahul Rahaman, Angela Yao

In this work, we propose a novel temporal encoder-decoder to tackle the problem of sequence fragmentation.

Ranked #3 on Action Segmentation on Assembly101

Action Segmentation Decoder +3

Rethinking CNN Models for Audio Classification

3 code implementations • 22 Jul 2020 • Kamalesh Palanisamy, Dipika Singhania, Angela Yao

Besides, we show that even though we use the pretrained model weights for initialization, there is variance in performance in various output runs of the same model.

Environmental Sound Classification General Classification +2

Temporal Aggregate Representations for Long-Range Video Understanding

2 code implementations • ECCV 2020 • Fadime Sener, Dipika Singhania, Angela Yao

Future prediction, especially in long-range videos, requires reasoning from current and past observations.

Ranked #2 on Action Anticipation on Assembly101

Action Anticipation Action Recognition +4
