Computer Vision

Temporal Localization

55 papers with code • 0 benchmarks • 3 datasets

This task has no description! Would you like to contribute one?

Benchmarks

Add a Result

These leaderboards are used to track progress in Temporal Localization

You can find evaluation results in the subtasks. You can also submitting evaluation metrics for this task.

Libraries

Use these libraries to find Temporal Localization models and implementations

google-research/scenic

2 papers

3,029

Datasets

Subtasks

Most implemented papers

Most implemented Social Latest No code

Video Moment Localization using Object Evidence and Reverse Captioning

madhawav/MML • • 18 Jun 2020

We address the problem of language-based temporal localization of moments in untrimmed videos.

1

Paper
Code

Accelerating COVID-19 Differential Diagnosis with Explainable Ultrasound Image Analysis

ar-ambuj23/covid19_pocus_ultrasound_pytorch • • 13 Sep 2020

Controlling the COVID-19 pandemic largely hinges upon the existence of fast, safe, and highly-available diagnostic tools.

1

Paper
Code

Human-centric Spatio-Temporal Video Grounding With Visual Transformers

tzhhhh123/HC-STVG • 10 Nov 2020

HC-STVG is a video grounding task that requires both spatial (where) and temporal (when) localization.

1

Paper
Code

VLG-Net: Video-Language Graph Matching Network for Video Grounding

Soldelli/VLG-Net • • 19 Nov 2020

Grounding language queries in videos aims at identifying the time interval (or moment) semantically relevant to a language query.

1

Paper
Code

Boundary-sensitive Pre-training for Temporal Localization in Videos

frostinassiky/bsp • ICCV 2021

However, most existing models developed for these tasks are pre-trained on general video action classification tasks.

1

Paper
Code

TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks

HumamAlwassel/TSP • • 23 Nov 2020

Extensive experiments show that using features trained with our novel pretraining strategy significantly improves the performance of recent state-of-the-art methods on three tasks: Temporal Action Localization, Action Proposal Generation, and Dense Video Captioning.

1

Paper
Code

CityFlow-NL: Tracking and Retrieval of Vehicles at City Scale by Natural Language Descriptions

fredfung007/cityflow-nl • • 12 Jan 2021

In this paper, we focus on two foundational tasks: the Vehicle Retrieval by NL task and the Vehicle Tracking by NL task, which take advantage of the proposed CityFlow-NL benchmark and provide a strong basis for future research on the multi-target multi-camera tracking by NL description task.

1

Paper
Code

Learning Salient Boundary Feature for Anchor-free Temporal Action Localization

TencentYoutuResearch/ActionDetection-AFSD • • CVPR 2021

Temporal action localization is an important yet challenging task in video understanding.

1

Paper
Code

Weakly Supervised Action Selection Learning in Video

layer6ai-labs/ASL • • CVPR 2021

A common approach is to train a frame-level classifier where frames with the highest class probability are selected to make a video-level prediction.

1

Paper
Code

FineAction: A Fine-Grained Video Dataset for Temporal Action Localization

Richard-61/FineAction • • 24 May 2021

Temporal action localization (TAL) is an important and challenging problem in video understanding.

1

Paper
Code