Temporal Localization

55 papers with code • 0 benchmarks • 3 datasets

This task has no description! Would you like to contribute one?

Libraries

Use these libraries to find Temporal Localization models and implementations

Most implemented papers

Video Moment Localization using Object Evidence and Reverse Captioning

madhawav/MML 18 Jun 2020

We address the problem of language-based temporal localization of moments in untrimmed videos.

Accelerating COVID-19 Differential Diagnosis with Explainable Ultrasound Image Analysis

ar-ambuj23/covid19_pocus_ultrasound_pytorch 13 Sep 2020

Controlling the COVID-19 pandemic largely hinges upon the existence of fast, safe, and highly-available diagnostic tools.

Human-centric Spatio-Temporal Video Grounding With Visual Transformers

tzhhhh123/HC-STVG 10 Nov 2020

HC-STVG is a video grounding task that requires both spatial (where) and temporal (when) localization.

VLG-Net: Video-Language Graph Matching Network for Video Grounding

Soldelli/VLG-Net 19 Nov 2020

Grounding language queries in videos aims at identifying the time interval (or moment) semantically relevant to a language query.

Boundary-sensitive Pre-training for Temporal Localization in Videos

frostinassiky/bsp ICCV 2021

However, most existing models developed for these tasks are pre-trained on general video action classification tasks.

TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks

HumamAlwassel/TSP 23 Nov 2020

Extensive experiments show that using features trained with our novel pretraining strategy significantly improves the performance of recent state-of-the-art methods on three tasks: Temporal Action Localization, Action Proposal Generation, and Dense Video Captioning.

CityFlow-NL: Tracking and Retrieval of Vehicles at City Scale by Natural Language Descriptions

fredfung007/cityflow-nl 12 Jan 2021

In this paper, we focus on two foundational tasks: the Vehicle Retrieval by NL task and the Vehicle Tracking by NL task, which take advantage of the proposed CityFlow-NL benchmark and provide a strong basis for future research on the multi-target multi-camera tracking by NL description task.

Weakly Supervised Action Selection Learning in Video

layer6ai-labs/ASL CVPR 2021

A common approach is to train a frame-level classifier where frames with the highest class probability are selected to make a video-level prediction.

FineAction: A Fine-Grained Video Dataset for Temporal Action Localization

Richard-61/FineAction 24 May 2021

Temporal action localization (TAL) is an important and challenging problem in video understanding.