Temporal Localization
55 papers with code • 0 benchmarks • 3 datasets
Benchmarks
These leaderboards are used to track progress in Temporal Localization
Libraries
Use these libraries to find Temporal Localization models and implementationsMost implemented papers
Video Moment Localization using Object Evidence and Reverse Captioning
We address the problem of language-based temporal localization of moments in untrimmed videos.
Accelerating COVID-19 Differential Diagnosis with Explainable Ultrasound Image Analysis
Controlling the COVID-19 pandemic largely hinges upon the existence of fast, safe, and highly-available diagnostic tools.
Human-centric Spatio-Temporal Video Grounding With Visual Transformers
HC-STVG is a video grounding task that requires both spatial (where) and temporal (when) localization.
VLG-Net: Video-Language Graph Matching Network for Video Grounding
Grounding language queries in videos aims at identifying the time interval (or moment) semantically relevant to a language query.
Boundary-sensitive Pre-training for Temporal Localization in Videos
However, most existing models developed for these tasks are pre-trained on general video action classification tasks.
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks
Extensive experiments show that using features trained with our novel pretraining strategy significantly improves the performance of recent state-of-the-art methods on three tasks: Temporal Action Localization, Action Proposal Generation, and Dense Video Captioning.
CityFlow-NL: Tracking and Retrieval of Vehicles at City Scale by Natural Language Descriptions
In this paper, we focus on two foundational tasks: the Vehicle Retrieval by NL task and the Vehicle Tracking by NL task, which take advantage of the proposed CityFlow-NL benchmark and provide a strong basis for future research on the multi-target multi-camera tracking by NL description task.
Learning Salient Boundary Feature for Anchor-free Temporal Action Localization
Temporal action localization is an important yet challenging task in video understanding.
Weakly Supervised Action Selection Learning in Video
A common approach is to train a frame-level classifier where frames with the highest class probability are selected to make a video-level prediction.
FineAction: A Fine-Grained Video Dataset for Temporal Action Localization
Temporal action localization (TAL) is an important and challenging problem in video understanding.