TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Video Retrieval	MSR-VTT	Kaufman	text-to-video R@1	4.7	# 38
Video Retrieval	MSR-VTT	Kaufman	text-to-video R@10	24.1	# 34
Video Retrieval	MSR-VTT	Kaufman	text-to-video Median Rank	41	# 18
Video Retrieval	MSR-VTT	Kaufman	video-to-text R@5	16.6	# 13

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/temporal-tessellation-a-unified-approach-for/video-retrieval-on-msr-vtt)](https://paperswithcode.com/sota/video-retrieval-on-msr-vtt?p=temporal-tessellation-a-unified-approach-for)`

Temporal Tessellation: A Unified Approach for Video Analysis

ICCV 2017 · Dotan Kaufman, Gil Levi, Tal Hassner, Lior Wolf ·

We present a general approach to video understanding, inspired by semantic transfer techniques that have been successfully used for 2D image analysis. Our method considers a video to be a 1D sequence of clips, each one associated with its own semantics. The nature of these semantics -- natural language captions or other labels -- depends on the task at hand. A test video is processed by forming correspondences between its clips and the clips of reference videos with known semantics, following which, reference semantics can be transferred to the test video. We describe two matching methods, both designed to ensure that (a) reference clips appear similar to test clips and (b), taken together, the semantics of the selected reference clips is consistent and maintains temporal coherence. We use our method for video captioning on the LSMDC'16 benchmark, video summarization on the SumMe and TVSum benchmarks, Temporal Action Detection on the Thumos2014 benchmark, and sound prediction on the Greatest Hits benchmark. Our method not only surpasses the state of the art, in four out of five benchmarks, but importantly, it is the only single method we know of that was successfully applied to such a diverse range of tasks.

PDF Abstract ICCV 2017 PDF ICCV 2017 Abstract

Code

Add Remove Mark official

dot27/temporal-tessellation official

Tasks

Add Remove

Action Detection

Video Captioning

Video Summarization

Video Understanding

Datasets

MSR-VTT

THUMOS14 TVSum

SumMe

LSMDC

Results from the Paper

Edit

Ranked #38 on Video Retrieval on MSR-VTT

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Video Retrieval	MSR-VTT	Kaufman	text-to-video R@1	4.7	# 38	Compare
			text-to-video R@10	24.1	# 34	Compare
			text-to-video Median Rank	41	# 18	Compare
			video-to-text R@5	16.6	# 13	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Temporal Tessellation: A Unified Approach for Video Analysis

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove