TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Video Retrieval	LSMDC	Large-Scale Discriminative Clustering	text-to-video R@1	7.3	# 35
Video Retrieval	LSMDC	Large-Scale Discriminative Clustering	text-to-video R@5	19.2	# 32
Video Retrieval	LSMDC	Large-Scale Discriminative Clustering	text-to-video R@10	27.1	# 31
Video Retrieval	LSMDC	Large-Scale Discriminative Clustering	text-to-video Median Rank	52	# 21

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/learning-from-video-and-text-via-large-scale/video-retrieval-on-lsmdc)](https://paperswithcode.com/sota/video-retrieval-on-lsmdc?p=learning-from-video-and-text-via-large-scale)`

Learning from Video and Text via Large-Scale Discriminative Clustering

ICCV 2017 · Antoine Miech, Jean-Baptiste Alayrac, Piotr Bojanowski, Ivan Laptev, Josef Sivic ·

Discriminative clustering has been successfully applied to a number of weakly-supervised learning tasks. Such applications include person and action recognition, text-to-video alignment, object co-segmentation and colocalization in videos and images. One drawback of discriminative clustering, however, is its limited scalability. We address this issue and propose an online optimization algorithm based on the Block-Coordinate Frank-Wolfe algorithm. We apply the proposed method to the problem of weakly supervised learning of actions and actors from movies together with corresponding movie scripts. The scaling up of the learning problem to 66 feature length movies enables us to significantly improve weakly supervised action recognition.

PDF Abstract ICCV 2017 PDF ICCV 2017 Abstract

Code

Add Remove Mark official

jpeyre/unrel

antoine77340/iccv17learning

Tasks

Add Remove

Action Recognition

Clustering

Temporal Action Localization

Video Alignment

Video Retrieval

Weakly-Supervised Action Recognition

Weakly-supervised Learning

Datasets

LSMDC

Results from the Paper

Edit

Ranked #35 on Video Retrieval on LSMDC

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Video Retrieval	LSMDC	Large-Scale Discriminative Clustering	text-to-video R@1	7.3	# 35	Compare
			text-to-video R@5	19.2	# 32	Compare
			text-to-video R@10	27.1	# 31	Compare
			text-to-video Median Rank	52	# 21	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Learning from Video and Text via Large-Scale Discriminative Clustering

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove