TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Video Prediction	Cityscapes	DMVFN	MS-SSIM	0.9573	# 1
Video Prediction	Cityscapes	DMVFN	LPIPS	0.0558	# 1
Video Prediction	DAVIS 2017	DMVFN	MS-SSIM	0.8397	# 1
Video Prediction	DAVIS 2017	DMVFN	LPIPS	0.0996	# 1
Video Prediction	KITTI	DMVFN	MS-SSIM	0.8853	# 1
Video Prediction	KITTI	DMVFN	LPIPS	0.1074	# 1
Video Prediction	Vimeo90K	DMVFN	MS-SSIM	0.9701	# 1
Video Prediction	Vimeo90K	DMVFN	LPIPS	0.0369	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-dynamic-multi-scale-voxel-flow-network-for/video-prediction-on-cityscapes-1)](https://paperswithcode.com/sota/video-prediction-on-cityscapes-1?p=a-dynamic-multi-scale-voxel-flow-network-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-dynamic-multi-scale-voxel-flow-network-for/video-prediction-on-davis-2017)](https://paperswithcode.com/sota/video-prediction-on-davis-2017?p=a-dynamic-multi-scale-voxel-flow-network-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-dynamic-multi-scale-voxel-flow-network-for/video-prediction-on-kitti)](https://paperswithcode.com/sota/video-prediction-on-kitti?p=a-dynamic-multi-scale-voxel-flow-network-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-dynamic-multi-scale-voxel-flow-network-for/video-prediction-on-vimeo90k)](https://paperswithcode.com/sota/video-prediction-on-vimeo90k?p=a-dynamic-multi-scale-voxel-flow-network-for)`

A Dynamic Multi-Scale Voxel Flow Network for Video Prediction

CVPR 2023 · Xiaotao Hu, Zhewei Huang, Ailin Huang, Jun Xu, Shuchang Zhou

Advanced deep neural networks have greatly improved video prediction. However, most current methods have large model sizes and require extra inputs, e.g., semantic or depth maps, to perform well. To improve efficiency, we propose a Dynamic Multi-scale Voxel Flow Network (DMVFN) that uses only RGB images and achieves better video prediction performance at lower computational cost than previous methods. The core of DMVFN is a differentiable routing module that perceives the motion scales of video frames. Once trained, DMVFN adaptively selects a sub-network for each input at inference. Experiments on several benchmarks show that DMVFN is an order of magnitude faster than Deep Voxel Flow and surpasses the state-of-the-art iterative method OPT in generated image quality. Our code and demo are available at https://huxiaotaostasy.github.io/DMVFN/.
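
The routing idea in the abstract (estimate how much motion an input contains, then run only some of the network's blocks) can be illustrated with a toy sketch. This is not the DMVFN implementation: `motion_score`, `route`, `run_blocks`, the thresholds, and the `sharpness` parameter are all hypothetical names and values chosen for illustration, and the hard 0.5 cut-off shown here stands in for whatever differentiable scheme the paper actually trains with.

```python
import math
from typing import Callable, List, Sequence

Frame = List[float]  # a flattened grayscale frame, for illustration only


def motion_score(prev: Frame, curr: Frame) -> float:
    """Mean absolute difference between two frames (hypothetical motion cue)."""
    return sum(abs(a - b) for a, b in zip(prev, curr)) / len(curr)


def route(score: float, thresholds: Sequence[float], sharpness: float = 10.0) -> List[int]:
    """Return one 0/1 gate per block.

    Each gate is a sigmoid of (score - threshold), cut at 0.5. The sigmoid is
    the smooth quantity a trainable router could learn from; the cut is what
    an inference-time selection might look like (an assumption, not the
    paper's exact rule).
    """
    gates = []
    for t in thresholds:
        p = 1.0 / (1.0 + math.exp(-sharpness * (score - t)))
        gates.append(1 if p >= 0.5 else 0)
    return gates


def run_blocks(x: float, blocks: Sequence[Callable[[float], float]], gates: Sequence[int]) -> float:
    """Apply only the blocks whose gate is 1; gated-off blocks are skipped."""
    for block, gate in zip(blocks, gates):
        if gate:
            x = block(x)
    return x


# Three stand-in "blocks"; real blocks would refine a flow estimate.
blocks = [lambda x: x + 1, lambda x: x * 2, lambda x: x - 3]
thresholds = [0.0, 0.2, 0.5]

static_gates = route(motion_score([0.0] * 4, [0.0] * 4), thresholds)
moving_gates = route(motion_score([0.0] * 4, [1.0] * 4), thresholds)

static_out = run_blocks(1.0, blocks, static_gates)
moving_out = run_blocks(1.0, blocks, moving_gates)
```

In this toy setup a static input activates only the first block while a fast-moving input activates all three, so the compute spent depends on the input.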

Code

megvii-research/CVPR2023-DMVFN official

Tasks

Video Prediction

Datasets

Cityscapes

KITTI

DAVIS

DAVIS 2017

Vimeo90K

Results from the Paper

Ranked #1 on Video Prediction on Cityscapes

Methods

DMVFN • OPT
