Recurrent Partial Kernel Network for Efficient Optical Flow Estimation

Optical flow estimation is a challenging task consisting of predicting per-pixel motion vectors between images. Recent methods have employed larger and more complex models to improve the estimation accuracy. However, this impacts the widespread adoption of optical flow methods and makes it harder to train more general models since the optical flow data is hard to obtain. This paper proposes a small and efficient model for optical flow estimation. We design a new spatial recurrent encoder that extracts discriminative features at a significantly reduced size. Unlike standard recurrent units, we utilize Partial Kernel Convolution (PKConv) layers to produce variable multi-scale features with a single shared block. We also design efficient Separable Large Kernels (SLK) to capture large context information with low computational cost. Experiments on public benchmarks show that we achieve state-of-the-art generalization performance while requiring significantly fewer parameters and memory than competing methods. Our model ranks first in the Spring benchmark without finetuning, improving the results by over 10% while requiring an order of magnitude fewer FLOPs and over four times less memory than the following published method without finetuning.

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Optical Flow Estimation KITTI 2015 RPKNet Fl-all 4.64 # 3
Fl-fg 4.69 # 5
Optical Flow Estimation KITTI 2015 (train) RPKNet F1-all 13.0 # 1
EPE 3.79 # 2
Optical Flow Estimation Sintel-clean RPKNet Average End-Point Error 1.315 # 5
Optical Flow Estimation Sintel-final RPKNet Average End-Point Error 2.657 # 8
Optical Flow Estimation Spring RPKNet 1px total 4.809 # 2

Methods