Rethinking RAFT for Efficient Optical Flow

1 Jan 2024  ·  Navid Eslami, Farnoosh Arefi, Amir M. Mansourian, Shohreh Kasaei ·

Despite significant progress in deep learning-based optical flow methods, accurately estimating large displacements and repetitive patterns remains a challenge. The limitations of local features and similarity search patterns used in these algorithms contribute to this issue. Additionally, some existing methods suffer from slow runtime and excessive graphic memory consumption. To address these problems, this paper proposes a novel approach based on the RAFT framework. The proposed Attention-based Feature Localization (AFL) approach incorporates the attention mechanism to handle global feature extraction and address repetitive patterns. It introduces an operator for matching pixels with corresponding counterparts in the second frame and assigning accurate flow values. Furthermore, an Amorphous Lookup Operator (ALO) is proposed to enhance convergence speed and improve RAFTs ability to handle large displacements by reducing data redundancy in its search operator and expanding the search space for similarity extraction. The proposed method, Efficient RAFT (Ef-RAFT),achieves significant improvements of 10% on the Sintel dataset and 5% on the KITTI dataset over RAFT. Remarkably, these enhancements are attained with a modest 33% reduction in speed and a mere 13% increase in memory usage. The code is available at: https://github.com/n3slami/Ef-RAFT

PDF Abstract

Datasets


Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Optical Flow Estimation KITTI 2015 (train) Ef-RAFT F1-all 16.45 # 6
EPE 4.83 # 7
Optical Flow Estimation Sintel-clean Ef-RAFT Average End-Point Error 1.27 # 4
Optical Flow Estimation Sintel-final Ef-RAFT Average End-Point Error 2.60 # 6

Methods