no code implementations • 25 Apr 2024 • Zhimeng Zheng, Tao Huang, Gongsheng Li, Zuyi Wang
In this paper, we propose a cross-architecture knowledge distillation method for MDE, dubbed DisDepth, to enhance efficient CNN models with the supervision of state-of-the-art transformer models.