no code implementations • 28 Feb 2024 • Jian Liu, Sipeng Zhang, Chuixin Kong, Wenyuan Zhang, Yuhang Wu, Yikang Ding, Borun Xu, Ruibo Ming, Donglai Wei, Xianming Liu
This technical report presents our solution, "occTransformer" for the 3D occupancy prediction track in the autonomous driving challenge at CVPR 2023.
no code implementations • 5 Apr 2023 • Donglai Wei, Sipeng Zhang, Tong Yang, Yang Liu, Jing Liu
On the other hand, the Masking Caption Modeling (MCM) loss leverages a masked captions prediction task to establish detailed and generic relationships between textual and visual parts.
1 code implementation • 22 Jun 2022 • Chuyang Zhao, Haobo Chen, Wenyuan Zhang, Junru Chen, Sipeng Zhang, Yadong Li, Boxun Li
Natural language (NL) based vehicle retrieval aims to search specific vehicle given text description.