1 code implementation • 7 Mar 2024 • Yunsong Zhou, Linyan Huang, Qingwen Bu, Jia Zeng, Tianyu Li, Hang Qiu, Hongzi Zhu, Minyi Guo, Yu Qiao, Hongyang Li
Hereby, we introduce the Embodied Language Model (ELM), a comprehensive framework tailored for agents' understanding of driving scenes with large spatial and temporal spans.
1 code implementation • 6 Mar 2024 • Quan Liu, Hongzi Zhu, Zhenxi Wang, Yunsong Zhou, Shan Chang, Minyi Guo
Registration of point clouds collected from a pair of distant vehicles provides a comprehensive and accurate 3D view of the driving scenario, which is vital for driving safety related applications, yet existing literature suffers from the expensive pose label acquisition and the deficiency to generalize to new data distributions.
2 code implementations • ICCV 2023 • Quan Liu, Hongzi Zhu, Yunsong Zhou, Hongyang Li, Shan Chang, Minyi Guo
Registration of distant outdoor LiDAR point clouds is crucial to extending the 3D vision of collaborative autonomous vehicles, and yet is challenging due to small overlapping area and a huge disparity between observed point densities.
Ranked #1 on Point Cloud Registration on nuScenes (Distant PCR)
1 code implementation • 4 May 2023 • Quan Liu, Yunsong Zhou, Hongzi Zhu, Shan Chang, Minyi Guo
Such features are then used for online distant point cloud registration.
Ranked #3 on Point Cloud Registration on nuScenes (Distant PCR)
no code implementations • CVPR 2023 • Yunsong Zhou, Hongzi Zhu, Quan Liu, Shan Chang, Minyi Guo
Mobile monocular 3D object detection (Mono3D) (e. g., on a vehicle, a drone, or a robot) is an important yet challenging task.
no code implementations • 23 Mar 2023 • Yunsong Zhou, Quan Liu, Hongzi Zhu, Yunzhe Li, Shan Chang, Minyi Guo
To this end, we utilize a pose detection network to estimate the pose of the camera and then construct a feature map portraying pixel-level ground depth according to the 3D-to-2D perspective geometry.
no code implementations • CVPR 2021 • Yunsong Zhou, Yuan He, Hongzi Zhu, Cheng Wang, Hongyang Li, Qinhong Jiang
Due to the lack of insight in industrial application, existing methods on open datasets neglect the camera pose information, which inevitably results in the detector being susceptible to camera extrinsic parameters.
Ranked #9 on Monocular 3D Object Detection on KITTI Cars Moderate (using extra training data)
no code implementations • ICCV 2021 • Yunsong Zhou, Hongzi Zhu, Chunqin Li, Tiankai Cui, Shan Chang, Minyi Guo
In this paper, we propose a light-weight semantic segmentation framework for large-scale point cloud series, called TempNet, which can improve both the accuracy and the stability of existing semantic segmentation models by combining a novel frame aggregation scheme.
no code implementations • 2 Apr 2019 • Yuting Jia, Haiwen Wang, Shuo Shao, Huan Long, Yunsong Zhou, Xinbing Wang
Based on this new algorithm, four common geometric properties shared by the activation spaces are concluded, which gives a rather clear description of the activation spaces.