no code implementations • 18 Mar 2024 • Xinhao Xiang, Simon Dräger, Jiawei Zhang
The accuracy-speed-memory trade-off is always the priority to consider for several computer vision perception tasks.
no code implementations • 7 Nov 2023 • Xinhao Xiang, Simon Dräger, Jiawei Zhang
We propose the 3DifFusionDet framework in this paper, which structures 3D object detection as a denoising diffusion process from noisy 3D boxes to target boxes.
no code implementations • 7 Nov 2023 • Xinhao Xiang, Jiawei Zhang
Different from the existing 3D object detection approaches, FusionViT is a pure-ViT based framework, which adopts a hierarchical architecture by extending the transformer model to embed both images and point clouds for effective representation learning.