Search Results for author: Xiaohui Jiang

Found 3 papers, 2 papers with code

OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning

1 code implementation2 May 2024 Shihao Wang, Zhiding Yu, Xiaohui Jiang, Shiyi Lan, Min Shi, Nadine Chang, Jan Kautz, Ying Li, Jose M. Alvarez

We further propose OmniDrive-nuScenes, a new visual question-answering dataset challenging the true 3D situational awareness of a model with comprehensive visual question-answering (VQA) tasks, including scene description, traffic regulation, 3D grounding, counterfactual reasoning, decision making and planning.

Autonomous Driving counterfactual +4

Focal-PETR: Embracing Foreground for Efficient Multi-Camera 3D Object Detection

no code implementations11 Dec 2022 Shihao Wang, Xiaohui Jiang, Ying Li

The 3D-to-2D perspective inconsistency and global attention lead to a weak correlation between foreground tokens and queries, resulting in slow convergence.

3D Object Detection object-detection

Cannot find the paper you are looking for? You can Submit a new open access paper.