Search Results for author: Hanxue Zhang

Found 2 papers, 1 papers with code

DriveLM: Driving with Graph Visual Question Answering

1 code implementation21 Dec 2023 Chonghao Sima, Katrin Renz, Kashyap Chitta, Li Chen, Hanxue Zhang, Chengen Xie, Ping Luo, Andreas Geiger, Hongyang Li

The experiments demonstrate that Graph VQA provides a simple, principled framework for reasoning about a driving scene, and DriveLM-Data provides a challenging benchmark for this task.

Autonomous Driving Question Answering +1

Improving Audio Caption Fluency with Automatic Error Correction

no code implementations16 Jun 2023 Hanxue Zhang, Zeyu Xie, Xuenan Xu, Mengyue Wu, Kai Yu

Automated audio captioning (AAC) is an important cross-modality translation task, aiming at generating descriptions for audio clips.

Audio captioning Sentence

Cannot find the paper you are looking for? You can Submit a new open access paper.