no code implementations • 7 Dec 2023 • Kairui Yang, Zihao Guo, Gengjie Lin, Haotian Dong, Die Zuo, Jibin Peng, Zhao Huang, Zhecheng Xu, Fupeng Li, Ziyun Bai, Di Lin
To facilitate the research of NLD simulation, we collect the Language-to-Interaction(L2I) benchmark dataset with 120, 000 natural-language descriptions of object interactions in 6 common types of road topologies.
no code implementations • ICCV 2023 • Haotian Dong, Enhui Ma, Lubo Wang, Miaohui Wang, Wuyuan Xie, Qing Guo, Ping Li, Lingyu Liang, Kairui Yang, Di Lin
In this paper, we propose Cross-View Synthesis Transformer (CVSformer), which consists of Multi-View Feature Synthesis and Cross-View Transformer for learning cross-view object relationships.