no code implementations • 19 Feb 2024 • Hongcheng Liu, Pingjie Wang, Yu Wang, Yanfeng Wang
Video-grounded dialogue generation (VDG) requires the system to generate a fluent and accurate answer based on multimodal knowledge.
no code implementations • 26 Sep 2023 • Hongcheng Liu, Zhe Chen, Hui Li, Pingjie Wang, Yanfeng Wang, Yu Wang
Generating dialogue grounded in videos requires a high level of understanding and reasoning about the visual scenes in the videos.