Search Results for author: Pingjie Wang

M2K-VDG: Model-Adaptive Multimodal Knowledge Anchor Enhanced Video-grounded Dialogue Generation

Video-grounded dialogue generation (VDG) requires the system to generate a fluent and accurate answer based on multimodal knowledge.

Paper
Add Code

Generating dialogue grounded in videos requires a high level of understanding and reasoning about the visual scenes in the videos.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.