no code implementations • 26 Feb 2024 • Jinxu Zhang, Yongqi Yu, Yu Zhang
Document Visual Question Answering (DVQA) is a task that involves responding to queries based on the content of images.
Language Modelling Large Language Model +3