no code implementations • 28 Mar 2024 • Eri Onami, Shuhei Kurita, Taiki Miyanishi, Taro Watanabe
Document question answering is a task of question answering on given documents such as reports, slides, pamphlets, and websites, and it is a truly demanding task as paper and electronic forms of documents are so common in our society.
1 code implementation • ICCV 2023 • Shuhei Kurita, Naoki Katsura, Eri Onami
In the conventional referring expression comprehension tasks of images, however, datasets are mostly constructed based on the web-crawled data and don't reflect diverse real-world structures on the task of grounding textual expressions in diverse objects in the real world.