Search Results for author: Edouard Belval

Found 2 papers, 0 papers with code

Enhancing Vision-Language Pre-training with Rich Supervisions

no code implementations • 5 Mar 2024 • Yuan Gao, Kunyu Shi, Pengkai Zhu, Edouard Belval, Oren Nuriel, Srikar Appalaraju, Shabnam Ghadar, Vijay Mahadevan, Zhuowen Tu, Stefano Soatto

We propose Strongly Supervised pre-training with ScreenShots (S4) - a novel pre-training paradigm for Vision-Language Models using data from large-scale web screenshot rendering.

Table Detection

MATrIX -- Modality-Aware Transformer for Information eXtraction

no code implementations • 17 May 2022 • Thomas Delteil, Edouard Belval, Lei Chen, Luis Goncalves, Vijay Mahadevan

In these, text semantics and visual information supplement each other to provide a global understanding of the document.

document understanding
