Search Results for author: Xiantao Cai

Found 3 papers, 2 papers with code

Multi-modal Latent Space Learning for Chain-of-Thought Reasoning in Language Models

no code implementations14 Dec 2023 Liqi He, Zuchao Li, Xiantao Cai, Ping Wang

Overall, our approach offers a more robust and effective solution for multi-modal reasoning in language models, enhancing their ability to tackle complex real-world problems.

Machine Translation

Enhancing Visually-Rich Document Understanding via Layout Structure Modeling

1 code implementation15 Aug 2023 Qiwei Li, Zuchao Li, Xiantao Cai, Bo Du, Hai Zhao

In this paper, we propose GraphLayoutLM, a novel document understanding model that leverages the modeling of layout structure graph to inject document layout knowledge into the model.

document understanding

Cannot find the paper you are looking for? You can Submit a new open access paper.