Search Results for author: Yi Tu

Found 6 papers, 2 papers with code

Rethinking the Evaluation of Pre-trained Text-and-Layout Models from an Entity-Centric Perspective

no code implementations • 4 Feb 2024 • Chong Zhang, Yixi Zhao, Chenshu Yuan, Yi Tu, Ya Guo, Qi Zhang

Therefore, we claim the necessary standards for an ideal benchmark to evaluate the information extraction ability of PTLMs.

Entity Linking

Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction

1 code implementation • 17 Oct 2023 • Chong Zhang, Ya Guo, Yi Tu, Huan Chen, Jinyang Tang, Huijia Zhu, Qi Zhang, Tao Gui

However, the BIO-tagging scheme relies on the correct order of model inputs, which is not guaranteed in real-world NER on scanned VrDs, where text is recognized and arranged by OCR systems.

Entity Linking, Key Information Extraction (+9 more)

LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding

no code implementations • 30 May 2023 • Yi Tu, Ya Guo, Huan Chen, Jinyang Tang

LayoutMask can enhance the interactions between text and layout modalities in a unified model and produce adaptive and robust multi-modal representations for downstream tasks.

Document Image Classification, Document Understanding (+7 more)

Image Cropping with Composition and Saliency Aware Aesthetic Score Map

no code implementations • 24 Nov 2019 • Yi Tu, Li Niu, Weijie Zhao, Dawei Cheng, Liqing Zhang

Aesthetic image cropping is a practical but challenging task which aims at finding the best crops with the highest aesthetic quality in an image.

Image Cropping

Learning from Web Data with Self-Organizing Memory Module

no code implementations • CVPR 2020 • Yi Tu, Li Niu, Junjie Chen, Dawei Cheng, Liqing Zhang

However, crawled web images usually have two types of noise, label noise and background noise, which make them harder to use effectively.

Image Classification
