1 code implementation • 7 Mar 2024 • Shengyun Peng, Seongmin Lee, XiaoJing Wang, Rajarajeswari Balasubramaniyan, Duen Horng Chau
Tables convey factual and quantitative data with implicit conventions created by humans that are often challenging for machines to parse.
1 code implementation • 23 Feb 2024 • Shengyun Peng, Seongmin Lee, XiaoJing Wang, Rajarajeswari Balasubramaniyan, Duen Horng Chau
We discover that the performance gap between the linear projection transformer and the hybrid CNN-transformer can be mitigated by SSP of the visual encoder in the TSR model.
2 code implementations • 9 Nov 2023 • Shengyun Peng, Seongmin Lee, XiaoJing Wang, Rajarajeswari Balasubramaniyan, Duen Horng Chau
This allows it to "see" an appropriate portion of the table and "store" the complex table structure within sufficient context length for the subsequent transformer.
Ranked #3 on Table Recognition on PubTabNet