no code implementations • 30 Apr 2024 • Tahira Shehzadi, Shalini Sarode, Didier Stricker, Muhammad Zeshan Afzal
However, recent advancements in the field have shifted the focus towards transformer-based techniques, eliminating the need for NMS and emphasizing object queries and attention mechanisms.
no code implementations • 27 Apr 2024 • Tahira Shehzadi, Didier Stricker, Muhammad Zeshan Afzal
This paper navigates the complexities of understanding various elements within document images, such as text, images, tables, and headings.
no code implementations • 2 Apr 2024 • Tahira Shehzadi, Khurram Azeem Hashmi, Didier Stricker, Muhammad Zeshan Afzal
In this paper, we address the limitations of the DETR-based semi-supervised object detection (SSOD) framework, particularly focusing on the challenges posed by the quality of object queries.
Ranked #1 on Semi-Supervised Object Detection on COCO
no code implementations • 23 Jun 2023 • Tahira Shehzadi, Khurram Azeem Hashmi, Didier Stricker, Marcus Liwicki, Muhammad Zeshan Afzal
Upon integrating query modifications in the DETR, we outperform prior works and achieve new state-of-the-art results with the mAP of 96. 9\%, 95. 7\% and 99. 3\% on TableBank, PubLaynet, PubTables, respectively.
Ranked #3 on Document Layout Analysis on PubLayNet val
2 code implementations • 7 Jun 2023 • Tahira Shehzadi, Khurram Azeem Hashmi, Didier Stricker, Muhammad Zeshan Afzal
The astounding performance of transformers in natural language processing (NLP) has motivated researchers to explore their applications in computer vision tasks.
no code implementations • 4 May 2023 • Tahira Shehzadi, Khurram Azeem Hashmi, Didier Stricker, Marcus Liwicki, Muhammad Zeshan Afzal
Table detection is the task of classifying and localizing table objects within document images.