Search Results for author: Yamato Okamoto

Found 4 papers, 0 papers with code

CREPE: Coordinate-Aware End-to-End Document Parser

no code implementations • 1 May 2024 • Yamato Okamoto, Youngmin Baek, Geewook Kim, Ryota Nakao, Donghyun Kim, Moon Bin Yim, Seunghyun Park, Bado Lee

CREPE's abilities, including OCR and semantic parsing, not only mitigate error propagation issues in existing OCR-dependent methods but also significantly enhance the functionality of sequence generation models, ushering in a new era for document understanding studies.

Document Understanding, Optical Character Recognition (OCR), +3

The Effects of Short Video-Sharing Services on Video Copy Detection

no code implementations • 26 Mar 2024 • Rintaro Yanagi, Yamato Okamoto, Shuhei Yokoo, Shin'ichi Satoh

From the experimental results focusing on segment-level and video-level situations, we can see three effects: "Segment-level VCD in short video-sharing services is more difficult than that in general video-sharing services", "Video-level VCD in short video-sharing services is easier than that in general video-sharing services", and "The video alignment component mainly suppresses the detection performance in short video-sharing services".

Copy Detection, Video Alignment

Constructing Image-Text Pair Dataset from Books

no code implementations • 3 Oct 2023 • Yamato Okamoto, Haruto Toyonaga, Yoshihisa Ijiri, Hirokatsu Kataoka

Digital archiving is becoming widespread owing to its effectiveness in protecting valuable books and providing knowledge to many people electronically.

Optical Character Recognition (OCR), Retrieval, +1
