Search Results for author: Yuchi Ishikawa

Found 3 papers, 2 papers with code

Leveraging Image-Text Similarity and Caption Modification for the DataComp Challenge: Filtering Track and BYOD Track

no code implementations23 Oct 2023 Shuhei Yokoo, Peifei Zhu, Yuchi Ishikawa, Mikihiro Tanaka, Masayoshi Kondo, Hirokatsu Kataoka

Our solution adopts large multimodal models CLIP and BLIP-2 to filter and modify web crawl data, and utilize external datasets along with a bag of tricks to improve the data quality.

text similarity

Alleviating Over-segmentation Errors by Detecting Action Boundaries

2 code implementations14 Jul 2020 Yuchi Ishikawa, Seito Kasai, Yoshimitsu Aoki, Hirokatsu Kataoka

Our model architecture consists of a long-term feature extractor and two branches: the Action Segmentation Branch (ASB) and the Boundary Regression Branch (BRB).

Action Classification Action Segmentation +2

Retrieving and Highlighting Action with Spatiotemporal Reference

1 code implementation19 May 2020 Seito Kasai, Yuchi Ishikawa, Masaki Hayashi, Yoshimitsu Aoki, Kensho Hara, Hirokatsu Kataoka

In this paper, we present a framework that jointly retrieves and spatiotemporally highlights actions in videos by enhancing current deep cross-modal retrieval methods.

Action Recognition Cross-Modal Retrieval +5

Cannot find the paper you are looking for? You can Submit a new open access paper.