Search Results for author: Yuchi Ishikawa

Found 3 papers, 2 papers with code

Leveraging Image-Text Similarity and Caption Modification for the DataComp Challenge: Filtering Track and BYOD Track

no code implementations • 23 Oct 2023 • Shuhei Yokoo, Peifei Zhu, Yuchi Ishikawa, Mikihiro Tanaka, Masayoshi Kondo, Hirokatsu Kataoka

Our solution adopts large multimodal models CLIP and BLIP-2 to filter and modify web crawl data, and utilize external datasets along with a bag of tricks to improve the data quality.

text similarity

Paper
Add Code

Alleviating Over-segmentation Errors by Detecting Action Boundaries

2 code implementations • 14 Jul 2020 • Yuchi Ishikawa, Seito Kasai, Yoshimitsu Aoki, Hirokatsu Kataoka

Our model architecture consists of a long-term feature extractor and two branches: the Action Segmentation Branch (ASB) and the Boundary Regression Branch (BRB).

Ranked #9 on Action Segmentation on GTEA

Action Classification Action Segmentation +2

Paper
Code

Retrieving and Highlighting Action with Spatiotemporal Reference

1 code implementation • 19 May 2020 • Seito Kasai, Yuchi Ishikawa, Masaki Hayashi, Yoshimitsu Aoki, Kensho Hara, Hirokatsu Kataoka

In this paper, we present a framework that jointly retrieves and spatiotemporally highlights actions in videos by enhancing current deep cross-modal retrieval methods.

Action Recognition Cross-Modal Retrieval +5

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.