Search Results for author: Kanta Kaneda

Found 4 papers, 4 papers with code

Polos: Multimodal Metric Learning from Human Feedback for Image Captioning

1 code implementation • 28 Feb 2024 • Yuiga Wada, Kanta Kaneda, Daichi Saito, Komei Sugiura

Establishing an automatic evaluation metric that closely aligns with human judgments is essential for effectively developing image captioning models.

Contrastive Learning Image Captioning +1

Paper
Code

Learning-To-Rank Approach for Identifying Everyday Objects Using a Physical-World Search Engine

1 code implementation • 26 Dec 2023 • Kanta Kaneda, Shunya Nagashima, Ryosuke Korekata, Motonari Kambara, Komei Sugiura

Therefore, we focus on the task of retrieving target objects from open-vocabulary user instructions in a human-in-the-loop setting, which we define as the learning-to-rank physical objects (LTRPO) task.

Learning-To-Rank

Paper
Code

DialMAT: Dialogue-Enabled Transformer with Moment-Based Adversarial Training

1 code implementation • 12 Nov 2023 • Kanta Kaneda, Ryosuke Korekata, Yuiga Wada, Shunya Nagashima, Motonari Kambara, Yui Iioka, Haruka Matsuo, Yuto Imai, Takayuki Nishimura, Komei Sugiura

This paper focuses on the DialFRED task, which is the task of embodied instruction following in a setting where an agent can actively ask questions about the task.

Instruction Following Position

Paper
Code

JaSPICE: Automatic Evaluation Metric Using Predicate-Argument Structures for Image Captioning Models

1 code implementation • 7 Nov 2023 • Yuiga Wada, Kanta Kaneda, Komei Sugiura

Image captioning studies heavily rely on automatic evaluation metrics such as BLEU and METEOR.

Image Captioning

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.