Search Results for author: David Bensaid

Found 1 papers, 1 papers with code

FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions

1 code implementation28 May 2023 Noam Rotstein, David Bensaid, Shaked Brody, Roy Ganz, Ron Kimmel

Our proposed method, FuseCap, fuses the outputs of such vision experts with the original captions using a large language model (LLM), yielding comprehensive image descriptions.

 Ranked #1 on Image Captioning on COCO Captions (CLIPScore metric)

Attribute Image Captioning +5

Cannot find the paper you are looking for? You can Submit a new open access paper.