Search Results for author: David Bensaid

Found 1 papers, 1 papers with code

FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions

1 code implementation • 28 May 2023 • Noam Rotstein, David Bensaid, Shaked Brody, Roy Ganz, Ron Kimmel

Our proposed method, FuseCap, fuses the outputs of such vision experts with the original captions using a large language model (LLM), yielding comprehensive image descriptions.

Ranked #1 on Image Captioning on COCO Captions (CLIPScore metric)

Attribute Image Captioning +5

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.