1 code implementation • 12 Jan 2024 • Seongyun Lee, Seungone Kim, Sue Hyun Park, Geewook Kim, Minjoon Seo
Assessing long-form responses generated by Vision-Language Models (VLMs) is challenging.
1 code implementation • 13 Nov 2023 • Seongyun Lee, Sue Hyun Park, Yongrae Jo, Minjoon Seo
Building on this approach, we introduce Volcano, a multimodal self-feedback guided revision model.
Ranked #43 on Visual Question Answering on MM-Vet