Paper tables with annotated results for A Multimodal Dialogue System for Conversational Image Editing

Paper

A Multimodal Dialogue System for Conversational Image Editing

In this paper, we present a multimodal dialogue system for Conversational Image Editing. We formulate our multimodal dialogue system as a Partially Observed Markov Decision Process (POMDP) and trained it with Deep Q-Network (DQN) and a user simulator. Our evaluation shows that the DQN policy outperforms a rule-based baseline policy, achieving 90\% success rate under high error rates. We also conducted a real user study and analyzed real user behavior.

PDF Paper record

Results in Papers With Code

(↓ scroll down to see all results)

A Multimodal Dialogue System for Conversational Image Editing

Reader Guidelines

Editor Guidelines