Story Visualization

20 papers with code • 3 benchmarks • 1 datasets

Story Visualization is the task of generating coherent and aligned sequence of images given a sequence of textual captions representing description of a story. It mainly consists of two tasks: story generation and story continuation, where story continuation uses additional ground truth information in the form of the first frame.

Benchmarks

Add a Result

These leaderboards are used to track progress in Story Visualization

Dataset	Best Model	Compare
Pororo	AR-LDM	See all
CLEVR-SV	Impartial Transformer	See all
Zero-Shot Action Execution DiDeMO-CSV	Phenaki-Gen	See all

Datasets

StoryBench

Latest papers with no code

Most implemented Social Latest No code

CogCartoon: Towards Practical Story Visualization

no code yet • 17 Dec 2023

The state-of-the-art methods for story visualization demonstrate a significant demand for training data and storage, as well as limited flexibility in story presentation, thereby rendering them impractical for real-world applications.

Paper
Add Code

Make-A-Storyboard: A General Framework for Storyboard with Disentangled and Merged Control

no code yet • 6 Dec 2023

Story Visualization aims to generate images aligned with story prompts, reflecting the coherence of storybooks through visual consistency among characters and scenes. Whereas current approaches exclusively concentrate on characters and neglect the visual consistency among contextually correlated scenes, resulting in independent character images without inter-image coherence. To tackle this issue, we propose a new presentation form for Story Visualization called Storyboard, inspired by film-making, as illustrated in Fig. 1. Specifically, a Storyboard unfolds a story into visual representations scene by scene.

Paper
Add Code

AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort

no code yet • 19 Nov 2023

We empirically find that sparse control conditions, such as bounding boxes, are suitable for layout planning, while dense control conditions, e. g., sketches and keypoints, are suitable for generating high-quality image content.

Paper
Add Code

Style Generation: Image Synthesis based on Coarsely Matched Texts

no code yet • 8 Sep 2023

In this work, we attempt to stylize an input image using such coarsely matched text as guidance.

Paper
Add Code

Improved Visual Story Generation with Adaptive Context Modeling

no code yet • 26 May 2023

Diffusion models developed on top of powerful text-to-image generation models like Stable Diffusion achieve remarkable success in visual story generation.

Paper
Add Code

Counterfactual Edits for Generative Evaluation

no code yet • 2 Mar 2023

Evaluation of generative models has been an underrepresented field despite the surge of generative architectures.

Paper
Add Code

An Impartial Transformer for Story Visualization

no code yet • 9 Jan 2023

Story Visualization is an advanced task of computed vision that targets sequential image synthesis, where the generated samples need to be realistic, faithful to their conditioning and sequentially consistent.

Paper
Add Code

Learning to Model Multimodal Semantic Alignment for Story Visualization

no code yet • 14 Nov 2022

Story visualization aims to generate a sequence of images to narrate each sentence in a multi-sentence story, where the images should be realistic and keep global consistency across dynamic scenes and characters.

Paper
Add Code