no code implementations • 18 Jan 2024 • Xin Yuan, Jinoo Baek, Keyang Xu, Omer Tov, Hongliang Fei
We propose an efficient diffusion-based text-to-video super-resolution (SR) tuning approach that leverages the readily learned capacity of a pixel-level image diffusion model to capture spatial information for video generation.