no code implementations • 22 Apr 2024 • Chen Xu, Tianhui Song, Weixin Feng, Xubin Li, Tiezheng Ge, Bo Zheng, LiMin Wang
Diffusion models have significantly advanced the state of the art in image, audio, and video generation tasks.
1 code implementation • NeurIPS 2023 • Yutao Cui, Tianhui Song, Gangshan Wu, LiMin Wang
Our key design is to introduce four special prediction tokens and concatenate them with the tokens from target template and search areas.