no code implementations • 30 Nov 2023 • Rohan Myer Krishnan, Zitian Tang, Zhiqiu Yu, Chen Sun
To do this, video-language models must be able to obtain structured understandings, such as the temporal segmentation of a demonstration into sequences of actions and skills, and to generalize the understandings to novel domains.
no code implementations • 22 May 2023 • Yiwen Huang, Zhiqiu Yu, Xinjie Yi, Yue Wang, James Tompkin
This results in a new model that effectively removes the quality tax between 3DMM conditioned face GANs and the unconditional StyleGAN.