Search Results for author: Zhaoxu Tian

Found 1 papers, 1 papers with code

Learning Long-form Video Prior via Generative Pre-Training

1 code implementation24 Apr 2024 Jinheng Xie, Jiajun Feng, Zhaoxu Tian, Kevin Qinghong Lin, Yawen Huang, Xi Xia, Nanxu Gong, Xu Zuo, Jiaqi Yang, Yefeng Zheng, Mike Zheng Shou

Instead of operating on pixel space, it is efficient to employ visual locations like bounding boxes and keypoints to represent key information in videos, which can be simply discretized and then tokenized for consumption by GPT.

Cannot find the paper you are looking for? You can Submit a new open access paper.