1 code implementation • CVPR 2023 • Xinmiao Lin, Yikang Li, Jenhao Hsiao, Chiuman Ho, Yu Kong
The popular VQ-VAE models reconstruct images by learning a discrete codebook, but their reconstruction quality degrades rapidly as the compression rate rises.
1 code implementation • 21 Mar 2023 • Jingyang Lin, Hang Hua, Ming Chen, Yikang Li, Jenhao Hsiao, Chiuman Ho, Jiebo Luo
We propose a new joint video and text summarization task.
Ranked #1 on Video Summarization on VideoXum
no code implementations • 19 Aug 2022 • Shichao Xu, Yikang Li, Jenhao Hsiao, Chiuman Ho, Zhu Qi
In computer vision, multi-label recognition is an important task with many real-world applications, but classifying previously unseen labels remains a significant challenge.
no code implementations • 2 Feb 2021 • Jenhao Hsiao, Jiawei Chen, Chiuman Ho
These models are trained by applying a deep CNN to a single clip of fixed temporal length.