no code implementations • 7 Apr 2024 • Xingtong Ge, Jixiang Luo, Xinjie Zhang, Tongda Xu, Guo Lu, Dailan He, Jing Geng, Yan Wang, Jun Zhang, Hongwei Qin
Prior research on deep video compression (DVC) for machine tasks typically necessitates training a unique codec for each specific task, mandating a dedicated decoder per task.
1 code implementation • 13 Mar 2024 • Xinjie Zhang, Xingtong Ge, Tongda Xu, Dailan He, Yan Wang, Hongwei Qin, Guo Lu, Jing Geng, Jun Zhang
In response, we propose a groundbreaking paradigm of image representation and compression by 2D Gaussian Splatting, named GaussianImage.
1 code implementation • 26 Feb 2024 • Chunyi Li, Guo Lu, Donghui Feng, HaoNing Wu, ZiCheng Zhang, Xiaohong Liu, Guangtao Zhai, Weisi Lin, Wenjun Zhang
With the evolution of storage and communication protocols, ultra-low bitrate image compression has become a highly demanding topic.
no code implementations • 2 Feb 2024 • ZhiYu Zhang, Guo Lu, Huanxiong Liang, Anni Tang, Qiang Hu, Li Song
Volumetric videos, benefiting from immersive 3D realism and interactivity, hold vast potential for various applications, while the tremendous data volume poses significant challenges for compression.
no code implementations • 29 Nov 2023 • Chen Zhu, Guo Lu, Bing He, Rong Xie, Li Song
To further enhance the reconstruction quality from the INR codec, we leverage the high-quality reconstructed frames from the explicit codec to achieve inter-view compensation.
no code implementations • ICCV 2023 • Yuan Tian, Guo Lu, Guangtao Zhai, Zhiyong Gao
Most video compression methods aim to improve the decoded video visual quality, instead of particularly guaranteeing the semantic-completeness, which deteriorates downstream video analysis tasks, e. g., action recognition.
no code implementations • 20 Dec 2022 • Guanbo Pan, Guo Lu, Zhihao Hu, Dong Xu
Although several content adaptive methods have been proposed by updating the encoder-side components, the adaptability of both latents and the decoder is not well exploited.
no code implementations • CVPR 2022 • Zhihao Hu, Guo Lu, Jinyang Guo, Shan Liu, Wei Jiang, Dong Xu
The previous deep video compression approaches only use the single scale motion compensation strategy and rarely adopt the mode prediction technique from the traditional standards like H. 264/H. 265 for both motion and residual compression.
no code implementations • 12 Jun 2022 • Guo Lu, Xingtong Ge, Tianxiong Zhong, Jing Geng, Qiang Hu
Specifically, we propose a neural preprocessing module before the encoder to maintain the useful semantic information for the downstream tasks and suppress the irrelevant information for bitrate saving.
no code implementations • 6 Feb 2022 • Yuan Tian, Guo Lu, Yichao Yan, Guangtao Zhai, Li Chen, Zhiyong Gao
However, in real-world scenarios, the videos are first compressed before the transportation and then decompressed for understanding.
no code implementations • CVPR 2022 • Zhenghao Chen, Guo Lu, Zhihao Hu, Shan Liu, Wei Jiang, Dong Xu
In this work, we propose the first end-to-end optimized framework for compressing automotive stereo videos (i. e., stereo videos from autonomous driving applications) from both left and right views.
no code implementations • CVPR 2022 • Guo Lu, Tianxiong Zhong, Jing Geng, Qiang Hu, Dong Xu
Specifically, given the image in the reference modality (e. g., the infrared image), we use the channel-wise alignment module to produce the aligned features based on the affine transform.
1 code implementation • ICCV 2021 • Yuan Tian, Guo Lu, Xiongkuo Min, Zhaohui Che, Guangtao Zhai, Guodong Guo, Zhiyong Gao
After optimization, the downscaled video by our framework preserves more meaningful information, which is beneficial for both the upscaling step and the downstream tasks, e. g., video action recognition task.
no code implementations • 26 May 2021 • Jinyang Guo, Dong Xu, Guo Lu
Furthermore, to achieve variable bitrate decoding with one single decoder, we propose a bitrate adaptive module to project the representation from a base bitrate to the expected representation at a target bitrate for transmission.
no code implementations • CVPR 2021 • Zhihao Hu, Guo Lu, Dong Xu
In this work, we propose a feature-space video coding network (FVC) by performing all major operations (i. e., motion estimation, motion compression, motion compensation and residual compression) in the feature space.
no code implementations • CVPR 2021 • Zizheng Que, Guo Lu, Dong Xu
In this paper, we propose a two-stage deep learning framework called VoxelContext-Net for both static and dynamic point cloud compression.
no code implementations • 10 Dec 2020 • Hong Zhang, Haojie Li, Shenglun Chen, Tiantian Yan, Zhihui Wang, Guo Lu, Wanli Ouyang
To make the Adaptive-Grained Depth Refinement stage robust to the coarse depth and adaptive to the depth range of the points, the Granularity Uncertainty is introduced to Adaptive-Grained Depth Refinement stage.
no code implementations • ECCV 2020 • Zhihao Hu, Zhenghao Chen, Dong Xu, Guo Lu, Wanli Ouyang, Shuhang Gu
In this work, we propose a new framework called Resolution-adaptive Flow Coding (RaFC) to effectively compress the flow maps globally and locally, in which we use multi-resolution representations instead of single-resolution representations for both the input flow maps and the output motion features of the MV encoder.
no code implementations • 11 May 2020 • Geng Zhan, Dan Xu, Guo Lu, Wei Wu, Chunhua Shen, Wanli Ouyang
Existing anchor-based and anchor-free object detectors in multi-stage or one-stage pipelines have achieved very promising detection performance.
no code implementations • ECCV 2020 • Guo Lu, Chunlei Cai, Xiaoyun Zhang, Li Chen, Wanli Ouyang, Dong Xu, Zhiyong Gao
Therefore, the encoder is adaptive to different video contents and achieves better compression performance by reducing the domain gap between the training and testing datasets.
1 code implementation • 9 Feb 2020 • Jiaheng Liu, Guo Lu, Zhihao Hu, Dong Xu
Our EDIC method can also be readily incorporated with the Deep Video Compression (DVC) framework to further improve the video compression performance.
1 code implementation • 31 Dec 2019 • Hongwen Zhang, Jie Cao, Guo Lu, Wanli Ouyang, Zhenan Sun
Reconstructing 3D human shape and pose from monocular images is challenging despite the promising results achieved by the most recent learning-based methods.
Ranked #75 on 3D Human Pose Estimation on 3DPW (MPJPE metric)
3D human pose and shape estimation 3D Human Reconstruction +3
4 code implementations • CVPR 2019 • Guo Lu, Wanli Ouyang, Dong Xu, Xiaoyun Zhang, Chunlei Cai, Zhiyong Gao
Conventional video compression approaches use the predictive coding architecture and encode the corresponding motion information and residual information.
1 code implementation • ECCV 2018 • Guo Lu, Wanli Ouyang, Dong Xu, Xiaoyun Zhang, Zhiyong Gao, Ming-Ting Sun
In this paper, we model the video artifact reduction task as a Kalman filtering procedure and restore decoded frames through a deep Kalman filtering network.