Search Results for author: Zhenzhong Chen

Found 43 papers, 14 papers with code

Transferable Learned Image Compression-Resistant Adversarial Perturbations

no code implementations6 Jan 2024 Yang Sui, Zhuohang Li, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Zhenzhong Chen

Adversarial attacks can readily disrupt the image classification system, revealing the vulnerability of DNN-based recognition tasks.

Adversarial Attack Autonomous Driving +4

Corner-to-Center Long-range Context Model for Efficient Learned Image Compression

no code implementations29 Nov 2023 Yang Sui, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Bo Yuan, Zhenzhong Chen

To tackle this issue, we conduct an in-depth analysis of the performance degradation observed in existing parallel context models, focusing on two aspects: the Quantity and Quality of information utilized for context prediction and decoding.

Image Compression

UnifiedSSR: A Unified Framework of Sequential Search and Recommendation

1 code implementation21 Oct 2023 Jiayi Xie, Shang Liu, Gao Cong, Zhenzhong Chen

In this work, we propose a Unified framework of Sequential Search and Recommendation (UnifiedSSR) for joint learning of user behavior history in both search and recommendation scenarios.

Self-Supervised Learning

Learning Many-to-Many Mapping for Unpaired Real-World Image Super-resolution and Downscaling

no code implementations8 Oct 2023 Wanjie Sun, Zhenzhong Chen

However, the training of image degradation and SR models in this strategy are separate, ignoring the inherent mutual dependency between downscaling and its inverse upscaling process.

Image Super-Resolution

JPEG Quantized Coefficient Recovery via DCT Domain Spatial-Frequential Transformer

no code implementations17 Aug 2023 Mingyu Ouyang, Zhenzhong Chen

However, the current DCT domain methods typically suffer from limited effectiveness in handling a wide range of compression quality factors, or fall short in recovering sparse quantized coefficients and the components across different colorspace.

JPEG Artifact Removal Quantization

Dynamic Kernel-Based Adaptive Spatial Aggregation for Learned Image Compression

no code implementations17 Aug 2023 Huairui Wang, Nianxiang Fu, Zhenzhong Chen, Shan Liu

In this paper, we focus on extending spatial aggregation capability and propose a dynamic kernel-based transform coding.

Image Compression valid

Improving Generalization of Image Captioning with Unsupervised Prompt Learning

no code implementations5 Aug 2023 Hongchen Wei, Zhenzhong Chen

By exploring the variable and invariant features in the original images and attribute-transferred images, attribute consistency constrains the attribute change direction of both images and sentences to learn domain-specific knowledge.

Attribute Image Captioning +2

Reconstruction Distortion of Learned Image Compression with Imperceptible Perturbations

no code implementations1 Jun 2023 Yang Sui, Zhuohang Li, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Zhenzhong Chen

Learned Image Compression (LIC) has recently become the trending technique for image transmission due to its notable performance.

Image Compression Image Reconstruction

LaMD: Latent Motion Diffusion for Video Generation

no code implementations23 Apr 2023 Yaosi Hu, Zhenzhong Chen, Chong Luo

We present a latent motion diffusion (LaMD) framework, which consists of a motion-decomposed video autoencoder and a diffusion-based motion generator, to implement this idea.

Video Generation Video Reconstruction

Continuous Space-Time Video Super-Resolution Utilizing Long-Range Temporal Information

no code implementations26 Feb 2023 Yuantong Zhang, Daiqin Yang, Zhenzhong Chen, Wenpeng Ding

To address these problems, we propose a continuous ST-VSR (C-STVSR) method that can convert the given video to any frame rate and spatial resolution.

Optical Flow Estimation Space-time Video Super-resolution +1

Mutually-Regularized Dual Collaborative Variational Auto-encoder for Recommendation Systems

1 code implementation21 Nov 2022 Yaochen Zhu, Zhenzhong Chen

However, since latent item variables are not modeled in UAE, it is difficult to utilize the widely available item content information when ratings are sparse.

Recommendation Systems

Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 challenge: Report

2 code implementations7 Nov 2022 Andrey Ignatov, Radu Timofte, Maurizio Denna, Abdel Younes, Ganzorig Gankhuyag, Jingang Huh, Myeong Kyun Kim, Kihwan Yoon, Hyeon-Cheol Moon, Seungho Lee, Yoonsik Choe, Jinwoo Jeong, Sungjei Kim, Maciej Smyl, Tomasz Latkowski, Pawel Kubik, Michal Sokolski, Yujie Ma, Jiahao Chao, Zhou Zhou, Hongfan Gao, Zhengfeng Yang, Zhenbing Zeng, Zhengyang Zhuge, Chenghua Li, Dan Zhu, Mengdi Sun, Ran Duan, Yan Gao, Lingshun Kong, Long Sun, Xiang Li, Xingdong Zhang, Jiawei Zhang, Yaqi Wu, Jinshan Pan, Gaocheng Yu, Jin Zhang, Feng Zhang, Zhe Ma, Hongbin Wang, Hojin Cho, Steve Kim, Huaen Li, Yanbo Ma, Ziwei Luo, Youwei Li, Lei Yu, Zhihong Wen, Qi Wu, Haoqiang Fan, Shuaicheng Liu, Lize Zhang, Zhikai Zong, Jeremy Kwon, Junxi Zhang, Mengyuan Li, Nianxiang Fu, Guanchen Ding, Han Zhu, Zhenzhong Chen, Gen Li, Yuanfan Zhang, Lei Sun, Dafeng Zhang, Neo Yang, Fitz Liu, Jerry Zhao, Mustafa Ayazoglu, Bahri Batuhan Bilecen, Shota Hirose, Kasidis Arunruangsirilert, Luo Ao, Ho Chun Leung, Andrew Wei, Jie Liu, Qiang Liu, Dahai Yu, Ao Li, Lei Luo, Ce Zhu, Seongmin Hong, Dongwon Park, Joonhee Lee, Byeong Hyun Lee, Seunggyu Lee, Se Young Chun, Ruiyuan He, Xuhao Jiang, Haihang Ruan, Xinjian Zhang, Jing Liu, Garas Gendy, Nabil Sabor, Jingchao Hou, Guanghui He

While numerous solutions have been proposed for this problem in the past, they are usually not compatible with low-power mobile NPUs having many computational and memory constraints.

Image Super-Resolution

Hierarchical Transformer with Spatio-Temporal Context Aggregation for Next Point-of-Interest Recommendation

1 code implementation4 Sep 2022 Jiayi Xie, Zhenzhong Chen

The stacking of encoders captures the latent hierarchical structure of the check-in sequence, which is used to predict the next visiting POI.

Exploring Long- and Short-Range Temporal Information for Learned Video Compression

1 code implementation7 Aug 2022 Huairui Wang, Zhenzhong Chen

Learned video compression methods have gained a variety of interest in the video coding community since they have matched or even exceeded the rate-distortion (RD) performance of traditional video codecs.

Motion Compensation Optical Flow Estimation +1

Learning Human Cognitive Appraisal Through Reinforcement Memory Unit

no code implementations6 Aug 2022 Yaosi Hu, Zhenzhong Chen

We conceptualize the memory-enhancing mechanism as Reinforcement Memory Unit (RMU) that contains an appraisal state together with two positive and negative reinforcement memories.

Video Quality Assessment

Learning Knowledge Representation with Meta Knowledge Distillation for Single Image Super-Resolution

no code implementations18 Jul 2022 Han Zhu, Zhenzhong Chen, Shan Liu

In addition, the KRNets are optimized in a meta-learning manner to ensure the knowledge transferring and the student learning are beneficial to improving the reconstructed quality of the student.

Image Super-Resolution Knowledge Distillation +1

Learned Video Compression via Heterogeneous Deformable Compensation Network

no code implementations11 Jul 2022 Huairui Wang, Zhenzhong Chen, Chang Wen Chen

In this paper, we propose a learned video compression framework via heterogeneous deformable compensation strategy (HDCVC) to tackle the problems of unstable compression performance caused by single-size deformable kernels in downsampled feature domain.

Motion Compensation Optical Flow Estimation +1

Deep Deconfounded Content-based Tag Recommendation for UGC with Causal Intervention

1 code implementation28 May 2022 Yaochen Zhu, Xubin Ren, Jing Yi, Zhenzhong Chen

We first establish a causal graph to represent the relations among uploader, UGC, and tag, where the uploaders are identified as confounders that spuriously correlate UGC and tag selections.

Recommendation Systems TAG

Multi-Auxiliary Augmented Collaborative Variational Auto-encoder for Tag Recommendation

no code implementations20 Apr 2022 Jing Yi, Xubin Ren, Zhenzhong Chen

Recommending appropriate tags to items can facilitate content organization, retrieval, consumption and other applications, where hybrid tag recommender systems have been utilized to integrate collaborative information and content information for better recommendations.

Recommendation Systems Retrieval +1

Pyramid Feature Alignment Network for Video Deblurring

no code implementations28 Mar 2022 Leitian Tao, Zhenzhong Chen

To better handle the challenges of complex and large motions, instead of aligning features at each scale separately, lower-scale motion information is used to guide the higher-scale motion estimation.

Deblurring Motion Estimation

Deep Causal Reasoning for Recommendations

1 code implementation6 Jan 2022 Yaochen Zhu, Jing Yi, Jiayi Xie, Zhenzhong Chen

As with all observational studies, hidden confounders, which are factors that affect both item exposures and user ratings, lead to a systematic bias in the estimation.

Recommendation Systems Variational Inference

Object-Relation Reasoning Graph for Action Recognition

no code implementations CVPR 2022 Yangjun Ou, Li Mi, Zhenzhong Chen

By combining an object-level graph (OG) and a relation-level graph (RG), the proposed OR2G catches the attribute transitions of objects and reasons about the relationship transitions between objects simultaneously.

Action Recognition Attribute +3

Make It Move: Controllable Image-to-Video Generation with Text Descriptions

1 code implementation CVPR 2022 Yaosi Hu, Chong Luo, Zhenzhong Chen

With both controllable appearance and motion, TI2V aims at generating videos from a static image and a text description.

Image to Video Generation

Optical Flow Reusing for High-Efficiency Space-Time Video Super Resolution

no code implementations13 Oct 2021 Yuantong Zhang, Huairui Wang, Han Zhu, Zhenzhong Chen

In this paper, we consider the task of space-time video super-resolution (ST-VSR), which can increase the spatial resolution and frame rate for a given video simultaneously.

Optical Flow Estimation Space-time Video Super-resolution +2

Cross-modal Variational Auto-encoder for Content-based Micro-video Background Music Recommendation

no code implementations15 Jul 2021 Jing Yi, Yaochen Zhu, Jiayi Xie, Zhenzhong Chen

Moreover, the multimodal information is fused by the product-of-experts (PoE) principle, where the semantic information in visual and textual modalities of the micro-video are weighted according to their variance estimations such that the modality with a lower noise level is given more weights.

Music Recommendation

Predicate correlation learning for scene graph generation

no code implementations6 Jul 2021 Leitian Tao, Li Mi, Nannan Li, Xianhang Cheng, Yaosi Hu, Zhenzhong Chen

For a typical Scene Graph Generation (SGG) method, there is often a large gap in the performance of the predicates' head classes and tail classes.

Graph Generation Scene Graph Generation

Visual Relationship Forecasting in Videos

no code implementations2 Jul 2021 Li Mi, Yangjun Ou, Zhenzhong Chen

To evaluate the VRF task, we introduce two video datasets named VRF-AG and VRF-VidOR, with a series of spatio-temporally localized visual relation annotations in a video.

Decision Making Object

Variational Bandwidth Auto-encoder for Hybrid Recommender Systems

1 code implementation17 May 2021 Yaochen Zhu, Zhenzhong Chen

Moreover, by considering the fusion of collaborative and feature variables as a virtual communication channel from an information-theoretic perspective, we introduce a user-dependent channel to dynamically control the information allowed to be accessed from the feature embeddings.

Recommendation Systems

Towards Visual Distortion in Black-Box Attacks

1 code implementation21 Jul 2020 Nannan Li, Zhenzhong Chen

Constructing adversarial examples in a black-box threat model injures the original images by introducing visual distortion.

Perceptual Distance

Multiple Video Frame Interpolation via Enhanced Deformable Separable Convolution

1 code implementation15 Jun 2020 Xianhang Cheng, Zhenzhong Chen

During the learning process, different intermediate time step can be involved as a control variable by means of an extension of coord-conv trick, allowing the estimated components to vary with different input temporal information.

Motion Estimation Optical Flow Estimation +1

Towards Mesh Saliency Detection in 6 Degrees of Freedom

no code implementations27 May 2020 Xiaoying Ding, Zhenzhong Chen

Traditional 3D mesh saliency detection algorithms and corresponding databases were proposed under several constraints such as providing limited viewing directions and not taking the subject's movement into consideration.

Saliency Detection

Predicting the Popularity of Micro-videos with Multimodal Variational Encoder-Decoder Framework

1 code implementation28 Mar 2020 Yaochen Zhu, Jiayi Xie, Zhenzhong Chen

As an emerging type of user-generated content, micro-video drastically enriches people's entertainment experiences and social interactions.

Learning Compact Reward for Image Captioning

no code implementations24 Mar 2020 Nannan Li, Zhenzhong Chen

Adversarial learning has shown its advances in generating natural and diverse descriptions in image captioning.

Image Captioning Reinforcement Learning (RL) +1

Learned Image Downscaling for Upscaling using Content Adaptive Resampler

5 code implementations22 Jul 2019 Wanjie Sun, Zhenzhong Chen

The proposed resampler network generates content adaptive image resampling kernels that are applied to the original HR input to generate pixels on the downscaled image.

 Ranked #1 on Image Super-Resolution on DIV2K val - 2x upscaling (using extra training data)

Image Super-Resolution

Obj-GloVe: Scene-Based Contextual Object Embedding

no code implementations2 Jul 2019 Canwen Xu, Zhenzhong Chen, Chenliang Li

Recently, with the prevalence of large-scale image dataset, the co-occurrence information among classes becomes rich, calling for a new way to exploit it to facilitate inference.

Dimensionality Reduction Image Generation +3

A Review-Driven Neural Model for Sequential Recommendation

no code implementations1 Jul 2019 Chenliang Li, Xichuan Niu, Xiangyang Luo, Zhenzhong Chen, Cong Quan

Given a sequence of historical purchased items for a user, we devise a novel hierarchical attention over attention mechanism to capture sequential patterns at both union-level and individual-level.

Collaborative Filtering Sequential Recommendation

Macroblock Classification Method for Video Applications Involving Motions

no code implementations28 Feb 2015 Weiyao Lin, Ming-Ting Sun, Hongxiang Li, Zhenzhong Chen, Wei Li, Bing Zhou

We demonstrate that this low-computation-complexity method can efficiently catch the characteristics of the frame.

Change Detection Classification +2

A Heat-Map-based Algorithm for Recognizing Group Activities in Videos

no code implementations21 Feb 2015 Weiyao Lin, Hang Chu, Jianxin Wu, Bin Sheng, Zhenzhong Chen

In this paper, a new heat-map-based (HMB) algorithm is proposed for group activity recognition.

Group Activity Recognition

Intra-and-Inter-Constraint-based Video Enhancement based on Piecewise Tone Mapping

no code implementations21 Feb 2015 Yuanzhe Chen, Weiyao Lin, Chongyang Zhang, Zhenzhong Chen, Ning Xu, Jun Xie

In this paper, we propose a new intra-and-inter-constraint-based video enhancement approach aiming to 1) achieve high intra-frame quality of the entire picture where multiple region-of-interests (ROIs) can be adaptively and simultaneously enhanced, and 2) guarantee the inter-frame quality consistencies among video frames.

Tone Mapping Video Enhancement

Cannot find the paper you are looking for? You can Submit a new open access paper.