Search Results for author: ZiCheng Zhang

Found 55 papers, 24 papers with code

THQA: A Perceptual Quality Assessment Database for Talking Heads

1 code implementation13 Apr 2024 Yingjie Zhou, ZiCheng Zhang, Wei Sun, Xiaohong Liu, Xiongkuo Min, Zhihua Wang, Xiao-Ping Zhang, Guangtao Zhai

In the realm of media technology, digital humans have gained prominence due to rapid advancements in computer technology.

Video Quality Assessment

AIGIQA-20K: A Large Database for AI-Generated Image Quality Assessment

no code implementations4 Apr 2024 Chunyi Li, Tengchuan Kou, Yixuan Gao, Yuqin Cao, Wei Sun, ZiCheng Zhang, Yingjie Zhou, Zhichao Zhang, Weixia Zhang, HaoNing Wu, Xiaohong Liu, Xiongkuo Min, Guangtao Zhai

With the rapid advancements in AI-Generated Content (AIGC), AI-Generated Images (AIGIs) have been widely applied in entertainment, education, and social media.

Image Quality Assessment

Graph Neural Aggregation-diffusion with Metastability

no code implementations29 Mar 2024 Kaiyuan Cui, Xinyan Wang, ZiCheng Zhang, Weichen Zhao

Due to the connection between graph diffusion and message passing, diffusion-based models have been widely studied.

Node Classification

Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation

no code implementations13 Mar 2024 ZiCheng Zhang, Tong Zhang, Yi Zhu, Jianzhuang Liu, Xiaodan Liang, Qixiang Ye, Wei Ke

To mitigate these issues, we propose a Language-Driven Visual Consensus (LDVC) approach, fostering improved alignment of semantic and visual information. Specifically, we leverage class embeddings as anchors due to their discrete and abstract nature, steering vision features toward class embeddings.

Language Modelling Semantic Segmentation +1

BlazeBVD: Make Scale-Time Equalization Great Again for Blind Video Deflickering

no code implementations10 Mar 2024 Xinmin Qiu, Congying Han, ZiCheng Zhang, Bonan Li, Tiande Guo, Pingyu Wang, Xuecheng Nie

Developing blind video deflickering (BVD) algorithms to enhance video temporal consistency, is gaining importance amid the flourish of image processing and video generation.

Video Generation Video Temporal Consistency

Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis

1 code implementation27 Feb 2024 ZiCheng Zhang, Ruobing Zheng, Ziwen Liu, Congying Han, Tianqi Li, Meng Wang, Tiande Guo, Jingdong Chen, Bonan Li, Ming Yang

Recent works in implicit representations, such as Neural Radiance Fields (NeRF), have advanced the generation of realistic and animatable head avatars from video sequences.

Towards Open-ended Visual Quality Comparison

no code implementations26 Feb 2024 HaoNing Wu, Hanwei Zhu, ZiCheng Zhang, Erli Zhang, Chaofeng Chen, Liang Liao, Chunyi Li, Annan Wang, Wenxiu Sun, Qiong Yan, Xiaohong Liu, Guangtao Zhai, Shiqi Wang, Weisi Lin

Comparative settings (e. g. pairwise choice, listwise ranking) have been adopted by a wide range of subjective studies for image quality assessment (IQA), as it inherently standardizes the evaluation criteria across different observers and offer more clear-cut responses.

Image Quality Assessment

MISC: Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model

1 code implementation26 Feb 2024 Chunyi Li, Guo Lu, Donghui Feng, HaoNing Wu, ZiCheng Zhang, Xiaohong Liu, Guangtao Zhai, Weisi Lin, Wenjun Zhang

With the evolution of storage and communication protocols, ultra-low bitrate image compression has become a highly demanding topic.

Image Compression

A Benchmark for Multi-modal Foundation Models on Low-level Vision: from Single Images to Pairs

1 code implementation11 Feb 2024 ZiCheng Zhang, HaoNing Wu, Erli Zhang, Guangtao Zhai, Weisi Lin

To this end, we design benchmark settings to emulate human language responses related to low-level vision: the low-level visual perception (A1) via visual question answering related to low-level attributes (e. g. clarity, lighting); and the low-level visual description (A2), on evaluating MLLMs for low-level text descriptions.

Image Quality Assessment Question Answering +1

Towards Optimal Adversarial Robust Q-learning with Bellman Infinity-error

no code implementations3 Feb 2024 Haoran Li, ZiCheng Zhang, Wang Luo, Congying Han, Yudong Hu, Tiande Guo, Shichen Liao

Establishing robust policies is essential to counter attacks or disturbances affecting deep reinforcement learning (DRL) agents.

Adversarial Robustness Q-Learning

AttentionLut: Attention Fusion-based Canonical Polyadic LUT for Real-time Image Enhancement

no code implementations3 Jan 2024 Kang Fu, Yicong Peng, ZiCheng Zhang, Qihang Xu, Xiaohong Liu, Jia Wang, Guangtao Zhai

Subsequently, the attention fusion module integrates the image feature with the priori attention feature obtained during training to generate image-adaptive canonical polyadic tensors.

Image Enhancement

Q-Refine: A Perceptual Quality Refiner for AI-Generated Image

no code implementations2 Jan 2024 Chunyi Li, HaoNing Wu, ZiCheng Zhang, Hongkun Hao, Kaiwei Zhang, Lei Bai, Xiaohong Liu, Xiongkuo Min, Weisi Lin, Guangtao Zhai

With the rapid evolution of the Text-to-Image (T2I) model in recent years, their unsatisfactory generation result has become a challenge.

Image Quality Assessment

Exploring the Naturalness of AI-Generated Images

1 code implementation9 Dec 2023 Zijian Chen, Wei Sun, HaoNing Wu, ZiCheng Zhang, Jun Jia, Zhongpeng Ji, Fengyu Sun, Shangling Jui, Xiongkuo Min, Guangtao Zhai, Wenjun Zhang

In this paper, we take the first step to benchmark and assess the visual naturalness of AI-generated images.

FS-BAND: A Frequency-Sensitive Banding Detector

no code implementations30 Nov 2023 Zijian Chen, Wei Sun, ZiCheng Zhang, Ru Huang, Fangfang Lu, Xiongkuo Min, Guangtao Zhai, Wenjun Zhang

Banding artifact, as known as staircase-like contour, is a common quality annoyance that happens in compression, transmission, etc.

Image Quality Assessment

Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models

1 code implementation12 Nov 2023 HaoNing Wu, ZiCheng Zhang, Erli Zhang, Chaofeng Chen, Liang Liao, Annan Wang, Kaixin Xu, Chunyi Li, Jingwen Hou, Guangtao Zhai, Geng Xue, Wenxiu Sun, Qiong Yan, Weisi Lin

Multi-modality foundation models, as represented by GPT-4V, have brought a new paradigm for low-level visual perception and understanding tasks, that can respond to a broad range of natural human instructions in a model.

A No-Reference Quality Assessment Method for Digital Human Head

no code implementations25 Oct 2023 Yingjie Zhou, ZiCheng Zhang, Wei Sun, Xiongkuo Min, Xianghe Ma, Guangtao Zhai

In this paper, we develop a novel no-reference (NR) method based on Transformer to deal with DHQA in a multi-task manner.

Geometry-Aware Video Quality Assessment for Dynamic Digital Human

no code implementations24 Oct 2023 ZiCheng Zhang, Yingjie Zhou, Wei Sun, Xiongkuo Min, Guangtao Zhai

Usually, DDHs are displayed as 2D rendered animation videos and it is natural to adapt video quality assessment (VQA) methods to DDH quality assessment (DDH-QA) tasks.

Attribute Video Quality Assessment +1

Q-Bench: A Benchmark for General-Purpose Foundation Models on Low-level Vision

1 code implementation25 Sep 2023 HaoNing Wu, ZiCheng Zhang, Erli Zhang, Chaofeng Chen, Liang Liao, Annan Wang, Chunyi Li, Wenxiu Sun, Qiong Yan, Guangtao Zhai, Weisi Lin

To address this gap, we present Q-Bench, a holistic benchmark crafted to systematically evaluate potential abilities of MLLMs on three realms: low-level visual perception, low-level visual description, and overall visual quality assessment.

Image Quality Assessment

A Consumer-tier based Visual-Brain Machine Interface for Augmented Reality Glasses Interactions

no code implementations29 Aug 2023 Yuying Jiang, Fan Bai, ZiCheng Zhang, Xiaochen Ye, Zheng Liu, Zhiping Shi, Jianwei Yao, Xiaojun Liu, Fangkun Zhu, Junling Li Qian Guo, Xiaoan Wang, Junwen Luo

Here we develop a consumer-tier Visual-Brain Machine Inteface(V-BMI) system specialized for Augmented Reality(AR) glasses interactions.

MRA-GNN: Minutiae Relation-Aware Model over Graph Neural Network for Fingerprint Embedding

no code implementations31 Jul 2023 Yapeng Su, Tong Zhao, ZiCheng Zhang

However, previous works including CNN-based and Transformer-based approaches fail to exploit the nonstructural data, such as topology and correlation in fingerprints, which is essential to facilitate the identifiability and robustness of embedding.

Descriptive Graph Embedding +1

RAWIW: RAW Image Watermarking Robust to ISP Pipeline

no code implementations28 Jul 2023 Kang Fu, Xiaohong Liu, Jun Jia, ZiCheng Zhang, Yicong Peng, Jia Wang, Guangtao Zhai

To achieve end-to-end training of the framework, we integrate a neural network that simulates the ISP pipeline to handle the RAW-to-RGB conversion process.

Advancing Zero-Shot Digital Human Quality Assessment through Text-Prompted Evaluation

1 code implementation6 Jul 2023 ZiCheng Zhang, Wei Sun, Yingjie Zhou, HaoNing Wu, Chunyi Li, Xiongkuo Min, Xiaohong Liu, Guangtao Zhai, Weisi Lin

To address this gap, we propose SJTU-H3D, a subjective quality assessment database specifically designed for full-body digital humans.

GMS-3DQA: Projection-based Grid Mini-patch Sampling for 3D Model Quality Assessment

1 code implementation9 Jun 2023 ZiCheng Zhang, Wei Sun, Houning Wu, Yingjie Zhou, Chunyi Li, Xiongkuo Min, Guangtao Zhai, Weisi Lin

Model-based 3DQA methods extract features directly from the 3D models, which are characterized by their high degree of complexity.

Point Cloud Quality Assessment

AGIQA-3K: An Open Database for AI-Generated Image Quality Assessment

1 code implementation7 Jun 2023 Chunyi Li, ZiCheng Zhang, HaoNing Wu, Wei Sun, Xiongkuo Min, Xiaohong Liu, Guangtao Zhai, Weisi Lin

With the rapid advancements of the text-to-image generative model, AI-generated images (AGIs) have been widely applied to entertainment, education, social media, etc.

Image Quality Assessment

DiffBFR: Bootstrapping Diffusion Model Towards Blind Face Restoration

no code implementations8 May 2023 Xinmin Qiu, Congying Han, ZiCheng Zhang, Bonan Li, Tiande Guo, Xuecheng Nie

This design is implemented with two key components: 1) Identity Restoration Module (IRM) for preserving the face details in results.

Blind Face Restoration Denoising

MD-VQA: Multi-Dimensional Quality Assessment for UGC Live Videos

1 code implementation CVPR 2023 ZiCheng Zhang, Wei Wu, Wei Sun, Dangyang Tu, Wei Lu, Xiongkuo Min, Ying Chen, Guangtao Zhai

User-generated content (UGC) live videos are often bothered by various distortions during capture procedures and thus exhibit diverse visual qualities.

Video Quality Assessment Visual Question Answering (VQA)

Transforming Radiance Field with Lipschitz Network for Photorealistic 3D Scene Stylization

no code implementations CVPR 2023 ZiCheng Zhang, Yinglu Liu, Congying Han, Yingwei Pan, Tiande Guo, Ting Yao

Simply coupling NeRF with photorealistic style transfer (PST) will result in cross-view inconsistency and degradation of stylized view syntheses.

Novel View Synthesis Style Transfer

A Perceptual Quality Assessment Exploration for AIGC Images

1 code implementation22 Mar 2023 ZiCheng Zhang, Chunyi Li, Wei Sun, Xiaohong Liu, Xiongkuo Min, Guangtao Zhai

\underline{AI} \underline{G}enerated \underline{C}ontent (\textbf{AIGC}) has gained widespread attention with the increasing efficiency of deep learning in content creation.

Image Quality Assessment

Subjective and Objective Quality Assessment for in-the-Wild Computer Graphics Images

1 code implementation14 Mar 2023 ZiCheng Zhang, Wei Sun, Yingjie Zhou, Jun Jia, Zhichao Zhang, Jing Liu, Xiongkuo Min, Guangtao Zhai

Computer graphics images (CGIs) are artificially generated by means of computer programs and are widely perceived under various scenarios, such as games, streaming media, etc.

Image Quality Assessment NR-IQA

StyO: Stylize Your Face in Only One-Shot

no code implementations6 Mar 2023 Bonan Li, ZiCheng Zhang, Xuecheng Nie, Congying Han, Yinhan Hu, Tiande Guo

And it introduces a novel triple reconstruction loss to fine-tune the pre-trained LDM for encoding style and content into corresponding identifiers; 2) Fine-grained Content Controller (FCC) for the recombination phase.

Disentanglement One-Shot Face Stylization

EEP-3DQA: Efficient and Effective Projection-based 3D Model Quality Assessment

no code implementations17 Feb 2023 ZiCheng Zhang, Wei Sun, Yingjie Zhou, Wei Lu, Yucheng Zhu, Xiongkuo Min, Guangtao Zhai

Currently, great numbers of efforts have been put into improving the effectiveness of 3D model quality assessment (3DQA) methods.

DDH-QA: A Dynamic Digital Humans Quality Assessment Database

1 code implementation24 Dec 2022 ZiCheng Zhang, Yingjie Zhou, Wei Sun, Wei Lu, Xiongkuo Min, Yu Wang, Guangtao Zhai

In recent years, large amounts of effort have been put into pushing forward the real-world application of dynamic digital human (DDH).

Video Quality Assessment

CoupAlign: Coupling Word-Pixel with Sentence-Mask Alignments for Referring Image Segmentation

no code implementations4 Dec 2022 ZiCheng Zhang, Yi Zhu, Jianzhuang Liu, Xiaodan Liang, Wei Ke

Then in the Sentence-Mask Alignment (SMA) module, the masks are weighted by the sentence embedding to localize the referred object, and finally projected back to aggregate the pixels for the target.

Image Segmentation Semantic Segmentation +3

Perceptual Quality Assessment for Digital Human Heads

1 code implementation20 Sep 2022 ZiCheng Zhang, Yingjie Zhou, Wei Sun, Xiongkuo Min, Yuzhe Wu, Guangtao Zhai

Digital humans are attracting more and more research interest during the last decade, the generation, representation, rendering, and animation of which have been put into large amounts of effort.

Generalized One-shot Domain Adaptation of Generative Adversarial Networks

2 code implementations8 Sep 2022 ZiCheng Zhang, Yinglu Liu, Congying Han, Tiande Guo, Ting Yao, Tao Mei

While previous works mainly focus on style transfer, we propose a novel and concise framework to address the \textit{generalized one-shot adaptation} task for both style and entity transfer, in which a reference image and its binary entity mask are provided.

Domain Adaptation Generative Adversarial Network +1

MM-PCQA: Multi-Modal Learning for No-reference Point Cloud Quality Assessment

1 code implementation1 Sep 2022 ZiCheng Zhang, Wei Sun, Xiongkuo Min, Quan Zhou, Jun He, Qiyuan Wang, Guangtao Zhai

In specific, we split the point clouds into sub-models to represent local geometry distortions such as point shift and down-sampling.

Point Cloud Quality Assessment

Evaluating Point Cloud from Moving Camera Videos: A No-Reference Metric

1 code implementation30 Aug 2022 ZiCheng Zhang, Wei Sun, Yucheng Zhu, Xiongkuo Min, Wei Wu, Ying Chen, Guangtao Zhai

To tackle the challenge of point cloud quality assessment (PCQA), many PCQA methods have been proposed to evaluate the visual quality levels of point clouds by assessing the rendered static 2D projections.

Image Quality Assessment Point Cloud Quality Assessment +2

Subjective Quality Assessment for Images Generated by Computer Graphics

no code implementations10 Jun 2022 Tao Wang, ZiCheng Zhang, Wei Sun, Xiongkuo Min, Wei Lu, Guangtao Zhai

However, limited work has been put forward to tackle the problem of computer graphics generated images' quality assessment (CG-IQA).

No-Reference Image Quality Assessment NR-IQA

A No-reference Quality Assessment Metric for Point Cloud Based on Captured Video Sequences

no code implementations9 Jun 2022 Yu Fan, ZiCheng Zhang, Wei Sun, Xiongkuo Min, Wei Lu, Tao Wang, Ning Liu, Guangtao Zhai

Point cloud is one of the most widely used digital formats of 3D models, the visual quality of which is quite sensitive to distortions such as downsampling, noise, and compression.

Point Cloud Quality Assessment

A No-Reference Deep Learning Quality Assessment Method for Super-resolution Images Based on Frequency Maps

no code implementations9 Jun 2022 ZiCheng Zhang, Wei Sun, Xiongkuo Min, Wenhan Zhu, Tao Wang, Wei Lu, Guangtao Zhai

Therefore, in this paper, we propose a no-reference deep-learning image quality assessment method based on frequency maps because the artifacts caused by SISR algorithms are quite sensitive to frequency information.

Image Quality Assessment Image Super-Resolution

Blind Surveillance Image Quality Assessment via Deep Neural Network Combined with the Visual Saliency

no code implementations9 Jun 2022 Wei Lu, Wei Sun, Wenhan Zhu, Xiongkuo Min, ZiCheng Zhang, Tao Wang, Guangtao Zhai

In this paper, we first conduct an example experiment (i. e. the face detection task) to demonstrate that the quality of the SIs has a crucial impact on the performance of the IVSS, and then propose a saliency-based deep neural network for the blind quality assessment of the SIs, which helps IVSS to filter the low-quality SIs and improve the detection and recognition performance.

Face Detection Image Quality Assessment

Deep Neural Network for Blind Visual Quality Assessment of 4K Content

no code implementations9 Jun 2022 Wei Lu, Wei Sun, Xiongkuo Min, Wenhan Zhu, Quan Zhou, Jun He, Qiyuan Wang, ZiCheng Zhang, Tao Wang, Guangtao Zhai

In this paper, we propose a deep learning-based BIQA model for 4K content, which on one hand can recognize true and pseudo 4K content and on the other hand can evaluate their perceptual visual quality.

4k Blind Image Quality Assessment +1

Perceptual Quality Assessment for Fine-Grained Compressed Images

no code implementations8 Jun 2022 ZiCheng Zhang, Wei Sun, Wei Wu, Ying Chen, Xiongkuo Min, Guangtao Zhai

Nowadays, the mainstream full-reference (FR) metrics are effective to predict the quality of compressed images at coarse-grained levels (the bit rates differences of compressed images are obvious), however, they may perform poorly for fine-grained compressed images whose bit rates differences are quite subtle.

Image Compression Image Quality Assessment

PetsGAN: Rethinking Priors for Single Image Generation

2 code implementations3 Mar 2022 ZiCheng Zhang, Yinglu Liu, Congying Han, Hailin Shi, Tiande Guo, BoWen Zhou

Moreover, we apply our method to other image manipulation tasks (e. g., style transfer, harmonization), and the results further prove the effectiveness and efficiency of our method.

Image Generation Image Manipulation +2

No-Reference Quality Assessment for 3D Colored Point Cloud and Mesh Models

2 code implementations5 Jul 2021 ZiCheng Zhang, Wei Sun, Xiongkuo Min, Tao Wang, Wei Lu, Guangtao Zhai

Therefore, many related studies such as point cloud quality assessment (PCQA) and mesh quality assessment (MQA) have been carried out to measure the visual quality degradations of 3D models.

Point Cloud Quality Assessment

ExSinGAN: Learning an Explainable Generative Model from a Single Image

no code implementations16 May 2021 ZiCheng Zhang, Congying Han, Tiande Guo

Generating images from a single sample, as a newly developing branch of image synthesis, has attracted extensive attention.

Image Generation Image Manipulation

Multi-Agent Semi-Siamese Training for Long-tail and Shallow Face Learning

no code implementations10 May 2021 Hailin Shi, Dan Zeng, Yichun Tai, Hang Du, Yibo Hu, ZiCheng Zhang, Tao Mei

However, unlike the existing public face datasets, in many real-world scenarios of face recognition, the depth of training dataset is shallow, which means only two face images are available for each ID.

Face Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.