Search Results for author: Huiyu Duan

Found 12 papers, 5 papers with code

How is Visual Attention Influenced by Text Guidance? Database and Model

no code implementations • 11 Apr 2024 • Yinan Sun, Xiongkuo Min, Huiyu Duan, Guangtao Zhai

Finally, considering the effect of text descriptions on visual attention, while most existing saliency models ignore this impact, we further propose a text-guided saliency (TGSal) prediction model, which extracts and integrates both image features and text features to predict the image saliency under various text-description conditions.

Saliency Prediction

Paper
Add Code

AIGCOIQA2024: Perceptual Quality Assessment of AI Generated Omnidirectional Images

no code implementations • 1 Apr 2024 • Liu Yang, Huiyu Duan, Long Teng, Yucheng Zhu, Xiaohong Liu, Menghan Hu, Xiongkuo Min, Guangtao Zhai, Patrick Le Callet

Finally, we conduct a benchmark experiment to evaluate the performance of state-of-the-art IQA models on our database.

Image Quality Assessment

Paper
Add Code

Perceptual Video Quality Assessment: A Survey

no code implementations • 5 Feb 2024 • Xiongkuo Min, Huiyu Duan, Wei Sun, Yucheng Zhu, Guangtao Zhai

Perceptual video quality assessment plays a vital role in the field of video processing due to the existence of quality degradations introduced in various stages of video signal acquisition, compression, transmission and display.

Video Quality Assessment

Paper
Add Code

Audio-visual Saliency for Omnidirectional Videos

no code implementations • 9 Nov 2023 • Yuxin Zhu, Xilei Zhu, Huiyu Duan, Jie Li, Kaiwei Zhang, Yucheng Zhu, Li Chen, Xiongkuo Min, Guangtao Zhai

Visual saliency prediction for omnidirectional videos (ODVs) has shown great significance and necessity for omnidirectional videos to help ODV coding, ODV transmission, ODV rendering, etc..

Saliency Prediction

Paper
Add Code

Perceptual Quality Assessment of Omnidirectional Audio-visual Signals

1 code implementation • 20 Jul 2023 • Xilei Zhu, Huiyu Duan, Yuqin Cao, Yuxin Zhu, Yucheng Zhu, Jing Liu, Li Chen, Xiongkuo Min, Guangtao Zhai

Omnidirectional videos (ODVs) play an increasingly important role in the application fields of medical, education, advertising, tourism, etc.

Paper
Code

AIGCIQA2023: A Large-scale Image Quality Assessment Database for AI Generated Images: from the Perspectives of Quality, Authenticity and Correspondence

1 code implementation • 1 Jul 2023 • Jiarui Wang, Huiyu Duan, Jing Liu, Shi Chen, Xiongkuo Min, Guangtao Zhai

In this paper, in order to get a better understanding of the human visual preferences for AIGIs, a large-scale IQA database for AIGC is established, which is named as AIGCIQA2023.

Image Quality Assessment Text-to-Image Generation

Paper
Code

Masked Autoencoders as Image Processors

1 code implementation • 30 Mar 2023 • Huiyu Duan, Wei Shen, Xiongkuo Min, Danyang Tu, Long Teng, Jia Wang, Guangtao Zhai

Recently, masked autoencoders (MAE) for feature pre-training have further unleashed the potential of Transformers, leading to state-of-the-art performances on various high-level vision tasks.

Ranked #4 on Image Defocus Deblurring on DPD (Dual-view)

Deblurring Image Defocus Deblurring +2

Paper
Code

Perceptual Quality Assessment of Omnidirectional Images

no code implementations • 6 Jul 2022 • Huiyu Duan, Guangtao Zhai, Xiongkuo Min, Yucheng Zhu, Yi Fang, Xiaokang Yang

The original and distorted omnidirectional images, subjective quality ratings, and the head and eye movement data together constitute the OIQA database.

Image Quality Assessment

Paper
Add Code

Saliency in Augmented Reality

1 code implementation • 18 Apr 2022 • Huiyu Duan, Wei Shen, Xiongkuo Min, Danyang Tu, Jing Li, Guangtao Zhai

Therefore, in this paper, we mainly analyze the interaction effect between background (BG) scenes and AR contents, and study the saliency prediction problem in AR.

Saliency Prediction

Paper
Code

Confusing Image Quality Assessment: Towards Better Augmented Reality Experience

1 code implementation • 11 Apr 2022 • Huiyu Duan, Xiongkuo Min, Yucheng Zhu, Guangtao Zhai, Xiaokang Yang, Patrick Le Callet

An objective metric termed CFIQA is also proposed to better evaluate the confusing image quality.

Image Quality Assessment

Paper
Code

Iwin: Human-Object Interaction Detection via Transformer with Irregular Windows

no code implementations • 20 Mar 2022 • Danyang Tu, Xiongkuo Min, Huiyu Duan, Guodong Guo, Guangtao Zhai, Wei Shen

Iwin Transformer is a hierarchical Transformer which progressively performs token representation learning and token agglomeration within irregular windows.

Human-Object Interaction Detection Object +4

Paper
Add Code

End-to-End Human-Gaze-Target Detection with Transformers

no code implementations • CVPR 2022 • Danyang Tu, Xiongkuo Min, Huiyu Duan, Guodong Guo, Guangtao Zhai, Wei Shen

In contrast, we redefine the HGT detection task as detecting human head locations and their gaze targets, simultaneously.

Gaze Prediction object-detection +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.