Search Results for author: Zhihao Chen

Found 21 papers, 8 papers with code

MambaUIE&SR: Unraveling the Ocean's Secrets with Only 2.8 FLOPs

1 code implementation • 22 Apr 2024 • Zhihao Chen, Yiyuan Ge

In addition, combining CNN and Transformer can effectively combine global and local information for enhancement.

Paper
Code

FLDM-VTON: Faithful Latent Diffusion Model for Virtual Try-on

no code implementations • 22 Apr 2024 • Chenhui Wang, Tao Chen, Zhihao Chen, Zhizhong Huang, Taoran Jiang, Qi Wang, Hongming Shan

Despite their impressive generative performance, latent diffusion model-based virtual try-on (VTON) methods lack faithfulness to crucial details of the clothes, such as style, pattern, and text.

Virtual Try-on

Paper
Add Code

MedRG: Medical Report Grounding with Multi-modal Large Language Model

no code implementations • 10 Apr 2024 • Ke Zou, Yang Bai, Zhihao Chen, Yang Zhou, Yidi Chen, Kai Ren, Meng Wang, Xuedong Yuan, Xiaojing Shen, Huazhu Fu

Medical Report Grounding is pivotal in identifying the most relevant regions in medical images based on a given phrase query, a critical aspect in medical image analysis and radiological diagnosis.

Language Modelling Large Language Model +2

Paper
Add Code

Part-Attention Based Model Make Occluded Person Re-Identification Stronger

no code implementations • 4 Apr 2024 • Zhihao Chen, Yiyuan Ge

However, occluded person ReID still suffers from background clutter and low-quality local feature representations, which limits model performance.

Human Parsing Person Re-Identification

Paper
Add Code

Occluded Cloth-Changing Person Re-Identification

1 code implementation • 13 Mar 2024 • Zhihao Chen, Yiyuan Ge

We define cloth-changing person re-identification in occlusion scenarios as occluded cloth-changing person re-identification (Occ-CC-ReID), and to the best of our knowledge, we are the first to propose occluded cloth-changing person re-identification as a new task.

Cloth-Changing Person Re-Identification

Paper
Code

Low-dose CT Denoising with Language-engaged Dual-space Alignment

1 code implementation • 10 Mar 2024 • Zhihao Chen, Tao Chen, Chenhui Wang, Chuang Niu, Ge Wang, Hongming Shan

While various deep learning methods were proposed for low-dose computed tomography (CT) denoising, they often suffer from over-smoothing, blurring, and lack of explainability.

Computed Tomography (CT) Denoising

Paper
Code

Exploiting Emotion-Semantic Correlations for Empathetic Response Generation

1 code implementation • 27 Feb 2024 • Zhou Yang, Zhaochun Ren, Yufeng Wang, Xiaofei Zhu, Zhihao Chen, Tiecheng Cai, Yunbing Wu, Yisong Su, Sibo Ju, Xiangwen Liao

Based on dynamic emotion-semantic vectors and dependency trees, we propose a dynamic correlation graph convolutional network to guide the model in learning context meanings in dialogue and generating empathetic responses.

Dialogue Generation Empathetic Response Generation +1

Paper
Code

Training-free image style alignment for self-adapting domain shift on handheld ultrasound devices

no code implementations • 17 Feb 2024 • Hongye Zeng, Ke Zou, Zhihao Chen, Yuchong Gao, Hongbo Chen, Haibin Zhang, Kang Zhou, Meng Wang, Rick Siow Mong Goh, Yong liu, Chang Jiang, Rui Zheng, Huazhu Fu

Moreover, the models trained on standard ultrasound device data are constrained by training data distribution and perform poorly when directly applied to handheld device data.

Paper
Add Code

IQAGPT: Image Quality Assessment with Vision-language and ChatGPT Models

no code implementations • 25 Dec 2023 • Zhihao Chen, Bin Hu, Chuang Niu, Tao Chen, Yuxin Li, Hongming Shan, Ge Wang

Second, we fine-tune the image quality captioning VLM on the CT-IQA dataset to generate quality descriptions.

Image Quality Assessment

Paper
Add Code

AATCT-IDS: A Benchmark Abdominal Adipose Tissue CT Image Dataset for Image Denoising, Semantic Segmentation, and Radiomics Evaluation

no code implementations • 16 Aug 2023 • Zhiyu Ma, Chen Li, Tianming Du, Le Zhang, Dechao Tang, Deguo Ma, Shanchuan Huang, Yan Liu, Yihao Sun, Zhihao Chen, Jin Yuan, Qianqing Nie, Marcin Grzegorzek, Hongzan Sun

In the comparative study of semantic segmentation of abdominal adipose tissue, the segmentation results of adipose tissue by each model show different structural characteristics.

Image Denoising Segmentation +1

Paper
Add Code

ASCON: Anatomy-aware Supervised Contrastive Learning Framework for Low-dose CT Denoising

1 code implementation • 23 Jul 2023 • Zhihao Chen, Qi Gao, Yi Zhang, Hongming Shan

In this paper, we propose a novel Anatomy-aware Supervised CONtrastive learning framework, termed ASCON, which can explore the anatomical semantics for low-dose CT denoising while providing anatomical interpretability.

Anatomy Computed Tomography (CT) +2

Paper
Code

Learning Physical-Spatio-Temporal Features for Video Shadow Removal

no code implementations • 16 Mar 2023 • Zhihao Chen, Liang Wan, Yefan Xiao, Lei Zhu, Huazhu Fu

Then, we develop a progressive aggregation module to enhance the spatio and temporal characteristics of features maps, and effectively integrate the three kinds of features.

Shadow Removal Video Restoration

Paper
Add Code

Medical Phrase Grounding with Region-Phrase Context Contrastive Alignment

no code implementations • 14 Mar 2023 • Zhihao Chen, Yang Zhou, Anh Tran, Junting Zhao, Liang Wan, Gideon Ooi, Lionel Cheng, Choon Hua Thng, Xinxing Xu, Yong liu, Huazhu Fu

To enable MedRPG to locate nuanced medical findings with better region-phrase correspondences, we further propose Tri-attention Context contrastive alignment (TaCo).

Phrase Grounding Visual Grounding

Paper
Add Code

LIT-Former: Linking In-plane and Through-plane Transformers for Simultaneous CT Image Denoising and Deblurring

1 code implementation • 21 Feb 2023 • Zhihao Chen, Chuang Niu, Qi Gao, Ge Wang, Hongming Shan

Here, we propose to link in-plane and through-plane transformers for simultaneous in-plane denoising and through-plane deblurring, termed as LIT-Former, which can efficiently synergize in-plane and through-plane sub-tasks for 3D CT imaging and enjoy the advantages of both convolution and transformer networks.

Computed Tomography (CT) Deblurring +2

Paper
Code

A Review of Uncertainty Estimation and its Application in Medical Imaging

no code implementations • 16 Feb 2023 • Ke Zou, Zhihao Chen, Xuedong Yuan, Xiaojing Shen, Meng Wang, Huazhu Fu

We further discuss how they can be estimated in medical imaging.

Paper
Add Code

Feature Transformation for Cross-domain Few-shot Remote Sensing Scene Classification

no code implementations • 4 Mar 2022 • Qiaoling Chen, Zhihao Chen, Wei Luo

Moreover, FTM can be effectively learned on target domain in the case of few training data available and is agnostic to specific network structures.

Cross-Domain Few-Shot Scene Classification

Paper
Add Code

Deep Learning methods for automatic evaluation of delayed enhancement-MRI. The results of the EMIDEC challenge

no code implementations • 9 Aug 2021 • Alain Lalande, Zhihao Chen, Thibaut Pommier, Thomas Decourselle, Abdul Qayyum, Michel Salomon, Dominique Ginhac, Youssef Skandarani, Arnaud Boucher, Khawla Brahim, Marleen de Bruijne, Robin Camarasa, Teresa M. Correia, Xue Feng, Kibrom B. Girum, Anja Hennemuth, Markus Huellebrand, Raabid Hussain, Matthias Ivantsits, Jun Ma, Craig Meyer, Rishabh Sharma, Jixi Shi, Nikolaos V. Tsekos, Marta Varela, Xiyue Wang, Sen yang, Hannu Zhang, Yichi Zhang, Yuncheng Zhou, Xiahai Zhuang, Raphael Couturier, Fabrice Meriaudeau

The publicly available database consists of 150 exams divided into 50 cases with normal MRI after injection of a contrast agent and 100 cases with myocardial infarction (and then with a hyperenhanced area on DE-MRI), whatever their inclusion in the cardiac emergency department.

Paper
Add Code

Physics-informed generative neural network: an application to troposphere temperature prediction

no code implementations • 8 Jul 2021 • Zhihao Chen, Jie Gao, Weikai Wang, Zheng Yan

The generative neural network takes the mask as prior for the second-stage refined predictions.

Time Series Time Series Analysis

Paper
Add Code

Triple-cooperative Video Shadow Detection

1 code implementation • CVPR 2021 • Zhihao Chen, Liang Wan, Lei Zhu, Jia Shen, Huazhu Fu, Wennan Liu, Jing Qin

The bottleneck is the lack of a well-established dataset with high-quality annotations for video shadow detection.

Saliency Detection Semantic Segmentation +3

Paper
Code

A Multi-Task Mean Teacher for Semi-Supervised Shadow Detection

1 code implementation • CVPR 2020 • Zhihao Chen, Lei Zhu, Liang Wan, Song Wang, Wei Feng, Pheng-Ann Heng

To boost the shadow detection performance, this paper presents a multi-task mean teacher model for semi-supervised shadow detection by leveraging unlabeled data and exploring the learning of multiple information of shadows simultaneously.

Ranked #1 on Shadow Detection on SBU (using extra training data)

Shadow Detection

Paper
Code

Effects of Blur and Deblurring to Visual Object Tracking

no code implementations • 21 Aug 2019 • Qing Guo, Wei Feng, Zhihao Chen, Ruijun Gao, Liang Wan, Song Wang

In this paper, we address these two problems by constructing a Blurred Video Tracking benchmark, which contains a variety of videos with different levels of motion blurs, as well as ground truth tracking results for evaluating trackers.

Deblurring Image Deblurring +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.