Search Results for author: Zhongzhen Huang

Found 8 papers, 2 papers with code

Tool Calling: Enhancing Medication Consultation via Retrieval-Augmented Large Language Models

no code implementations27 Apr 2024 Zhongzhen Huang, Kui Xue, Yongqi Fan, Linjie Mu, Ruoyu Liu, Tong Ruan, Shaoting Zhang, Xiaofan Zhang

With experimental results, we show that our framework brings notable performance improvements and surpasses the previous counterparts in the evidence retrieval process in terms of evidence retrieval accuracy.

Grounded Knowledge-Enhanced Medical VLP for Chest X-Ray

no code implementations23 Apr 2024 Qiao Deng, Zhongzhen Huang, Yunqi Wang, Zhichuan Wang, Zhao Wang, Xiaofan Zhang, Qi Dou, Yeung Yu Hui, Edward S. Hui

Medical vision-language pre-training has emerged as a promising approach for learning domain-general representations of medical image and text.

Medical Visual Question Answering Question Answering +1

GuideGen: A Text-guided Framework for Joint CT Volume and Anatomical structure Generation

1 code implementation12 Mar 2024 Linrui Dai, Rongzhao Zhang, Zhongzhen Huang, Xiaofan Zhang

Secondly, our Conditional Image Generator autoregressively generates CT slices conditioned on a corresponding mask slice to incorporate both style information and anatomical guidance.

Modality-Aware and Shift Mixer for Multi-modal Brain Tumor Segmentation

no code implementations4 Mar 2024 Zhongzhen Huang, Linda Wei, Shaoting Zhang, Xiaofan Zhang

Combining images from multi-modalities is beneficial to explore various information in computer vision, especially in the medical domain.

Brain Tumor Segmentation Segmentation +1

ZePT: Zero-Shot Pan-Tumor Segmentation via Query-Disentangling and Self-Prompting

1 code implementation7 Dec 2023 Yankai Jiang, Zhongzhen Huang, Rongzhao Zhang, Xiaofan Zhang, Shaoting Zhang

The long-tailed distribution problem in medical image analysis reflects a high prevalence of common conditions and a low prevalence of rare ones, which poses a significant challenge in developing a unified model capable of identifying rare or novel tumor categories not encountered during training.

Organ Segmentation Segmentation +1

KiUT: Knowledge-injected U-Transformer for Radiology Report Generation

no code implementations CVPR 2023 Zhongzhen Huang, Xiaofan Zhang, Shaoting Zhang

Radiology report generation aims to automatically generate a clinically accurate and coherent paragraph from the X-ray image, which could relieve radiologists from the heavy burden of report writing.

Clinical Knowledge

One for All: One-stage Referring Expression Comprehension with Dynamic Reasoning

no code implementations31 Jul 2022 Zhipeng Zhang, Zhimin Wei, Zhongzhen Huang, Rui Niu, Peng Wang

However, one unsolved issue of these models is that the number of reasoning steps needs to be pre-defined and fixed before inference, ignoring the varying complexity of expressions.

Referring Expression Referring Expression Comprehension +2

Attract me to Buy: Advertisement Copywriting Generation with Multimodal Multi-structured Information

no code implementations7 May 2022 Zhipeng Zhang, Xinglin Hou, Kai Niu, Zhongzhen Huang, Tiezheng Ge, Yuning Jiang, Qi Wu, Peng Wang

Therefore, we present a dataset, E-MMAD (e-commercial multimodal multi-structured advertisement copywriting), which requires, and supports much more detailed information in text generation.

Text Generation Video Captioning

Cannot find the paper you are looking for? You can Submit a new open access paper.