Search Results for author: Hao Chen

Found 435 papers, 169 papers with code

Enhanced Multi-Channel Graph Convolutional Network for Aspect Sentiment Triplet Extraction

1 code implementation ACL 2022 Hao Chen, Zepeng Zhai, Fangxiang Feng, Ruifan Li, Xiaojie Wang

Specifically, we first define ten types of relations for ASTE task, and then adopt a biaffine attention module to embed these relations as an adjacent tensor between words in a sentence.

Aspect Sentiment Triplet Extraction Relation +1

Reinforced Counterfactual Data Augmentation for Dual Sentiment Classification

1 code implementation EMNLP 2021 Hao Chen, Rui Xia, Jianfei Yu

Data augmentation and adversarial perturbation approaches have recently achieved promising results in solving the over-fitting problem in many natural language processing (NLP) tasks including sentiment classification.

Classification counterfactual +4

Enhanced Representation with Contrastive Loss for Long-Tail Query Classification in e-commerce

no code implementations ECNLP (ACL) 2022 Lvxing Zhu, Hao Chen, Chao Wei, Weiru Zhang

To solve the above problem, we propose a novel method that leverages an auxiliary module to enhance the representations of long-tail queries by taking advantage of reliable supervised information of variant frequent queries.

Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL

no code implementations15 Apr 2024 Fangwei Zhong, Kui Wu, Hai Ci, Churan Wang, Hao Chen

We evaluate our tracker on several high-fidelity environments with challenging situations, such as distraction and occlusion.

Offline RL Q-Learning +2

Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting

2 code implementations10 Apr 2024 Hao Lu, Jiaqi Tang, Xinli Xu, Xu Cao, Yunpeng Zhang, Guoqing Wang, Dalong Du, Hao Chen, Yingcong Chen

Finally, for MC3D-Det joint training, the elaborate dataset merge strategy is designed to solve the problem of inconsistent camera numbers and camera parameters.

3D Object Detection Autonomous Driving +1

QMix: Quality-aware Learning with Mixed Noise for Robust Retinal Disease Diagnosis

no code implementations8 Apr 2024 Junlin Hou, Jilan Xu, Rui Feng, Hao Chen

Previous noise learning methods mainly considered noise arising from images being mislabeled, i. e. label noise, assuming that all mislabeled images are of high image quality.

MedIAnomaly: A comparative study of anomaly detection in medical images

1 code implementation6 Apr 2024 Yu Cai, Weiwen Zhang, Hao Chen, Kwang-Ting Cheng

Anomaly detection (AD) aims at detecting abnormal samples that deviate from the expected normal patterns.

Anomaly Classification Anomaly Detection +2

Rethinking Self-training for Semi-supervised Landmark Detection: A Selection-free Approach

1 code implementation6 Apr 2024 Haibo Jin, Haoxuan Che, Hao Chen

Self-training is a simple yet effective method for semi-supervised learning, during which pseudo-label selection plays an important role for handling confirmation bias.

Pseudo Label regression

Foundation Model for Advancing Healthcare: Challenges, Opportunities, and Future Directions

1 code implementation4 Apr 2024 Yuting He, Fuxiang Huang, Xinrui Jiang, Yuxiang Nie, Minghao Wang, Jiguang Wang, Hao Chen

To answer these questions, a comprehensive and deep survey of the challenges, opportunities, and future directions of HFMs is presented in this survey.

RS-Mamba for Large Remote Sensing Image Dense Prediction

1 code implementation3 Apr 2024 Sijie Zhao, Hao Chen, Xueliang Zhang, Pengfeng Xiao, Lei Bai, Wanli Ouyang

RSM is specifically designed to capture the global context of remote sensing images with linear complexity, facilitating the effective processing of large VHR images.

Building change detection for remote sensing images Change Detection +1

Cohort-Individual Cooperative Learning for Multimodal Cancer Survival Analysis

no code implementations3 Apr 2024 Huajun Zhou, Fengtao Zhou, Hao Chen

In this paper, we propose a Cohort-individual Cooperative Learning (CCL) framework to advance cancer survival analysis by collaborating knowledge decomposition and cohort guidance.

Survival Analysis

360+x: A Panoptic Multi-modal Scene Understanding Dataset

no code implementations1 Apr 2024 Hao Chen, Yuqi Hou, Chenyuan Qu, Irene Testini, Xiaohan Hong, Jianbo Jiao

While many existing datasets focus on scene understanding from a certain perspective (e. g. egocentric or third-person views), our dataset offers a panoptic perspective (i. e. multiple viewpoints with multiple data modalities).

Scene Understanding

Design as Desired: Utilizing Visual Question Answering for Multimodal Pre-training

no code implementations30 Mar 2024 Tongkun Su, Jun Li, Xi Zhang, Haibo Jin, Hao Chen, Qiong Wang, Faqin Lv, Baoliang Zhao, Yin Hu

In this work, we leverage descriptions in medical reports to design multi-granular question-answer pairs associated with different diseases, which assist the framework in pre-training without requiring extra annotations from experts.

Contrastive Learning Question Answering +1

Dia-LLaMA: Towards Large Language Model-driven CT Report Generation

no code implementations25 Mar 2024 Zhixuan Chen, Luyang Luo, Yequan Bie, Hao Chen

Medical report generation has achieved remarkable advancements yet has still been faced with several challenges.

Language Modelling Large Language Model +2

AC4: Algebraic Computation Checker for Circuit Constraints in ZKPs

no code implementations23 Mar 2024 Hao Chen, Minyu Chen, Ruibang Liu, Guoqiang Li

ZKP systems have surged attention and held a fundamental role in contemporary cryptography.

Prompt-Guided Adaptive Model Transformation for Whole Slide Image Classification

no code implementations19 Mar 2024 Yi Lin, Zhengjie ZHU, Kwang-Ting Cheng, Hao Chen

To address this issue, we propose PAMT, a novel Prompt-guided Adaptive Model Transformation framework that enhances MIL classification performance by seamlessly adapting pre-trained models to the specific characteristics of histopathology data.

Image Classification Multiple Instance Learning +1

Advancing COVID-19 Detection in 3D CT Scans

no code implementations18 Mar 2024 Qingqiu Li, Runtian Yuan, Junlin Hou, Jilan Xu, Yuejie Zhang, Rui Feng, Hao Chen

To make a more accurate diagnosis of COVID-19, we propose a straightforward yet effective model.

Domain Adaptation Using Pseudo Labels for COVID-19 Detection

no code implementations18 Mar 2024 Runtian Yuan, Qingqiu Li, Junlin Hou, Jilan Xu, Yuejie Zhang, Rui Feng, Hao Chen

In response to the need for rapid and accurate COVID-19 diagnosis during the global pandemic, we present a two-stage framework that leverages pseudo labels for domain adaptation to enhance the detection of COVID-19 from CT scans.

COVID-19 Diagnosis Domain Adaptation +1

Self-Supervised Video Desmoking for Laparoscopic Surgery

1 code implementation17 Mar 2024 Renlong Wu, Zhilu Zhang, Shuohao Zhang, Longfei Gou, Haobin Chen, Lei Zhang, Hao Chen, WangMeng Zuo

On the other hand, in order to enhance the desmoking performance, we further feed the valuable information from PS frame into models, where a masking strategy and a regularization term are presented to avoid trivial solutions.

Zippo: Zipping Color and Transparency Distributions into a Single Diffusion Model

no code implementations17 Mar 2024 Kangyang Xie, BinBin Yang, Hao Chen, Meng Wang, Cheng Zou, Hui Xue, Ming Yang, Chunhua Shen

Beyond the superiority of the text-to-image diffusion model in generating high-quality images, recent studies have attempted to uncover its potential for adapting the learned semantic knowledge to visual perception tasks.

Image Generation

3D Human Reconstruction in the Wild with Synthetic Data Using Generative Models

no code implementations17 Mar 2024 Yongtao Ge, Wenjia Wang, Yongfan Chen, Hao Chen, Chunhua Shen

In this work, we show that synthetic data created by generative models is complementary to computer graphics (CG) rendered data for achieving remarkable generalization performance on diverse real-world scenes for 3D human pose and shape estimation (HPS).

3D human pose and shape estimation 3D Human Reconstruction

CoLeCLIP: Open-Domain Continual Learning via Joint Task Prompt and Vocabulary Learning

1 code implementation15 Mar 2024 Yukun Li, Guansong Pang, Wei Suo, Chenchen Jing, Yuling Xi, Lingqiao Liu, Hao Chen, Guoqiang Liang, Peng Wang

Large pre-trained VLMs like CLIP have demonstrated superior zero-shot recognition ability, and a number of recent studies leverage this ability to mitigate catastrophic forgetting in CL, but they focus on closed-set CL in a single domain dataset.

Class Incremental Learning Incremental Learning +1

Histo-Genomic Knowledge Distillation For Cancer Prognosis From Histopathology Whole Slide Images

1 code implementation15 Mar 2024 Zhikang Wang, Yumeng Zhang, Yingxue Xu, Seiya Imoto, Hao Chen, Jiangning Song

G-HANet is expected to be explored as a useful tool by the research community to address the current bottleneck of insufficient histo-genomic data pairing in the context of cancer prognosis and precision oncology.

Benchmarking Knowledge Distillation +1

Rethinking Autoencoders for Medical Anomaly Detection from A Theoretical Perspective

no code implementations14 Mar 2024 Yu Cai, Hao Chen, Kwang-Ting Cheng

To the best of our knowledge, this is the first effort to theoretically clarify the principles and design philosophy of AE for anomaly detection.

Anomaly Detection Philosophy

Iterative Online Image Synthesis via Diffusion Model for Imbalanced Classification

no code implementations13 Mar 2024 Shuhan LI, Yi Lin, Hao Chen, Kwang-Ting Cheng

In this paper, we introduce an Iterative Online Image Synthesis (IOIS) framework to address the class imbalance problem in medical image classification.

Image Classification Image Generation +3

MambaMIL: Enhancing Long Sequence Modeling with Sequence Reordering in Computational Pathology

1 code implementation11 Mar 2024 Shu Yang, Yihui Wang, Hao Chen

Multiple Instance Learning (MIL) has emerged as a dominant paradigm to extract discriminative feature representations within Whole Slide Images (WSIs) in computational pathology.

Multiple Instance Learning whole slide images

Learning with Noisy Foundation Models

no code implementations11 Mar 2024 Hao Chen, Jindong Wang, Zihan Wang, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj

Foundation models are usually pre-trained on large-scale datasets and then adapted to downstream tasks through tuning.

Diffusion Models Trained with Large Data Are Transferable Visual Models

no code implementations10 Mar 2024 Guangkai Xu, Yongtao Ge, MingYu Liu, Chengxiang Fan, Kangyang Xie, Zhiyue Zhao, Hao Chen, Chunhua Shen

We show that, simply initializing image understanding models using a pre-trained UNet (or transformer) of diffusion models, it is possible to achieve remarkable transferable performance on fundamental vision perception tasks using a moderate amount of target data (even synthetic data only), including monocular depth, surface normal, image segmentation, matting, human pose estimation, among virtually many others.

Image Matting Image Segmentation +2

HistGen: Histopathology Report Generation via Local-Global Feature Encoding and Cross-modal Context Interaction

1 code implementation8 Mar 2024 Zhengrui Guo, Jiabo Ma, Yingxue Xu, Yihui Wang, Liansheng Wang, Hao Chen

Histopathology serves as the gold standard in cancer diagnosis, with clinical reports being vital in interpreting and understanding this process, guiding cancer treatment and patient care.

Medical Report Generation Multiple Instance Learning +3

$\text{R}^2$-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations

2 code implementations7 Mar 2024 Xiang Li, Kai Qiu, Jinglu Wang, Xiaohao Xu, Rita Singh, Kashu Yamazak, Hao Chen, Xiaonan Huang, Bhiksha Raj

Referring perception, which aims at grounding visual objects with multimodal referring guidance, is essential for bridging the gap between humans, who provide instructions, and the environment where intelligent systems perceive.

Benchmarking

MolNexTR: A Generalized Deep Learning Model for Molecular Image Recognition

1 code implementation6 Mar 2024 Yufan Chen, Ching Ting Leung, Yong Huang, Jianwei Sun, Hao Chen, Hanyu Gao

In addition, it employs a series of novel augmentation algorithms to significantly enhance the robustness and performance of the model.

Data Augmentation

PI-AstroDeconv: A Physics-Informed Unsupervised Learning Method for Astronomical Image Deconvolution

no code implementations4 Mar 2024 Shulei Ni, Yisheng Qiu, YunChun Chen, Zihao Song, Hao Chen, Xuejian Jiang, Huaxi Chen

In the imaging process of an astronomical telescope, the deconvolution of its beam or Point Spread Function (PSF) is a crucial task.

Image Deconvolution

Boosting Box-supervised Instance Segmentation with Pseudo Depth

no code implementations2 Mar 2024 Xinyi Yu, Ling Yan, PengTao Jiang, Hao Chen, Bo Li, Lin Yuanbo Wu, Linlin Ou

This innovative approach empowers the network to simultaneously predict masks and depth, enhancing its ability to capture nuanced depth-related information during the instance segmentation process.

Box-supervised Instance Segmentation Depth Estimation +4

Data-efficient Event Camera Pre-training via Disentangled Masked Modeling

no code implementations1 Mar 2024 Zhenpeng Huang, Chao Li, Hao Chen, Yongjian Deng, Yifeng Geng, LiMin Wang

Our pre-training overcomes the limitations of previous methods, which either sacrifice temporal information by converting event sequences into 2D images for utilizing pre-trained image models or directly employ paired image data for knowledge distillation to enhance the learning of event streams.

Knowledge Distillation Self-Supervised Learning

A Novel Approach to Industrial Defect Generation through Blended Latent Diffusion Model with Online Adaptation

1 code implementation29 Feb 2024 Hanxi Li, Zhengxun Zhang, Hao Chen, Lin Wu, Bo Li, Deyin Liu, Mingwen Wang

Effectively addressing the challenge of industrial Anomaly Detection (AD) necessitates an ample supply of defective samples, a constraint often hindered by their scarcity in industrial contexts.

Anomaly Detection Image Generation

Anatomy-guided fiber trajectory distribution estimation for cranial nerves tractography

no code implementations29 Feb 2024 Lei Xie, Qingrun Zeng, Huajun Zhou, Guoqiang Xie, Mingchu Li, Jiahao Huang, Jianan Cui, Hao Chen, Yuanjing Feng

Diffusion MRI tractography is an important tool for identifying and analyzing the intracranial course of cranial nerves (CNs).

Anatomy

VoCo: A Simple-yet-Effective Volume Contrastive Learning Framework for 3D Medical Image Analysis

1 code implementation27 Feb 2024 Linshan Wu, Jiaxin Zhuang, Hao Chen

Through this pretext task, VoCo implicitly encodes the contextual position priors into model representations without the guidance of annotations, enabling us to effectively improve the performance of downstream tasks that require high-level semantics.

Contrastive Learning Position +1

Structure Guided Large Language Model for SQL Generation

no code implementations19 Feb 2024 Qinggang Zhang, Junnan Dong, Hao Chen, Wentao Li, Feiran Huang, Xiao Huang

Existing models typically input queries and database schemas into the LLM and rely on the LLM to perform semantic-structure matching and generate structured SQL.

Language Modelling Large Language Model

Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLM

no code implementations18 Feb 2024 Zijin Hong, Zheng Yuan, Hao Chen, Qinggang Zhang, Feiran Huang, Xiao Huang

Generating accurate SQL for user queries (text-to-SQL) is a long-standing problem since the generation of the SQL requires comprehending the query and database and retrieving the accurate data from the database accordingly.

Text-To-SQL

Large Language Model Interaction Simulator for Cold-Start Item Recommendation

no code implementations14 Feb 2024 Feiran Huang, Zhenghang Yang, Junyi Jiang, Yuanchen Bei, Yijie Zhang, Hao Chen

To address this challenge, we propose an LLM Interaction Simulator (LLM-InS) to model users' behavior patterns based on the content aspect.

Collaborative Filtering Language Modelling +2

Multi-Behavior Collaborative Filtering with Partial Order Graph Convolutional Networks

no code implementations12 Feb 2024 Yijie Zhang, Yuanchen Bei, Hao Chen, Qijie Shen, Zheng Yuan, Huan Gong, Senzhang Wang, Feiran Huang, Xiao Huang

POG defines the partial order relation of multiple behaviors and models behavior combinations as weighted edges to merge separate behavior graphs into a joint POG.

Collaborative Filtering Recommendation Systems

Physics-Informed Neural Networks with Hard Linear Equality Constraints

1 code implementation11 Feb 2024 Hao Chen, Gonzalo E. Constante Flores, Can Li

The incorporation of physics into neural networks can improve generalization and data efficiency.

UniTSyn: A Large-Scale Dataset Capable of Enhancing the Prowess of Large Language Models for Program Testing

no code implementations4 Feb 2024 Yifeng He, Jiabo Huang, Yuyang Rong, Yiwen Guo, Ethan Wang, Hao Chen

The remarkable capability of large language models (LLMs) in generating high-quality code has drawn increasing attention in the software testing community.

A General Framework for Learning from Weak Supervision

1 code implementation2 Feb 2024 Hao Chen, Jindong Wang, Lei Feng, Xiang Li, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj

Weakly supervised learning generally faces challenges in applicability to various scenarios with diverse weak supervision and in scalability due to the complexity of existing algorithms, thereby hindering the practical deployment.

Weakly-supervised Learning

On Catastrophic Inheritance of Large Foundation Models

no code implementations2 Feb 2024 Hao Chen, Bhiksha Raj, Xing Xie, Jindong Wang

Large foundation models (LFMs) are claiming incredible performances.

ExtremeCast: Boosting Extreme Value Prediction for Global Weather Forecast

1 code implementation2 Feb 2024 Wanghan Xu, Kang Chen, Tao Han, Hao Chen, Wanli Ouyang, Lei Bai

Data-driven weather forecast based on machine learning (ML) has experienced rapid development and demonstrated superior performance in the global medium-range forecast compared to traditional physics-based dynamical models.

Value prediction

Uncertainty-Aware Explainable Recommendation with Large Language Models

no code implementations31 Jan 2024 Yicui Peng, Hao Chen, ChingSheng Lin, Guo Huang, Jinrong Hu, Hui Guo, Bin Kong, Shu Hu, Xi Wu, Xin Wang

Providing explanations within the recommendation system would boost user satisfaction and foster trust, especially by elaborating on the reasons for selecting recommended items tailored to the user.

Explainable Recommendation Multi-Task Learning

Time Series Supplier Allocation via Deep Black-Litterman Model

1 code implementation30 Jan 2024 Jiayuan Luo, Wentao Zhang, Yuchen Fang, Xiaowei Gao, Dingyi Zhuang, Hao Chen, Xinke Jiang

Time Series Supplier Allocation (TSSA) poses a complex NP-hard challenge, aimed at refining future order dispatching strategies to satisfy order demands with maximum supply efficiency fully.

Navigate Time Series

Macro Graph Neural Networks for Online Billion-Scale Recommender Systems

1 code implementation26 Jan 2024 Hao Chen, Yuanchen Bei, Qijie Shen, Yue Xu, Sheng Zhou, Wenbing Huang, Feiran Huang, Senzhang Wang, Xiao Huang

Predicting Click-Through Rate (CTR) in billion-scale recommender systems poses a long-standing challenge for Graph Neural Networks (GNNs) due to the overwhelming computational complexity involved in aggregating billions of neighbors.

Recommendation Systems

Deep Joint Source-Channel Coding for Efficient and Reliable Cross-Technology Communication

no code implementations26 Jan 2024 Shumin Yao, Xiaodong Xu, Hao Chen, Yaping Sun, Qinglin Zhao

Cross-technology communication (CTC) is a promising technique that enables direct communications among incompatible wireless technologies without needing hardware modification.

Medical Image Debiasing by Learning Adaptive Agreement from a Biased Council

no code implementations22 Jan 2024 Luyang Luo, Xin Huang, Minghao Wang, Zhuoyue Wan, Hao Chen

Specifically, the debiasing model is required to learn adaptive agreement with the biased council by agreeing on the correctly predicted samples and disagreeing on the wrongly predicted samples by the biased council.

Attribute Image Classification +1

Observation-Guided Meteorological Field Downscaling at Station Scale: A Benchmark and a New Method

no code implementations22 Jan 2024 Zili Liu, Hao Chen, Lei Bai, Wenyuan Li, Keyan Chen, Zhengyi Wang, Wanli Ouyang, Zhengxia Zou, Zhenwei Shi

In this paper, we extend meteorological downscaling to arbitrary scattered station scales, establish a brand new benchmark and dataset, and retrieve meteorological states at any given station location from a coarse-resolution meteorological field.

Super-Resolution Weather Forecasting

Codebook-enabled Generative End-to-end Semantic Communication Powered by Transformer

no code implementations22 Jan 2024 PeiGen Ye, Yaping Sun, Shumin Yao, Hao Chen, Xiaodong Xu, Shuguang Cui

Codebook-based generative semantic communication attracts increasing attention, since only indices are required to be transmitted when the codebook is shared between transmitter and receiver.

Image Generation

MDGNN: Multi-Relational Dynamic Graph Neural Network for Comprehensive and Dynamic Stock Investment Prediction

no code implementations19 Jan 2024 Hao Qian, Hongting Zhou, Qian Zhao, Hao Chen, Hongxiang Yao, Jingwei Wang, Ziqi Liu, Fei Yu, Zhiqiang Zhang, Jun Zhou

The stock market is a crucial component of the financial system, but predicting the movement of stock prices is challenging due to the dynamic and intricate relations arising from various aspects such as economic indicators, financial reports, global news, and investor sentiment.

Distributed Task-Oriented Communication Networks with Multimodal Semantic Relay and Edge Intelligence

no code implementations18 Jan 2024 Jie Guo, Hao Chen, Bin Song, Yuhao Chi, Chau Yuen, Fei Richard Yu, Geoffrey Ye Li, Dusit Niyato

In this article, we present a novel framework, named distributed task-oriented communication networks (DTCN), based on recent advances in multimodal semantic transmission and edge intelligence.

Learning to detect cloud and snow in remote sensing images from noisy labels

no code implementations17 Jan 2024 Zili Liu, Hao Chen, Wenyuan Li, Keyan Chen, Zipeng Qi, Chenyang Liu, Zhengxia Zou, Zhenwei Shi

This paper is the first to consider the impact of label noise on the detection of clouds and snow in remote sensing images.

Semantic Segmentation

BoNuS: Boundary Mining for Nuclei Segmentation with Partial Point Labels

1 code implementation15 Jan 2024 Yi Lin, Zeyu Wang, Dong Zhang, Kwang-Ting Cheng, Hao Chen

To alleviate this problem, in this paper, we propose a weakly-supervised nuclei segmentation method that only requires partial point labels of nuclei.

Multiple Instance Learning Segmentation

TAROT: A Hierarchical Framework with Multitask Co-Pretraining on Semi-Structured Data towards Effective Person-Job Fit

no code implementations15 Jan 2024 Yihan Cao, Xu Chen, Lun Du, Hao Chen, Qiang Fu, Shi Han, Yushu Du, Yanbin Kang, Guangming Lu, Zi Li

Person-job fit is an essential part of online recruitment platforms in serving various downstream applications like Job Search and Candidate Recommendation.

DeepPhysiNet: Bridging Deep Learning and Atmospheric Physics for Accurate and Continuous Weather Modeling

1 code implementation4 Jan 2024 Wenyuan Li, Zili Liu, Keyan Chen, Hao Chen, Shunlin Liang, Zhengxia Zou, Zhenwei Shi

Next, we construct hyper-networks based on deep learning methods to directly learn weather patterns from a large amount of meteorological data.

Weather Forecasting

Prototypical Information Bottlenecking and Disentangling for Multimodal Cancer Survival Prediction

1 code implementation3 Jan 2024 Yilan Zhang, Yingxue Xu, Jianqi Chen, Fengying Xie, Hao Chen

Despite advantages of multimodal learning for cancer survival prediction, massive redundancy in multimodal data prevents it from extracting discriminative and compact information: (1) An extensive amount of intra-modal task-unrelated information blurs discriminability, especially for gigapixel whole slide images (WSIs) with many patches in pathology and thousands of pathways in genomic data, leading to an ``intra-modal redundancy" issue.

Disentanglement Survival Prediction +1

MOC-RVQ: Multilevel Codebook-assisted Digital Generative Semantic Communication

no code implementations2 Jan 2024 Yingbin Zhou, Yaping Sun, GuanYing Chen, Xiaodong Xu, Hao Chen, Binhong Huang, Shuguang Cui, Ping Zhang

Vector quantization-based image semantic communication systems have successfully boosted transmission efficiency, but face a challenge with conflicting requirements between codebook design and digital constellation modulation.

Quantization

Multimodal Sentiment Analysis with Missing Modality: A Knowledge-Transfer Approach

no code implementations28 Dec 2023 Weide Liu, Huijing Zhan, Hao Chen, Fengmao Lv

Multimodal sentiment analysis aims to identify the emotions expressed by individuals through visual, language, and acoustic cues.

Multimodal Sentiment Analysis Transfer Learning

Mobility and Cost Aware Inference Accelerating Algorithm for Edge Intelligence

no code implementations27 Dec 2023 Xin Yuan, Ning li, Kang Wei, Wenchao Xu, Quan Chen, Hao Chen, Song Guo

The model segmentation without user mobility has been investigated deeply by previous works.

Segmentation

Mutual Information as Intrinsic Reward of Reinforcement Learning Agents for On-demand Ride Pooling

no code implementations23 Dec 2023 Xianjie Zhang, Jiahao Sun, Chen Gong, Kai Wang, Yifei Cao, Hao Chen, Yu Liu

The emergence of on-demand ride pooling services allows each vehicle to serve multiple passengers at a time, thus increasing drivers' income and enabling passengers to travel at lower prices than taxi/car on-demand services (only one passenger can be assigned to a car at a time like UberX and Lyft).

Reinforcement Learning (RL)

Professional Network Matters: Connections Empower Person-Job Fit

no code implementations19 Dec 2023 Hao Chen, Lun Du, Yuxuan Lu, Qiang Fu, Xu Chen, Shi Han, Yanbin Kang, Guangming Lu, Zi Li

Online recruitment platforms typically employ Person-Job Fit models in the core service that automatically match suitable job seekers with appropriate job positions.

Towards an end-to-end artificial intelligence driven global weather forecasting system

no code implementations18 Dec 2023 Kun Chen, Lei Bai, Fenghua Ling, Peng Ye, Tao Chen, Jing-Jia Luo, Hao Chen, Yi Xiao, Kang Chen, Tao Han, Wanli Ouyang

Initial states are typically generated by traditional data assimilation components, which are computational expensive and time-consuming.

Weather Forecasting

Model-Free Change Point Detection for Mixing Processes

no code implementations14 Dec 2023 Hao Chen, Abhishek Gupta, Yin Sun, Ness Shroff

In particular, we provide performance guarantees for the MMD-CUSUM test under $\alpha$, $\beta$, and $\phi$-mixing processes, which significantly expands its utility beyond the i. i. d.

Change Point Detection

PromptBench: A Unified Library for Evaluation of Large Language Models

1 code implementation13 Dec 2023 Kaijie Zhu, Qinlin Zhao, Hao Chen, Jindong Wang, Xing Xie

The evaluation of large language models (LLMs) is crucial to assess their performance and mitigate potential security risks.

Prompt Engineering

KnowGPT: Knowledge Injection for Large Language Models

no code implementations11 Dec 2023 Qinggang Zhang, Junnan Dong, Hao Chen, Daochen Zha, Zailiang Yu, Xiao Huang

Generative Large Language Models (LLMs), such as ChatGPT, offer interactive APIs that can answer common questions at a human-expert level.

Knowledge Graphs Question Answering +1

Reinforcement Neighborhood Selection for Unsupervised Graph Anomaly Detection

no code implementations9 Dec 2023 Yuanchen Bei, Sheng Zhou, Qiaoyu Tan, Hao Xu, Hao Chen, Zhao Li, Jiajun Bu

To address these issues, we utilize the advantages of reinforcement learning in adaptively learning in complex environments and propose a novel method that incorporates Reinforcement neighborhood selection for unsupervised graph ANomaly Detection (RAND).

Graph Anomaly Detection Representation Learning

Shapley Values-enabled Progressive Pseudo Bag Augmentation for Whole Slide Image Classification

no code implementations9 Dec 2023 Renao Yan, Qiehe Sun, Cheng Jin, Yiqing Liu, Yonghong He, Tian Guan, Hao Chen

While most of the conventional MIL methods use attention scores to estimate instance importance scores (IIS) which contribute to the prediction of the slide labels, these often lead to skewed attention distributions and inaccuracies in identifying crucial instances.

Image Classification Multiple Instance Learning

Learning for Semantic Knowledge Base-Guided Online Feature Transmission in Dynamic Channels

no code implementations30 Nov 2023 Xiangyu Gao, Yaping Sun, Dongyu Wei, Xiaodong Xu, Hao Chen, Hao Yin, Shuguang Cui

In this context, we address the problem of efficient remote object recognition by optimizing feature transmission between mobile devices and edge servers.

Autonomous Vehicles Decision Making +2

HumanRecon: Neural Reconstruction of Dynamic Human Using Geometric Cues and Physical Priors

1 code implementation26 Nov 2023 Junhui Yin, Wei Yin, Hao Chen, Xuqian Ren, Zhanyu Ma, Jun Guo, Yifan Liu

These priors ensure the color rendered along rays to be robust to view direction and reduce the inherent ambiguities of density estimated along rays.

Novel View Synthesis

Hybrid Precoding and Combining for mmWave Full-Duplex Joint Radar and Communication Systems under Self-Interference

no code implementations25 Nov 2023 Murat Bayraktar, Nuria González-Prelcic, Hao Chen

Specifically, we introduce a generalized eigenvalue-based precoder design that considers the downlink user rate, the radar gain, and the SI suppression.

A Parameterized Generative Adversarial Network Using Cyclic Projection for Explainable Medical Image Classification

no code implementations24 Nov 2023 Xiangyu Xiong, Yue Sun, Xiaohong Liu, Chan-Tong Lam, Tong Tong, Hao Chen, Qinquan Gao, Wei Ke, Tao Tan

Although current data augmentation methods are successful to alleviate the data insufficiency, conventional augmentation are primarily intra-domain while advanced generative adversarial networks (GANs) generate images remaining uncertain, particularly in small-scale datasets.

Data Augmentation Generative Adversarial Network +2

Advancing Transformer Architecture in Long-Context Large Language Models: A Comprehensive Survey

1 code implementation21 Nov 2023 Yunpeng Huang, Jingwei Xu, Junyu Lai, Zixu Jiang, Taolue Chen, Zenan Li, Yuan YAO, Xiaoxing Ma, Lijuan Yang, Hao Chen, Shupeng Li, Penghao Zhao

Transformer-based Large Language Models (LLMs) have been applied in diverse areas such as knowledge bases, human interfaces, and dynamic agents, and marking a stride towards achieving Artificial General Intelligence (AGI).

Navigate

AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort

no code implementations19 Nov 2023 Wen Wang, Canyu Zhao, Hao Chen, Zhekai Chen, Kecheng Zheng, Chunhua Shen

We empirically find that sparse control conditions, such as bounding boxes, are suitable for layout planning, while dense control conditions, e. g., sketches and keypoints, are suitable for generating high-quality image content.

Image Generation Story Visualization

Knowledge Graph Construction in Power Distribution Networks

no code implementations15 Nov 2023 Xiang Li, Che Wang, Bing Li, Hao Chen, Sizhe Li

In this paper, we propose a method for knowledge graph construction in power distribution networks.

Entity Linking graph construction +1

Alleviating Behavior Data Imbalance for Multi-Behavior Graph Collaborative Filtering

no code implementations12 Nov 2023 Yijie Zhang, Yuanchen Bei, Shiqi Yang, Hao Chen, Zhiqing Li, Lijia Chen, Feiran Huang

To this end, we propose IMGCF, a simple but effective model to alleviate behavior data imbalance for multi-behavior graph collaborative filtering.

Collaborative Filtering Multi-Task Learning +1

Rethinking and Improving Multi-task Learning for End-to-end Speech Translation

1 code implementation7 Nov 2023 Yuhao Zhang, Chen Xu, Bei Li, Hao Chen, Tong Xiao, Chunliang Zhang, Jingbo Zhu

Significant improvements in end-to-end speech translation (ST) have been achieved through the application of multi-task learning.

Multi-Task Learning

Channel Estimation and Training Design for Active RIS Aided Wireless Communications

no code implementations6 Nov 2023 Hao Chen, Nanxi Li, Ruizhe Long, Ying-Chang Liang

To address this issue, we further investigate this ARIS-specific channel estimation problem and propose a least-square (LS) based channel estimator, whose performance can be further improved with the design on ARIS reflection patterns at the channel training phase.

CompeteAI: Understanding the Competition Behaviors in Large Language Model-based Agents

no code implementations26 Oct 2023 Qinlin Zhao, Jindong Wang, Yixuan Zhang, Yiqiao Jin, Kaijie Zhu, Hao Chen, Xing Xie

Large language models (LLMs) have been widely used as agents to complete different tasks, such as personal assistance or event planning.

Language Modelling Large Language Model

De novo protein design using geometric vector field networks

no code implementations18 Oct 2023 Weian Mao, Muzhi Zhu, Zheng Sun, Shuaike Shen, Lin Yuanbo Wu, Hao Chen, Chunhua Shen

Most prior encoders rely on atom-wise features, such as angles and distances between atoms, which are not available in this context.

Protein Design

Object-aware Inversion and Reassembly for Image Editing

no code implementations18 Oct 2023 Zhen Yang, Ganggui Ding, Wen Wang, Hao Chen, Bohan Zhuang, Chunhua Shen

Subsequently, we propose an additional reassembly step to seamlessly integrate the respective editing results and the non-editing region to obtain the final edited image.

Benchmarking Denoising +1

RGM: A Robust Generalizable Matching Model

1 code implementation18 Oct 2023 Songyan Zhang, Xinyu Sun, Hao Chen, Bo Li, Chunhua Shen

Finding corresponding pixels within a pair of images is a fundamental computer vision task with various applications.

Optical Flow Estimation

Towards Intelligent Network Management: Leveraging AI for Network Service Detection

no code implementations14 Oct 2023 Khuong N. Nguyen, Abhishek Sehgal, Yuming Zhu, Junsu Choi, Guanbo Chen, Hao Chen, Boon Loong Ng, Charlie Zhang

As the complexity and scale of modern computer networks continue to increase, there has emerged an urgent need for precise traffic analysis, which plays a pivotal role in cutting-edge wireless connectivity technologies.

Management Traffic Classification

Hoeffding's Inequality for Markov Chains under Generalized Concentrability Condition

no code implementations4 Oct 2023 Hao Chen, Abhishek Gupta, Yin Sun, Ness Shroff

This paper studies Hoeffding's inequality for Markov chains under the generalized concentrability condition defined via integral probability metric (IPM).

Completing Visual Objects via Bridging Generation and Segmentation

no code implementations1 Oct 2023 Xiang Li, Yinpeng Chen, Chung-Ching Lin, Hao Chen, Kai Hu, Rita Singh, Bhiksha Raj, Lijuan Wang, Zicheng Liu

This paper presents a novel approach to object completion, with the primary goal of reconstructing a complete object from its partially visible components.

Image Generation Object +1

Unpaired Optical Coherence Tomography Angiography Image Super-Resolution via Frequency-Aware Inverse-Consistency GAN

no code implementations29 Sep 2023 Weiwen Zhang, Dawei Yang, Haoxuan Che, An Ran Ran, Carol Y. Cheung, Hao Chen

For optical coherence tomography angiography (OCTA) images, a limited scanning rate leads to a trade-off between field-of-view (FOV) and imaging resolution.

Generative Adversarial Network Image Super-Resolution

Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks

no code implementations29 Sep 2023 Hao Chen, Jindong Wang, Ankit Shah, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj

This paper aims to understand the nature of noise in pre-training datasets and to mitigate its impact on downstream tasks.

Cross-Modal Translation and Alignment for Survival Analysis

1 code implementation ICCV 2023 Fengtao Zhou, Hao Chen

With the rapid advances in high-throughput sequencing technologies, the focus of survival analysis has shifted from examining clinical indicators to incorporating genomic profiles with pathological images.

Survival Analysis Survival Prediction +1

Enabling Quartile-based Estimated-Mean Gradient Aggregation As Baseline for Federated Image Classifications

no code implementations21 Sep 2023 Yusen Wu, Jamie Deng, Hao Chen, Phuong Nguyen, Yelena Yesha

Federated Learning (FL) has revolutionized how we train deep neural networks by enabling decentralized collaboration while safeguarding sensitive data and improving model performance.

Federated Learning

Soft Merging: A Flexible and Robust Soft Model Merging Approach for Enhanced Neural Network Performance

no code implementations21 Sep 2023 Hao Chen, Yusen Wu, Phuong Nguyen, Chao Liu, Yelena Yesha

This merging process not only enhances the model performance by converging to a better local optimum, but also minimizes computational costs, offering an efficient and explicit learning process integrated with stochastic gradient descent.

Progressive Feature Adjustment for Semi-supervised Learning from Pretrained Models

no code implementations9 Sep 2023 Hai-Ming Xu, Lingqiao Liu, Hao Chen, Ehsan Abbasnejad, Rafael Felix

As an effective way to alleviate the burden of data annotation, semi-supervised learning (SSL) provides an attractive solution due to its ability to leverage both labeled and unlabeled data to build a predictive model.

Code Representation Pre-training with Complements from Program Executions

no code implementations4 Sep 2023 Jiabo Huang, Jianyu Zhao, Yuyang Rong, Yiwen Guo, Yifeng He, Hao Chen

The test cases are obtained with the assistance of a customized fuzzer and are only required during pre-training.

Code Search Language Modelling

DARC: Distribution-Aware Re-Coloring Model for Generalizable Nucleus Segmentation

1 code implementation1 Sep 2023 Shengcong Chen, Changxing Ding, DaCheng Tao, Hao Chen

Second, we propose a new instance normalization method that is robust to the variation in foreground-background ratios.

Segmentation

Unsupervised Domain Adaptation for Anatomical Landmark Detection

1 code implementation25 Aug 2023 Haibo Jin, Haoxuan Che, Hao Chen

The framework leverages self-training and domain adversarial learning to address the domain gap during adaptation.

Unsupervised Domain Adaptation

PromptMRG: Diagnosis-Driven Prompts for Medical Report Generation

1 code implementation24 Aug 2023 Haibo Jin, Haoxuan Che, Yi Lin, Hao Chen

To address these challenges, we propose diagnosis-driven prompts for medical report generation (PromptMRG), a novel framework that aims to improve the diagnostic accuracy of MRG with the guidance of diagnosis-aware prompts.

Medical Report Generation

Diagnosing Infeasible Optimization Problems Using Large Language Models

no code implementations23 Aug 2023 Hao Chen, Gonzalo E. Constante-Flores, Can Li

Decision-making problems can be represented as mathematical optimization models, finding wide applications in fields such as economics, engineering and manufacturing, transportation, and health care.

Chatbot Decision Making +1

In-Rack Test Tube Pose Estimation Using RGB-D Data

no code implementations21 Aug 2023 Hao Chen, Weiwei Wan, Masaki Matsushita, Takeyuki Kotaka, Kensuke Harada

Accurate robotic manipulation of test tubes in biology and medical industries is becoming increasingly important to address workforce shortages and improve worker safety.

Point Cloud Registration Pose Estimation

Karma: Adaptive Video Streaming via Causal Sequence Modeling

no code implementations20 Aug 2023 Bowei Xu, Hao Chen, Zhan Ma

Unlike direct observation-to-action mapping, Karma recurrently maintains a multi-dimensional time series of observations, returns, and actions as input and employs causal sequence modeling via a decision transformer to determine the next action.

Interpretation on Multi-modal Visual Fusion

no code implementations19 Aug 2023 Hao Chen, Haoran Zhou, Yongjian Deng

In this paper, we present an analytical framework and a novel metric to shed light on the interpretation of the multimodal vision community.

Better Zero-Shot Reasoning with Role-Play Prompting

2 code implementations15 Aug 2023 Aobo Kong, Shiwan Zhao, Hao Chen, Qicheng Li, Yong Qin, Ruiqi Sun, Xin Zhou, Enzhi Wang, Xiaohang Dong

This highlights its potential to augment the reasoning capabilities of LLMs.

When Monte-Carlo Dropout Meets Multi-Exit: Optimizing Bayesian Neural Networks on FPGA

1 code implementation13 Aug 2023 Hongxiang Fan, Hao Chen, Liam Castelli, Zhiqiang Que, He Li, Kenneth Long, Wayne Luk

Bayesian Neural Networks (BayesNNs) have demonstrated their capability of providing calibrated prediction for safety-critical applications such as medical imaging and autonomous driving.

Autonomous Driving

Target before Shooting: Accurate Anomaly Detection and Localization under One Millisecond via Cascade Patch Retrieval

1 code implementation13 Aug 2023 Hanxi Li, Jianfei Hu, Bo Li, Hao Chen, Yongbin Zheng, Chunhua Shen

In this framework, the anomaly detection problem is solved via a cascade patch retrieval procedure that retrieves the nearest neighbors for each test image patch in a coarse-to-fine fashion.

Supervised Anomaly Detection

SegPrompt: Boosting Open-world Segmentation via Category-level Prompt Learning

1 code implementation ICCV 2023 Muzhi Zhu, Hengtao Li, Hao Chen, Chengxiang Fan, Weian Mao, Chenchen Jing, Yifan Liu, Chunhua Shen

In this work, we propose a novel training mechanism termed SegPrompt that uses category information to improve the model's class-agnostic segmentation ability for both known and unknown categories.

Open-World Instance Segmentation Segmentation +1

DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models

1 code implementation NeurIPS 2023 Weijia Wu, Yuzhong Zhao, Hao Chen, YuChao Gu, Rui Zhao, Yefei He, Hong Zhou, Mike Zheng Shou, Chunhua Shen

To showcase the power of the proposed approach, we generate datasets with rich dense pixel-wise labels for a wide range of downstream tasks, including semantic segmentation, instance segmentation, and depth estimation.

Depth Estimation Domain Generalization +5

Phase Matching for Out-of-Distribution Generalization

no code implementations24 Jul 2023 Chengming Hu, Yeqian Du, Rui Wang, Hao Chen

In this paper, we aim to clarify the relationships between Domain Generalization (DG) and the frequency components, and explore the spatial relationships of the phase spectrum.

Domain Generalization Out-of-Distribution Generalization +1

Collaborative Graph Neural Networks for Attributed Network Embedding

1 code implementation22 Jul 2023 Qiaoyu Tan, Xin Zhang, Xiao Huang, Hao Chen, Jundong Li, Xia Hu

Graph neural networks (GNNs) have shown prominent performance on attributed network embedding.

Attribute Network Embedding

Improving Transferability of Adversarial Examples via Bayesian Attacks

no code implementations21 Jul 2023 Qizhang Li, Yiwen Guo, Xiaochen Yang, WangMeng Zuo, Hao Chen

Our ICLR work advocated for enhancing transferability in adversarial examples by incorporating a Bayesian formulation into model parameters, which effectively emulates the ensemble of infinitely many deep neural networks, while, in this paper, we introduce a novel extension by incorporating the Bayesian formulation into the model input as well, enabling the joint diversification of both the model input and model parameters.

Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image

1 code implementation ICCV 2023 Wei Yin, Chi Zhang, Hao Chen, Zhipeng Cai, Gang Yu, Kaixuan Wang, Xiaozhi Chen, Chunhua Shen

State-of-the-art (SOTA) monocular metric depth estimation methods can only handle a single camera model and are unable to perform mixed-data training due to the metric ambiguity.

Ranked #18 on Monocular Depth Estimation on NYU-Depth V2 (using extra training data)

Image Reconstruction Monocular Depth Estimation +1

Image Captions are Natural Prompts for Text-to-Image Models

1 code implementation17 Jul 2023 Shiye Lei, Hao Chen, Sen Zhang, Bo Zhao, DaCheng Tao

With the rapid development of Artificial Intelligence Generated Content (AIGC), it has become common practice in many learning tasks to train or fine-tune large models on synthetic data due to the data-scarcity and privacy leakage problems.

Image Captioning Image Generation

Dense Affinity Matching for Few-Shot Segmentation

no code implementations17 Jul 2023 Hao Chen, Yonghan Dong, Zheming Lu, Yunlong Yu, Yingming Li, Jungong Han, Zhongfei Zhang

Few-Shot Segmentation (FSS) aims to segment the novel class images with a few annotated samples.

Few-Shot Semantic Segmentation

Towards Generalizable Diabetic Retinopathy Grading in Unseen Domains

1 code implementation10 Jul 2023 Haoxuan Che, YuHan Cheng, Haibo Jin, Hao Chen

Diabetic Retinopathy (DR) is a common complication of diabetes and a leading cause of blindness worldwide.

Diabetic Retinopathy Grading Domain Generalization

A Survey on Evaluation of Large Language Models

1 code implementation6 Jul 2023 Yupeng Chang, Xu Wang, Jindong Wang, Yuan Wu, Linyi Yang, Kaijie Zhu, Hao Chen, Xiaoyuan Yi, Cunxiang Wang, Yidong Wang, Wei Ye, Yue Zhang, Yi Chang, Philip S. Yu, Qiang Yang, Xing Xie

Large language models (LLMs) are gaining increasing popularity in both academia and industry, owing to their unprecedented performance in various applications.

Ethics

Low-Light Enhancement in the Frequency Domain

no code implementations29 Jun 2023 Hao Chen, Zhi Jin

Hence, in this work, we propose a novel residual recurrent multi-wavelet convolutional neural network R2-MWCNN learned in the frequency domain that can simultaneously increase the image contrast and reduce noise signals well.

Image Enhancement object-detection +1

RSPrompter: Learning to Prompt for Remote Sensing Instance Segmentation based on Visual Foundation Model

1 code implementation28 Jun 2023 Keyan Chen, Chenyang Liu, Hao Chen, Haotian Zhang, Wenyuan Li, Zhengxia Zou, Zhenwei Shi

We also propose several ongoing derivatives for instance segmentation tasks, drawing on recent advancements within the SAM community, and compare their performance with RSPrompter.

Image Segmentation Instance Segmentation +2

Deep Omni-supervised Learning for Rib Fracture Detection from Chest Radiology Images

1 code implementation23 Jun 2023 Zhizhong Chai, Luyang Luo, Huangjing Lin, Pheng-Ann Heng, Hao Chen

To tackle this challenge, the literature on object detection has witnessed an increase of weakly-supervised and semi-supervised approaches, yet still lacks a unified framework that leverages various forms of fully-labeled, weakly-labeled, and unlabeled data.

object-detection Object Detection

Distributed Localization and Tracking Control for Nonholonomic Agents with Time-varying Bearing Formation

no code implementations19 Jun 2023 Huiming Li, Hao Chen, Xiangke Wang, Mengge Zhang, Lincheng Shen

This paper studies the bearing-based time-varying formation control problem for unicycle-type agents without bearing rigidity conditions.

Advancing Volumetric Medical Image Segmentation via Global-Local Masked Autoencoder

no code implementations15 Jun 2023 Jia-Xin Zhuang, Luyang Luo, Hao Chen

Masked autoencoder (MAE) is a promising self-supervised pre-training technique that can improve the representation learning of a neural network without human intervention.

Image Segmentation Representation Learning +2

Multimodal Optimal Transport-based Co-Attention Transformer with Global Structure Consistency for Survival Prediction

1 code implementation ICCV 2023 Yingxue Xu, Hao Chen

Survival prediction is a complicated ordinal regression task that aims to predict the ranking risk of death, which generally benefits from the integration of histology and genomic data.

Survival Analysis Survival Prediction +1

PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization

2 code implementations8 Jun 2023 Yidong Wang, Zhuohao Yu, Zhengran Zeng, Linyi Yang, Cunxiang Wang, Hao Chen, Chaoya Jiang, Rui Xie, Jindong Wang, Xing Xie, Wei Ye, Shikun Zhang, Yue Zhang

To ensure the reliability of PandaLM, we collect a diverse human-annotated test dataset, where all contexts are generated by humans and labels are aligned with human preferences.

Language Modelling Large Language Model

Retrieval-Enhanced Visual Prompt Learning for Few-shot Classification

no code implementations4 Jun 2023 Jintao Rong, Hao Chen, Tianxiao Chen, Linlin Ou, Xinyi Yu, Yifan Liu

Prompt learning has become a popular approach for adapting large vision-language models, such as CLIP, to downstream tasks.

Classification Domain Generalization +3

Few-Shot Speaker Identification Using Lightweight Prototypical Network with Feature Grouping and Interaction

no code implementations31 May 2023 Yanxiong Li, Hao Chen, Wenchang Cao, Qisheng Huang, Qianhua He

In the proposed embedding module, audio feature of each speech sample is split into several low-dimensional feature subsets that are transformed by a recurrent convolutional block in parallel.

Speaker Identification

Medication Recommendation via Domain Knowledge Informed Deep Learning

no code implementations31 May 2023 Sicen Liu, Xiaolong Wang, Xianbing Zhao, Hao Chen

However, most of them neglect incorporating domain knowledge according to the clinical manifestations in the EHR of the patient.

Machine learning with tree tensor networks, CP rank constraints, and tensor dropout

no code implementations30 May 2023 Hao Chen, Thomas Barthel

As suggested in [arXiv:2205. 15296] in the context of quantum many-body physics, computation costs can be further substantially reduced by imposing constraints on the canonical polyadic (CP) rank of the tensors in such networks.

Image Classification Tensor Networks

Scale-aware Super-resolution Network with Dual Affinity Learning for Lesion Segmentation from Medical Images

no code implementations30 May 2023 Yanwen Li, Luyang Luo, Huangjing Lin, Pheng-Ann Heng, Hao Chen

To guide the segmentation branch to learn from richer high-resolution features, we propose a feature affinity module and a scale affinity module to enhance the multi-task learning of the dual branches.

Image Segmentation Image Super-Resolution +4

Learning Conditional Attributes for Compositional Zero-Shot Learning

1 code implementation CVPR 2023 Qingsheng Wang, Lingqiao Liu, Chenchen Jing, Hao Chen, Guoqiang Liang, Peng Wang, Chunhua Shen

Compositional Zero-Shot Learning (CZSL) aims to train models to recognize novel compositional concepts based on learned concepts such as attribute-object combinations.

Attribute Compositional Zero-Shot Learning

LoRAPrune: Pruning Meets Low-Rank Parameter-Efficient Fine-Tuning

no code implementations28 May 2023 Mingyang Zhang, Hao Chen, Chunhua Shen, Zhen Yang, Linlin Ou, Xinyi Yu, Bohan Zhuang

This is due to their utilization of unstructured pruning on LPMs, impeding the merging of LoRA weights, or their dependence on the gradients of pre-trained weights to guide pruning, which can impose significant memory overhead.

Model Compression Network Pruning

Continuous Cross-resolution Remote Sensing Image Change Detection

1 code implementation24 May 2023 Hao Chen, Haotian Zhang, Keyan Chen, Chenyao Zhou, Song Chen, Zhengxia Zou, Zhenwei Shi

Toward continuous cross-resolution CD, we propose scale-invariant learning to enforce the model consistently predicting HR results given synthesized samples of varying resolution differences.

Change Detection

Understanding Programs by Exploiting (Fuzzing) Test Cases

1 code implementation23 May 2023 Jianyu Zhao, Yuyang Rong, Yiwen Guo, Yifeng He, Hao Chen

The effectiveness of the proposed method is verified on two program understanding tasks including code clone detection and code classification, and it outperforms current state-of-the-arts by large margins.

Clone Detection Code Classification +2

Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching

1 code implementation22 May 2023 Yang Liu, Muzhi Zhu, Hengtao Li, Hao Chen, Xinlong Wang, Chunhua Shen

In this work, we present Matcher, a novel perception paradigm that utilizes off-the-shelf vision foundation models to address various perception tasks.

Segmentation Semantic Segmentation

Multi-channel Integrated Recommendation with Exposure Constraints

no code implementations21 May 2023 Yue Xu, Qijie Shen, Jianwen Yin, Zengde Deng, Dimin Wang, Hao Chen, Lixiang Lai, Tao Zhuang, Junfeng Ge

Integrated recommendation, which aims at jointly recommending heterogeneous items from different channels in a main feed, has been widely applied to various online platforms.

Recommendation Systems

Multi-factor Sequential Re-ranking with Perception-Aware Diversification

no code implementations21 May 2023 Yue Xu, Hao Chen, Zefan Wang, Jianwen Yin, Qijie Shen, Dimin Wang, Feiran Huang, Lixiang Lai, Tao Zhuang, Junfeng Ge, Xia Hu

Feed recommendation systems, which recommend a sequence of items for users to browse and interact with, have gained significant popularity in practical applications.

Graph Clustering Recommendation Systems +1

Maybe Only 0.5% Data is Needed: A Preliminary Exploration of Low Training Data Instruction Tuning

no code implementations16 May 2023 Hao Chen, Yiming Zhang, Qi Zhang, Hantao Yang, Xiaomeng Hu, Xuetao Ma, Yifan Yanggong, Junbo Zhao

Instruction tuning for large language models (LLMs) has gained attention from researchers due to its ability to unlock the potential of LLMs in following instructions.

Reference-based OCT Angiogram Super-resolution with Learnable Texture Generation

no code implementations10 May 2023 Yuyan Ruan, Dawei Yang, Ziqi Tang, An Ran Ran, Carol Y. Cheung, Hao Chen

The key difference between the proposed method and traditional RefSR models is that the textures used during inference are generated by the LTG instead of being searched from a single reference image.

Reference-based Super-Resolution Texture Synthesis

Revolutionizing Agrifood Systems with Artificial Intelligence: A Survey

no code implementations3 May 2023 Tao Chen, Liang Lv, Di Wang, Jing Zhang, Yue Yang, Zeyang Zhao, Chen Wang, Xiaowei Guo, Hao Chen, Qingye Wang, Yufei Xu, Qiming Zhang, Bo Du, Liangpei Zhang, DaCheng Tao

With the world population rapidly increasing, transforming our agrifood systems to be more productive, efficient, safe, and sustainable is crucial to mitigate potential food shortages.

Improving Adversarial Transferability via Intermediate-level Perturbation Decay

2 code implementations NeurIPS 2023 Qizhang Li, Yiwen Guo, WangMeng Zuo, Hao Chen

In particular, the proposed method, named intermediate-level perturbation decay (ILPD), encourages the intermediate-level perturbation to be in an effective adversarial direction and to possess a great magnitude simultaneously.

Harnessing the Power of Text-image Contrastive Models for Automatic Detection of Online Misinformation

no code implementations19 Apr 2023 Hao Chen, Peng Zheng, Xin Wang, Shu Hu, Bin Zhu, Jinrong Hu, Xi Wu, Siwei Lyu

As growing usage of social media websites in the recent decades, the amount of news articles spreading online rapidly, resulting in an unprecedented scale of potentially fraudulent information.

Contrastive Learning Misinformation +1

CrossFusion: Interleaving Cross-modal Complementation for Noise-resistant 3D Object Detection

no code implementations19 Apr 2023 Yang Yang, Weijie Ma, Hao Chen, Linlin Ou, Xinyi Yu

The combination of LiDAR and camera modalities is proven to be necessary and typical for 3D object detection according to recent studies.

3D Object Detection Depth Estimation +1

Learning to Fuse Monocular and Multi-view Cues for Multi-frame Depth Estimation in Dynamic Scenes

1 code implementation CVPR 2023 Rui Li, Dong Gong, Wei Yin, Hao Chen, Yu Zhu, Kaixuan Wang, Xiaozhi Chen, Jinqiu Sun, Yanning Zhang

To let the geometric perception learned from multi-view cues in static areas propagate to the monocular representation in dynamic areas and let monocular cues enhance the representation of multi-view cost volume, we propose a cross-cue fusion (CCF) module, which includes the cross-cue attention (CCA) to encode the spatially non-local relative intra-relations from each source to enhance the representation of the other.

Autonomous Driving Depth Estimation

Scale Federated Learning for Label Set Mismatch in Medical Image Classification

1 code implementation14 Apr 2023 Zhipeng Deng, Luyang Luo, Hao Chen

Federated learning (FL) has been introduced to the healthcare domain as a decentralized learning paradigm that allows multiple parties to train a model collaboratively without privacy leakage.

Federated Learning Image Classification +2

Exploring Vision-Language Models for Imbalanced Learning

1 code implementation4 Apr 2023 Yidong Wang, Zhuohao Yu, Jindong Wang, Qiang Heng, Hao Chen, Wei Ye, Rui Xie, Xing Xie, Shikun Zhang

However, their performance on imbalanced dataset is relatively poor, where the distribution of classes in the training dataset is skewed, leading to poor performance in predicting minority classes.

Zero-Shot Learning

Learning Robust Medical Image Segmentation from Multi-source Annotations

no code implementations2 Apr 2023 Yifeng Wang, Luyang Luo, Mingxiang Wu, Qiong Wang, Hao Chen

Learning segmentation networks from multi-source annotations remains a challenge due to the uncertainties brought by the variance of annotations and the quality of images.

Image Segmentation MRI segmentation +2

Recover Triggered States: Protect Model Against Backdoor Attack in Reinforcement Learning

1 code implementation1 Apr 2023 Hao Chen, Chen Gong, Yizhe WANG, Xinwen Hou

This paper proposes the Recovery Triggered States (RTS) method, a novel approach that effectively protects the victim agents from backdoor attacks.

Backdoor Attack reinforcement-learning

Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models

1 code implementation30 Mar 2023 Wen Wang, Yan Jiang, Kangyang Xie, Zide Liu, Hao Chen, Yue Cao, Xinlong Wang, Chunhua Shen

Our vid2vid-zero leverages off-the-shelf image diffusion models, and doesn't require training on any video.

Image Generation Video Alignment +1

Image Quality-aware Diagnosis via Meta-knowledge Co-embedding

1 code implementation CVPR 2023 Haoxuan Che, Siyu Chen, Hao Chen

Medical images usually suffer from image degradation in clinical practice, leading to decreased performance of deep learning-based models.

Image Quality Assessment Meta-Learning

DoNet: Deep De-overlapping Network for Cytology Instance Segmentation

1 code implementation CVPR 2023 Hao Jiang, Rushan Zhang, Yanning Zhou, Yumeng Wang, Hao Chen

Cell instance segmentation in cytology images has significant importance for biology analysis and cancer screening, while remains challenging due to 1) the extensive overlapping translucent cell clusters that cause the ambiguous boundaries, and 2) the confusion of mimics and debris as nuclei.

Instance Segmentation Region Proposal +2

Adversarial Attack and Defense for Medical Image Analysis: Methods and Applications

no code implementations24 Mar 2023 Junhao Dong, Junxi Chen, Xiaohua Xie, JianHuang Lai, Hao Chen

In this exposition, we present a comprehensive survey on recent advances in adversarial attack and defense for medical image analysis with a novel taxonomy in terms of the application scenario.

Adversarial Attack Medical Diagnosis

Few Shot Medical Image Segmentation with Cross Attention Transformer

1 code implementation24 Mar 2023 Yi Lin, Yufan Chen, Kwang-Ting Cheng, Hao Chen

Our proposed network mines the correlations between the support image and query image, limiting them to focus only on useful foreground information and boosting the representation capacity of both the support prototype and query features.

Few-Shot Learning Image Segmentation +3

Towards Scalable Neural Representation for Diverse Videos

no code implementations CVPR 2023 Bo He, Xitong Yang, Hanyu Wang, Zuxuan Wu, Hao Chen, Shuaiyi Huang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava

Implicit neural representations (INR) have gained increasing attention in representing 3D scenes and images, and have been recently applied to encode videos (e. g., NeRV, E-NeRV).

Action Recognition Video Compression

Two-level Graph Network for Few-Shot Class-Incremental Learning

no code implementations24 Mar 2023 Hao Chen, Linyan Li, Fan Lyu, Fuyuan Hu, Zhenping Xia, Fenglei Xu

Class-level graph network aims to mitigate the semantic conflict between prototype features of new classes and old classes.

Few-Shot Class-Incremental Learning Incremental Learning +1

Boosting Convolution with Efficient MLP-Permutation for Volumetric Medical Image Segmentation

no code implementations23 Mar 2023 Yi Lin, Xiao Fang, Dong Zhang, Kwang-Ting Cheng, Hao Chen

Recently, the advent of vision Transformer (ViT) has brought substantial advancements in 3D dataset benchmarks, particularly in 3D volumetric medical image segmentation (Vol-MedSeg).

Image Segmentation Semantic Segmentation +1

Label-Efficient Deep Learning in Medical Image Analysis: Challenges and Future Directions

no code implementations22 Mar 2023 Cheng Jin, Zhengrui Guo, Yi Lin, Luyang Luo, Hao Chen

Thus, label-efficient deep learning methods are developed to make comprehensive use of the labeled data as well as the abundance of unlabeled and weak-labeled data.

Implicit Ray-Transformers for Multi-view Remote Sensing Image Segmentation

no code implementations15 Mar 2023 Zipeng Qi, Hao Chen, Chenyang Liu, Zhenwei Shi, Zhengxia Zou

In the first stage, we optimize a neural field to encode the color and 3D structure of the remote sensing scene based on multi-view images.

Image Segmentation Scene Segmentation +1

A Monkey Swing Counting Algorithm Based on Object Detection

no code implementations12 Mar 2023 Hao Chen, Zhe-Ming Lu, Jie Liu

This paper focuses on proposing a deep learning-based monkey swing counting algorithm.

object-detection Object Detection

Traj-MAE: Masked Autoencoders for Trajectory Prediction

no code implementations ICCV 2023 Hao Chen, Jiaze Wang, Kun Shao, Furui Liu, Jianye Hao, Chenyong Guan, Guangyong Chen, Pheng-Ann Heng

Specifically, our Traj-MAE employs diverse masking strategies to pre-train the trajectory encoder and map encoder, allowing for the capture of social and temporal information among agents while leveraging the effect of environment from multiple granularities.

Autonomous Driving Trajectory Prediction

Multi-Content Interaction Network for Few-Shot Segmentation

no code implementations11 Mar 2023 Hao Chen, Yunlong Yu, Yonghan Dong, Zheming Lu, Yingming Li, Zhongfei Zhang

Few-Shot Segmentation (FSS) is challenging for limited support images and large intra-class appearance discrepancies.

Knowledge Transfer via Multi-Head Feature Adaptation for Whole Slide Image Classification

no code implementations10 Mar 2023 Conghao Xiong, Yi Lin, Hao Chen, Joseph Sung, Irwin King

Transferring prior knowledge from a source domain to the same or similar target domain can greatly enhance the performance of models on the target domain.

Classification Image Classification +1

Parallel Computing Based Solution for Reliability-Constrained Distribution Network Planning

no code implementations9 Mar 2023 Yaqi Sun, Wenchuan Wu, Yi Lin, Hai Huang, Hao Chen

The main goal of distribution network (DN) expansion planning is essentially to achieve minimal investment constrained with specified reliability requirements.

On the Robustness of ChatGPT: An Adversarial and Out-of-distribution Perspective

1 code implementation22 Feb 2023 Jindong Wang, Xixu Hu, Wenxin Hou, Hao Chen, Runkai Zheng, Yidong Wang, Linyi Yang, Haojun Huang, Wei Ye, Xiubo Geng, Binxin Jiao, Yue Zhang, Xing Xie

In this paper, we conduct a thorough evaluation of the robustness of ChatGPT from the adversarial and out-of-distribution (OOD) perspective.

Adversarial Robustness Chatbot +1

Hierarchical Cross-modal Transformer for RGB-D Salient Object Detection

no code implementations16 Feb 2023 Hao Chen, Feihong Shen

Most of existing RGB-D salient object detection (SOD) methods follow the CNN-based paradigm, which is unable to model long-range dependencies across space and modalities due to the natural locality of CNNs.

object-detection RGB-D Salient Object Detection +1

Cannot find the paper you are looking for? You can Submit a new open access paper.