Search Results for author: Rui Feng

Found 37 papers, 14 papers with code

QMix: Quality-aware Learning with Mixed Noise for Robust Retinal Disease Diagnosis

no code implementations • 8 Apr 2024 • Junlin Hou, Jilan Xu, Rui Feng, Hao Chen

Previous noise learning methods mainly considered noise arising from images being mislabeled, i. e. label noise, assuming that all mislabeled images are of high image quality.

Paper
Add Code

Domain Adaptation Using Pseudo Labels for COVID-19 Detection

no code implementations • 18 Mar 2024 • Runtian Yuan, Qingqiu Li, Junlin Hou, Jilan Xu, Yuejie Zhang, Rui Feng, Hao Chen

In response to the need for rapid and accurate COVID-19 diagnosis during the global pandemic, we present a two-stage framework that leverages pseudo labels for domain adaptation to enhance the detection of COVID-19 from CT scans.

COVID-19 Diagnosis Domain Adaptation +1

Paper
Add Code

Advancing COVID-19 Detection in 3D CT Scans

no code implementations • 18 Mar 2024 • Qingqiu Li, Runtian Yuan, Junlin Hou, Jilan Xu, Yuejie Zhang, Rui Feng, Hao Chen

To make a more accurate diagnosis of COVID-19, we propose a straightforward yet effective model.

Paper
Add Code

Anatomical Structure-Guided Medical Vision-Language Pre-training

no code implementations • 14 Mar 2024 • Qingqiu Li, Xiaohan Yan, Jilan Xu, Runtian Yuan, Yuejie Zhang, Rui Feng, Quanli Shen, Xiaobo Zhang, Shujun Wang

For finding and existence, we regard them as image tags, applying an image-tag recognition decoder to associate image features with their respective tags within each sample and constructing soft labels for contrastive learning to improve the semantic association of different image-report pairs.

Contrastive Learning Representation Learning +2

Paper
Add Code

Retrieval-Augmented Egocentric Video Captioning

no code implementations • 1 Jan 2024 • Jilan Xu, Yifei HUANG, Junlin Hou, Guo Chen, Yuejie Zhang, Rui Feng, Weidi Xie

In this paper, (1) we develop EgoInstructor, a retrieval-augmented multimodal captioning model that automatically retrieves semantically relevant third-person instructional videos to enhance the video captioning of egocentric videos.

Representation Learning Retrieval +1

Paper
Add Code

Large Language Models are Complex Table Parsers

no code implementations • 13 Dec 2023 • Bowen Zhao, Changkai Ji, Yuejie Zhang, Wen He, Yingwen Wang, Qing Wang, Rui Feng, Xiaobo Zhang

With the Generative Pre-trained Transformer 3. 5 (GPT-3. 5) exhibiting remarkable reasoning and comprehension abilities in Natural Language Processing (NLP), most Question Answering (QA) research has primarily centered around general QA tasks based on GPT, neglecting the specific challenges posed by Complex Table QA.

Logical Reasoning Question Answering

Paper
Add Code

DeepPointMap: Advancing LiDAR SLAM with Unified Neural Descriptors

no code implementations • 5 Dec 2023 • Xiaze Zhang, Ziheng Ding, Qi Jing, Yuejie Zhang, Wenchao Ding, Rui Feng

Point clouds have shown significant potential in various domains, including Simultaneous Localization and Mapping (SLAM).

Simultaneous Localization and Mapping

Paper
Add Code

A WINNER+ Based 3-D Non-Stationary Wideband MIMO Channel Model

no code implementations • 1 Dec 2023 • Ji Bian, Jian Sun, Cheng-Xiang Wang, Rui Feng, Jie Huang, Yang Yang, Minggao Zhang

In this paper, a three-dimensional (3-D) non-stationary wideband multiple-input multiple-output (MIMO) channel model based on the WINNER+ channel model is proposed.

Paper
Add Code

Enhanced Knowledge Injection for Radiology Report Generation

no code implementations • 1 Nov 2023 • Qingqiu Li, Jilan Xu, Runtian Yuan, Mohan Chen, Yuejie Zhang, Rui Feng, Xiaobo Zhang, Shang Gao

Automatic generation of radiology reports holds crucial clinical value, as it can alleviate substantial workload on radiologists and remind less experienced ones of potential anomalies.

Image Captioning Retrieval

Paper
Add Code

Open-Set Image Tagging with Multi-Grained Text Supervision

2 code implementations • 23 Oct 2023 • Xinyu Huang, Yi-Jie Huang, Youcai Zhang, Weiwei Tian, Rui Feng, Yuejie Zhang, Yanchun Xie, Yaqian Li, Lei Zhang

Specifically, for predefined commonly used tag categories, RAM++ showcases 10. 2 mAP and 15. 4 mAP enhancements over CLIP on OpenImages and ImageNet.

Human-Object Interaction Detection Open Set Learning +1

2,433

Paper
Code

PolyGET: Accelerating Polymer Simulations by Accurate and Generalizable Forcefield with Equivariant Transformer

no code implementations • 1 Sep 2023 • Rui Feng, Huan Tran, Aubrey Toland, Binghong Chen, Qi Zhu, Rampi Ramprasad, Chao Zhang

Machine learning (ML) forcefields have been developed to achieve both the accuracy of ab initio methods and the efficiency of empirical force fields.

Paper
Add Code

DRAC: Diabetic Retinopathy Analysis Challenge with Ultra-Wide Optical Coherence Tomography Angiography Images

no code implementations • 5 Apr 2023 • Bo Qian, Hao Chen, Xiangning Wang, Haoxuan Che, Gitaek Kwon, Jaeyoung Kim, Sungjin Choi, Seoyoung Shin, Felix Krause, Markus Unterdechler, Junlin Hou, Rui Feng, Yihao Li, Mostafa El Habib Daho, Qiang Wu, Ping Zhang, Xiaokang Yang, Yiyu Cai, Weiping Jia, Huating Li, Bin Sheng

Computer-assisted automatic analysis of diabetic retinopathy (DR) is of great importance in reducing the risks of vision loss and even blindness.

Benchmarking Data Augmentation +1

Paper
Add Code

Tag2Text: Guiding Vision-Language Model via Image Tagging

2 code implementations • 10 Mar 2023 • Xinyu Huang, Youcai Zhang, Jinyu Ma, Weiwei Tian, Rui Feng, Yuejie Zhang, Yaqian Li, Yandong Guo, Lei Zhang

This paper presents Tag2Text, a vision language pre-training (VLP) framework, which introduces image tagging into vision-language models to guide the learning of visual-linguistic features.

Language Modelling TAG

2,433

Paper
Code

Learning Open-vocabulary Semantic Segmentation Models From Natural Language Supervision

1 code implementation • CVPR 2023 • Jilan Xu, Junlin Hou, Yuejie Zhang, Rui Feng, Yi Wang, Yu Qiao, Weidi Xie

The former aims to infer all masked entities in the caption given the group tokens, that enables the model to learn fine-grained alignment between visual groups and text entities.

Open Vocabulary Semantic Segmentation Semantic Segmentation

Paper
Code

Cross-Field Transformer for Diabetic Retinopathy Grading on Two-field Fundus Images

1 code implementation • 26 Nov 2022 • Junlin Hou, Jilan Xu, Fan Xiao, Rui-Wei Zhao, Yuejie Zhang, Haidong Zou, Lina Lu, Wenwen Xue, Rui Feng

However, automatic DR grading based on two-field fundus photography remains a challenging task due to the lack of publicly available datasets and effective fusion strategies.

Diabetic Retinopathy Grading Position

Paper
Code

CMC v2: Towards More Accurate COVID-19 Detection with Discriminative Video Priors

no code implementations • 26 Nov 2022 • Junlin Hou, Jilan Xu, Nan Zhang, Yi Wang, Yuejie Zhang, Xiaobo Zhang, Rui Feng

This paper presents our solution for the 2nd COVID-19 Competition, occurring in the framework of the AIMIA Workshop at the European Conference on Computer Vision (ECCV 2022).

COVID-19 Diagnosis Representation Learning

Paper
Add Code

Boosting COVID-19 Severity Detection with Infection-aware Contrastive Mixup Classification

no code implementations • 26 Nov 2022 • Junlin Hou, Jilan Xu, Nan Zhang, Yuejie Zhang, Xiaobo Zhang, Rui Feng

In our approach, we devise a novel infection-aware 3D Contrastive Mixup Classification network for severity grading.

Lesion Segmentation Segmentation

Paper
Add Code

End-to-End Stochastic Optimization with Energy-Based Model

1 code implementation • 25 Nov 2022 • Lingkai Kong, Jiaming Cui, Yuchen Zhuang, Rui Feng, B. Aditya Prakash, Chao Zhang

Decision-focused learning (DFL) was recently proposed for stochastic optimization problems that involve unknown parameters.

Scheduling Stochastic Optimization

Paper
Code

Deep-OCTA: Ensemble Deep Learning Approaches for Diabetic Retinopathy Analysis on OCTA Images

1 code implementation • 2 Oct 2022 • Junlin Hou, Fan Xiao, Jilan Xu, Yuejie Zhang, Haidong Zou, Rui Feng

In the image quality assessment task, we create an ensemble of InceptionV3, SE-ResNeXt, and Vision Transformer models.

Data Augmentation Image Quality Assessment

Paper
Code

IDEA: Increasing Text Diversity via Online Multi-Label Recognition for Vision-Language Pre-training

1 code implementation • 12 Jul 2022 • Xinyu Huang, Youcai Zhang, Ying Cheng, Weiwei Tian, RuiWei Zhao, Rui Feng, Yuejie Zhang, Yaqian Li, Yandong Guo, Xiaobo Zhang

However, the image-text pairs co-occurrent on the Internet typically lack explicit alignment information, which is suboptimal for VLP.

Multi-Label Learning Object +1

Paper
Code

Modality-Aware Contrastive Instance Learning with Self-Distillation for Weakly-Supervised Audio-Visual Violence Detection

1 code implementation • 12 Jul 2022 • Jiashuo Yu, Jinyu Liu, Ying Cheng, Rui Feng, Yuejie Zhang

In this paper, we analyze the modality asynchrony and undifferentiated instances phenomena of the multiple instance learning (MIL) procedure, and further investigate its negative impact on weakly-supervised audio-visual learning.

Ranked #5 on Anomaly Detection In Surveillance Videos on XD-Violence

Anomaly Detection In Surveillance Videos audio-visual learning +1

Paper
Code

Learning Music-Dance Representations through Explicit-Implicit Rhythm Synchronization

no code implementations • 7 Jul 2022 • Jiashuo Yu, Junfu Pu, Ying Cheng, Rui Feng, Ying Shan

Although audio-visual representation has been proved to be applicable in many downstream tasks, the representation of dancing videos, which is more specific and always accompanied by music with complex auditory contents, remains challenging and uninvestigated.

Contrastive Learning Representation Learning +2

Paper
Add Code

FDVTS's Solution for 2nd COV19D Competition on COVID-19 Detection and Severity Analysis

no code implementations • 5 Jul 2022 • Junlin Hou, Jilan Xu, Rui Feng, Yuejie Zhang

This paper presents our solution for the 2nd COVID-19 Competition, occurring in the framework of the AIMIA Workshop in the European Conference on Computer Vision (ECCV 2022).

Classification COVID-19 Diagnosis +1

Paper
Add Code

Pyramid Region-based Slot Attention Network for Temporal Action Proposal Generation

1 code implementation • 21 Jun 2022 • Shuaicheng Li, Feng Zhang, Rui-Wei Zhao, Rui Feng, Kunlin Yang, Lingbo Liu, Jun Hou

Based on PRSlot modules, we present a novel Pyramid Region-based Slot Attention Network termed PRSA-Net to learn a unified visual representation with rich temporal and semantic context for better proposal generation.

Action Detection Temporal Action Proposal Generation

Paper
Code

Reconfigurable intelligent surfaces: Channel characterization and modeling

no code implementations • 6 Jun 2022 • Jie Huang, Cheng-Xiang Wang, Yingzhuo Sun, Rui Feng, Jialing Huang, Bolun Guo, Zhimeng Zhong, Tie Jun Cui

Reconfigurable intelligent surfaces (RISs) are two dimensional (2D) metasurfaces which can intelligently manipulate electromagnetic waves by low-cost near passive reflecting elements.

Paper
Add Code

CREAM: Weakly Supervised Object Localization via Class RE-Activation Mapping

1 code implementation • CVPR 2022 • Jilan Xu, Junlin Hou, Yuejie Zhang, Rui Feng, Rui-Wei Zhao, Tao Zhang, Xuequan Lu, Shang Gao

In this paper, we empirically prove that this problem is associated with the mixup of the activation values between less discriminative foreground regions and the background.

Clustering Object +1

Paper
Code

Self-Supervised Video Representation Learning with Motion-Contrastive Perception

no code implementations • 10 Apr 2022 • Jinyu Liu, Ying Cheng, Yuejie Zhang, Rui-Wei Zhao, Rui Feng

Visual-only self-supervised learning has achieved significant improvement in video representation learning.

Contrastive Learning Representation Learning +1

Paper
Add Code

CERES: Pretraining of Graph-Conditioned Transformer for Semi-Structured Session Data

no code implementations • NAACL 2022 • Rui Feng, Chen Luo, Qingyu Yin, Bing Yin, Tuo Zhao, Chao Zhang

User sessions empower many search and recommendation tasks on a daily basis.

Entity Linking Self-Supervised Learning +1

Paper
Add Code

Simple and Robust Loss Design for Multi-Label Learning with Missing Labels

2 code implementations • 13 Dec 2021 • Youcai Zhang, Yuhao Cheng, Xinyu Huang, Fei Wen, Rui Feng, Yaqian Li, Yandong Guo

Multi-label learning in the presence of missing labels (MLML) is a challenging problem.

Missing Labels Multi-Label Image Classification

Paper
Code

MM-Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing

1 code implementation • 24 Nov 2021 • Jiashuo Yu, Ying Cheng, Rui-Wei Zhao, Rui Feng, Yuejie Zhang

Recognizing and localizing events in videos is a fundamental task for video understanding.

audio-visual event localization Video Understanding

Paper
Code

MPN: Multimodal Parallel Network for Audio-Visual Event Localization

no code implementations • 7 Apr 2021 • Jiashuo Yu, Ying Cheng, Rui Feng

The localization subnetwork consists of Multimodal Bottleneck Attention Module (MBAM), which is designed to extract fine-grained segment-level contents.

audio-visual event localization General Classification

Paper
Add Code

Probing and Fine-tuning Reading Comprehension Models for Few-shot Event Extraction

no code implementations • 21 Oct 2020 • Rui Feng, Jie Yuan, Chao Zhang

We argue that the event extraction models so trained are inherently label-hungry, and can generalize poorly across domains and text genres. We propose a reading comprehension framework for event extraction. Specifically, we formulate event detection as a textual entailment prediction problem, and argument detection as a question answer-ing problem.

Event Detection Event Extraction +2

Paper
Add Code

Transformer-Based Neural Text Generation with Syntactic Guidance

1 code implementation • 5 Oct 2020 • Yinghao Li, Rui Feng, Isaac Rehg, Chao Zhang

We study the problem of using (partial) constituency parse trees as syntactic guidance for controlled text generation.

Text Generation

Paper
Code

Look, Listen, and Attend: Co-Attention Network for Self-Supervised Audio-Visual Representation Learning

no code implementations • 13 Aug 2020 • Ying Cheng, Ruize Wang, Zhihao Pan, Rui Feng, Yuejie Zhang

When watching videos, the occurrence of a visual event is often accompanied by an audio event, e. g., the voice of lip motion, the music of playing instruments.

Action Recognition Audio-Visual Synchronization +1

Paper
Add Code

Learning Error-Driven Curriculum for Crowd Counting

no code implementations • 19 Jul 2020 • Wenxi Li, Zhuoqun Cao, Qian Wang, Songjian Chen, Rui Feng

Density regression has been widely employed in crowd counting.

Crowd Counting

Paper
Add Code

Learning Fair Representations via an Adversarial Framework

1 code implementation • 30 Apr 2019 • Rui Feng, Yang Yang, Yuehan Lyu, Chenhao Tan, Yizhou Sun, Chunping Wang

Fairness has become a central issue for our research community as classification algorithms are adopted in societally critical domains such as recidivism prediction and loan approval.

Classification Fairness +1

Paper
Code

Representation Learning for Scale-free Networks

no code implementations • 29 Nov 2017 • Rui Feng, Yang Yang, Wenjie Hu, Fei Wu, Yueting Zhuang

Existing network embedding works primarily focus on preserving the microscopic structure, such as the first- and second-order proximity of vertexes, while the macroscopic scale-free property is largely ignored.

Link Prediction Network Embedding

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.