Search Results for author: YiFan Li

Found 34 papers, 10 papers with code

Facial Affective Behavior Analysis with Instruction Tuning

no code implementations • 7 Apr 2024 • YiFan Li, Anh Dao, Wentao Bao, Zhen Tan, Tianlong Chen, Huan Liu, Yu Kong

Our initiative on the dataset and benchmarks reveals the nature and rationale of facial affective behaviors, i.e., fine-grained facial movement, interpretability, and reasoning.

Instruction Following

Weakly-Supervised 3D Scene Graph Generation via Visual-Linguistic Assisted Pseudo-labeling

no code implementations • 3 Apr 2024 • Xu Wang, YiFan Li, Qiudan Zhang, Wenhui Wu, Mark Junjie Li, Jianmin Jiang

However, previous 3D scene graph generation methods utilize a fully supervised learning manner and require a large amount of entity-level annotation data of objects and relations, which is extremely resource-consuming and tedious to obtain.

3D Scene Graph Generation, Graph Generation

Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking Multimodal Large Language Models

1 code implementation • 14 Mar 2024 • YiFan Li, Hangyu Guo, Kun Zhou, Wayne Xin Zhao, Ji-Rong Wen

In this paper, we study the harmlessness alignment problem of multimodal large language models (MLLMs).

The Wolf Within: Covert Injection of Malice into MLLM Societies via an MLLM Operative

1 code implementation • 20 Feb 2024 • Zhen Tan, Chengshuai Zhao, Raha Moraffah, YiFan Li, Yu Kong, Tianlong Chen, Huan Liu

Unlike direct harmful output generation for MLLMs, our research demonstrates how a single MLLM agent can be subtly influenced to generate prompts that, in turn, induce other MLLM agents in the society to output malicious content.

Misinformation

Repositioning the Subject within Image

1 code implementation • 30 Jan 2024 • Yikai Wang, Chenjie Cao, Ke Fan, Qiaole Dong, YiFan Li, Xiangyang Xue, Yanwei Fu

Our research reveals that the fundamental sub-tasks of subject repositioning, which include filling the void left by the repositioned subject, reconstructing obscured portions of the subject and blending the subject to be consistent with surrounding areas, can be effectively reformulated as a unified, prompt-guided inpainting task.

Image Generation, Image Manipulation

Temporal Adaptive RGBT Tracking with Modality Prompt

no code implementations • 2 Jan 2024 • Hongyu Wang, Xiaotao Liu, YiFan Li, Meng Sun, Dian Yuan, Jing Liu

RGBT tracking has been widely used in various fields such as robotics, surveillance processing, and autonomous driving.

Ranked #2 on Rgb-T Tracking on RGBT210

Autonomous Driving, Rgb-T Tracking

CSGNN: Conquering Noisy Node labels via Dynamic Class-wise Selection

no code implementations • 20 Nov 2023 • YiFan Li, Zhen Tan, Kai Shu, Zongsheng Cao, Yu Kong, Huan Liu

Graph Neural Networks (GNNs) have emerged as a powerful tool for representation learning on graphs, but they often suffer from overfitting and label noise issues, especially when the data is scarce or imbalanced.

Memorization, Representation Learning

A Novel Tree Model-based DNN to Achieve a High-Resolution DOA Estimation via Massive MIMO receive array

no code implementations • 15 Nov 2023 • YiFan Li, Feng Shu, Jun Zou, Wei Gao, Yaoliang Song, Jiangzhou Wang

To satisfy the high-resolution requirements of direction-of-arrival (DOA) estimation, conventional deep neural network (DNN)-based methods using the grid idea need to significantly increase the number of output classifications and also incur very high model complexity.

WALL-E: Embodied Robotic WAiter Load Lifting with Large Language Model

no code implementations • 30 Aug 2023 • Tianyu Wang, YiFan Li, Haitao Lin, Xiangyang Xue, Yanwei Fu

The target instruction is then forwarded to a visual grounding system for object pose and size estimation, following which the robot grasps the object accordingly.

Language Modelling, Large Language Model

A New Heterogeneous Hybrid Massive MIMO Receiver with An Intrinsic Ability of Removing Phase Ambiguity of DOA Estimation via Machine Learning

no code implementations • 16 Aug 2023 • Feng Shu, Baihua Shi, YiWen Chen, Jiatong Bai, YiFan Li, Tingting Liu, Zhu Han

To address this problem, a new heterogeneous sub-connected hybrid analog and digital (HAD) MIMO structure is proposed with an intrinsic ability of removing phase ambiguity, and a corresponding new framework is developed to implement rapid high-precision DOA estimation using only a single time slot.

Clustering

MedAugment: Universal Automatic Data Augmentation Plug-in for Medical Image Analysis

1 code implementation • 30 Jun 2023 • Zhaoshan Liu, Qiujie Lv, YiFan Li, Ziduo Yang, Lei Shen

The prevalent DA approaches in MIA encompass conventional DA, synthetic DA, and automatic DA.

Data Augmentation

Evaluating Object Hallucination in Large Vision-Language Models

2 code implementations • 17 May 2023 • YiFan Li, Yifan Du, Kun Zhou, Jinpeng Wang, Wayne Xin Zhao, Ji-Rong Wen

Despite the promising progress on LVLMs, we find that LVLMs suffer from the hallucination problem, i.e., they tend to generate objects that are inconsistent with the target images in the descriptions.

Hallucination, Object

211

Diffusion-NAT: Self-Prompting Discrete Diffusion for Non-Autoregressive Text Generation

no code implementations • 6 May 2023 • Kun Zhou, YiFan Li, Wayne Xin Zhao, Ji-Rong Wen

To solve it, we propose Diffusion-NAT, which introduces discrete diffusion models (DDM) into NAR text-to-text generation and integrates BART to improve the performance.

Denoising, Text Generation

A Survey of Large Language Models

5 code implementations • 31 Mar 2023 • Wayne Xin Zhao, Kun Zhou, Junyi Li, Tianyi Tang, Xiaolei Wang, Yupeng Hou, Yingqian Min, Beichen Zhang, Junjie Zhang, Zican Dong, Yifan Du, Chen Yang, Yushuo Chen, Zhipeng Chen, Jinhao Jiang, Ruiyang Ren, YiFan Li, Xinyu Tang, Zikang Liu, Peiyu Liu, Jian-Yun Nie, Ji-Rong Wen

To discriminate the difference in parameter scale, the research community has coined the term large language models (LLM) for the PLMs of significant size.

Language Modelling

8,791

Diffusion Models for Non-autoregressive Text Generation: A Survey

1 code implementation • 12 Mar 2023 • YiFan Li, Kun Zhou, Wayne Xin Zhao, Ji-Rong Wen

In this survey, we review the recent progress in diffusion models for NAR text generation.

Text Generation

Low-Complexity Three-Dimensional AOA-Cross Geometric Center Localization Methods via Multi-UAV network

no code implementations • 21 Feb 2023 • Baihua Shi, YiFan Li, Guilu Wu, Shihao Yan, Feng Shu

It is not able to estimate angle noise but has lower computational complexity.

Position

DISC: Learning From Noisy Labels via Dynamic Instance-Specific Selection and Correction

1 code implementation • CVPR 2023 • YiFan Li, Hu Han, Shiguang Shan, Xilin Chen

Then we propose a dynamic threshold strategy for each instance, based on the momentum of each instance's memorization strength in previous epochs to select and correct noisy labeled data.

Learning with noisy labels, Memorization

Recent Advances in RecBole: Extensions with more Practical Considerations

1 code implementation • 28 Nov 2022 • Lanling Xu, Zhen Tian, Gaowei Zhang, Lei Wang, Junjie Zhang, Bowen Zheng, YiFan Li, Yupeng Hou, Xingyu Pan, Yushuo Chen, Wayne Xin Zhao, Xu Chen, Ji-Rong Wen

In order to show the recent update in RecBole, we write this technical report to introduce our latest improvements on RecBole.

3,178

A 2030 United States Macro Grid Unlocking Geographical Diversity to Accomplish Clean Energy Goals

no code implementations • 19 Nov 2022 • Yixing Xu, Daniel Olsen, Bainan Xia, Dan Livengood, Victoria Hunt, YiFan Li, Lane Smith

Some U.S. states have set clean energy goals and targets in an effort to decarbonize their electricity sectors.

Deep Learning Based DOA Estimation for Hybrid Massive MIMO Receive Array with Overlapped Subarrays

no code implementations • 11 Sep 2022 • YiFan Li, Baihua Shi, Feng Shu, Yaoliang Song, Jiangzhou Wang

To improve the accuracy of direction-of-arrival (DOA) estimation, a deep learning (DL)-based method called CDAE-DNN is proposed for hybrid analog and digital (HAD) massive MIMO receive array with overlapped subarray (OSA) architecture in this paper.

Recent Progress in Transformer-based Medical Image Analysis

no code implementations • 13 Aug 2022 • Zhaoshan Liu, Qiujie Lv, Ziduo Yang, YiFan Li, Chau Hung Lee, Lei Shen

The mainstream classification and segmentation tasks are further divided into eleven medical image modalities.

Denoising

Optimal Measurement of Drone Swarm in RSS-based Passive Localization with Region Constraints

no code implementations • 2 Aug 2022 • Xin Cheng, Feng Shu, YiFan Li, Zhihong Zhuang, Di Wu, Jiangzhou Wang

In this paper, optimal geometrical configurations of UAVs in received signal strength (RSS)-based localization under region constraints are investigated.

Affective Behaviour Analysis Using Pretrained Model with Facial Priori

1 code implementation • 24 Jul 2022 • YiFan Li, Haomiao Sun, Zhaori Liu, Hu Han

As a result, we utilize AffectNet pretrained CNN to extract expression scores concatenating with expression and AU scores from ViT to obtain the final VA features.

Emotion Recognition

LQoCo: Learning to Optimize Cache Capacity Overloading in Storage Systems

no code implementations • 21 Mar 2022 • Ji Zhang, Xijun Li, Xiyao Zhou, Mingxuan Yuan, Zhuo Cheng, Keji Huang, YiFan Li

Cache plays an important role in maintaining high and stable performance (i.e., high throughput, low tail latency and throughput jitter) in storage systems.

Management

Machine Learning Methods for Inferring the Number of UAV Emitters via Massive MIMO Receive Array

no code implementations • 2 Mar 2022 • YiFan Li, Feng Shu, Jinsong Hu, Shihao Yan, Haiwei Song, Weiqiang Zhu, Da Tian, Yaoliang Song, Jiangzhou Wang

The simulation results show that the machine learning-based methods can achieve good results in signal classification, especially neural networks, which can always maintain the classification accuracy above 70% with massive MIMO receive array.

Classification

Identifying and mitigating bias in algorithms used to manage patients in a pandemic

no code implementations • 30 Oct 2021 • YiFan Li, Garrett Yoon, Mustafa Nasir-Moin, David Rosenberg, Sean Neifert, Douglas Kondziolka, Eric Karl Oermann

Numerous COVID-19 clinical decision support systems have been developed.

Fairness

Adaptive unsupervised learning with enhanced feature representation for intra-tumor partitioning and survival prediction for glioblastoma

no code implementations • 21 Aug 2021 • YiFan Li, Chao Li, Yiran Wei, Stephen Price, Carola-Bibiane Schönlieb, Xi Chen

In this paper, we propose an adaptive unsupervised learning approach for efficient MRI intra-tumor partitioning and glioblastoma survival prediction.

Bayesian Optimization, Clustering

High-performance Passive Eigen-model-based Detectors of Single Emitter Using Massive MIMO Receivers

no code implementations • 3 Aug 2021 • Qijuan Jie, Xichao Zhan, Feng Shu, Yaohui Ding, Baihua Shi, YiFan Li, Jiangzhou Wang

The test statistic (TS) of the first method is defined as the ratio of maximum eigen-value (Max-EV) to minimum eigen-value (R-MaxEV-MinEV) while that of the second one is defined as the ratio of Max-EV to noise variance (R-MaxEV-NV).

$\rm ^{83}Rb$/$\rm ^{83m}Kr$ production and cross-section measurement with 3.4 MeV and 20 MeV proton beams

no code implementations • 4 Feb 2021 • Dan Zhang, Jingkai Xia, YiFan Li, Jingtao You, Yao Li, Changbo Fu, Jianglai Liu, Ning Zhou, Jie Bao, Huan Jia, Chenzhang Yuan, Yuan He, Weixing Xiong, Mengyun Guan

$\rm ^{83m}Kr$, with a short lifetime, is an ideal calibration source for liquid xenon or liquid argon detectors.

Nuclear Experiment, Instrumentation and Detectors

Bayesian optimization assisted unsupervised learning for efficient intra-tumor partitioning in MRI and survival prediction for glioblastoma patients

no code implementations • 5 Dec 2020 • YiFan Li, Chao Li, Stephen Price, Carola-Bibiane Schönlieb, Xi Chen

Although successful in tumor sub-region segmentation and survival prediction, radiomics based on machine learning algorithms is challenged by its robustness, due to the vague intermediate process and track changes.

Bayesian Optimization, BIG-bench Machine Learning