Search Results for author: Yichao Yan

Found 37 papers, 10 papers with code

IPAD: Industrial Process Anomaly Detection Dataset

no code implementations • 23 Apr 2024 • Jinfan Liu, Yichao Yan, Junjie Li, Weiming Zhao, Pengzhi Chu, Xingdong Sheng, Yunhui Liu, Xiaokang Yang

Video anomaly detection (VAD) is a challenging task aiming to recognize anomalies in video frames, and existing large-scale VAD researches primarily focus on road traffic and human activity scenes.

Anomaly Detection Video Anomaly Detection +1

Paper
Add Code

Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting

no code implementations • 22 Apr 2024 • Weili Zeng, Yichao Yan, Qi Zhu, Zhuo Chen, Pengzhi Chu, Weiming Zhao, Xiaokang Yang

Text-to-image (T2I) customization aims to create images that embody specific visual concepts delineated in textual descriptions.

Paper
Add Code

Rethinking Clothes Changing Person ReID: Conflicts, Synthesis, and Optimization

no code implementations • 19 Apr 2024 • Junjie Li, Guanshuo Wang, Fufu Yu, Yichao Yan, Qiong Jia, Shouhong Ding, Xingdong Sheng, Yunhui Liu, Xiaokang Yang

However, such improvement sacrifices the performance under the standard protocol, caused by the inner conflict between standard and CC.

Clothes Changing Person Re-Identification

Paper
Add Code

Monocular Identity-Conditioned Facial Reflectance Reconstruction

no code implementations • 30 Mar 2024 • Xingyu Ren, Jiankang Deng, Yuhao Cheng, Jia Guo, Chao Ma, Yichao Yan, Wenhan Zhu, Xiaokang Yang

We first learn a high-quality prior for facial reflectance.

3D Face Reconstruction

Paper
Add Code

ReGenNet: Towards Human Action-Reaction Synthesis

no code implementations • 18 Mar 2024 • Liang Xu, Yizhou Zhou, Yichao Yan, Xin Jin, Wenhan Zhu, Fengyun Rao, Xiaokang Yang, Wenjun Zeng

Humans constantly interact with their surrounding environments.

Paper
Add Code

A Comparative Study of Perceptual Quality Metrics for Audio-driven Talking Head Videos

1 code implementation • 11 Mar 2024 • Weixia Zhang, Chengguang Zhu, Jingnan Gao, Yichao Yan, Guangtao Zhai, Xiaokang Yang

However, performance evaluation research lags behind the development of talking head generation techniques.

Talking Head Generation

Paper
Code

Inter-X: Towards Versatile Human-Human Interaction Analysis

no code implementations • 26 Dec 2023 • Liang Xu, Xintao Lv, Yichao Yan, Xin Jin, Shuwen Wu, Congsheng Xu, Yifan Liu, Yizhou Zhou, Fengyun Rao, Xingdong Sheng, Yunhui Liu, Wenjun Zeng, Xiaokang Yang

We also equip Inter-X with versatile annotations of more than 34K fine-grained human part-level textual descriptions, semantic interaction categories, interaction order, and the relationship and personality of the subjects.

Paper
Add Code

SingingHead: A Large-scale 4D Dataset for Singing Head Animation

no code implementations • 7 Dec 2023 • Sijing Wu, Yunhao Li, Weitian Zhang, Jun Jia, Yucheng Zhu, Yichao Yan, Guangtao Zhai

Extensive comparative experiments with both SOTA 3D facial animation and 2D portrait animation methods demonstrate the necessity of singing-specific datasets in singing head animation tasks and the promising performance of our unified facial animation framework.

Paper
Add Code

EvaSurf: Efficient View-Aware Implicit Textured Surface Reconstruction on Mobile Devices

no code implementations • 16 Nov 2023 • Jingnan Gao, Zhuo Chen, Yichao Yan, Bowen Pan, Zhe Wang, Jiangjing Lyu, Xiaokang Yang

In our method, we first employ an efficient surface-based model with a multi-view supervision module to ensure accurate mesh reconstruction.

3D Reconstruction Surface Reconstruction

Paper
Add Code

Generalizable Person Search on Open-world User-Generated Video Content

no code implementations • 16 Oct 2023 • Junjie Li, Guanshuo Wang, Yichao Yan, Fufu Yu, Qiong Jia, Jie Qin, Shouhong Ding, Xiaokang Yang

Person search is a challenging task that involves detecting and retrieving individuals from a large set of un-cropped scene images.

Domain Generalization Person Search

Paper
Add Code

Directional Texture Editing for 3D Models

no code implementations • 26 Sep 2023 • Shengqi Liu, Zhuo Chen, Jingnan Gao, Yichao Yan, Wenhan Zhu, Jiangjing Lyu, Xiaokang Yang

However, the inherent complexity of 3D models and the ambiguous text description lead to the challenge in this task.

3D Object Editing

Paper
Add Code

HyperStyle3D: Text-Guided 3D Portrait Stylization via Hypernetworks

no code implementations • 19 Apr 2023 • Zhuo Chen, Xudong Xu, Yichao Yan, Ye Pan, Wenhan Zhu, Wayne Wu, Bo Dai, Xiaokang Yang

While the use of 3D-aware GANs bypasses the requirement of 3D data, we further alleviate the necessity of style images with the CLIP model being the stylization guidance.

Attribute

Paper
Add Code

GANHead: Towards Generative Animatable Neural Head Avatars

no code implementations • CVPR 2023 • Sijing Wu, Yichao Yan, Yunhao Li, Yuhao Cheng, Wenhan Zhu, Ke Gao, Xiaobo Li, Guangtao Zhai

To bring digital avatars into people's lives, it is highly demanded to efficiently generate complete, realistic, and animatable head avatars.

Paper
Add Code

Head3D: Complete 3D Head Generation via Tri-plane Feature Distillation

no code implementations • 28 Mar 2023 • Yuhao Cheng, Yichao Yan, Wenhan Zhu, Ye Pan, Bowen Pan, Xiaokang Yang

Head generation with diverse identities is an important task in computer vision and computer graphics, widely used in multimedia applications.

Paper
Add Code

Improving Fairness in Facial Albedo Estimation via Visual-Textual Cues

no code implementations • CVPR 2023 • Xingyu Ren, Jiankang Deng, Chao Ma, Yichao Yan, Xiaokang Yang

Our key insight is that intrinsic semantic attributes such as race, skin color, and age can constrain the albedo map.

3D Face Reconstruction Fairness +1

Paper
Add Code

3D-Aware Face Swapping

no code implementations • CVPR 2023 • Yixuan Li, Chao Ma, Yichao Yan, Wenhan Zhu, Xiaokang Yang

To achieve this, we take advantage of the strong geometry and texture prior of 3D human faces, where the 2D faces are projected into the latent space of a 3D generative model.

Attribute Face Swapping

Paper
Add Code

Domain Adaptive Person Search

2 code implementations • 25 Jul 2022 • Junjie Li, Yichao Yan, Guanshuo Wang, Fufu Yu, Qiong Jia, Shouhong Ding

In this paper, we take a further step and present Domain Adaptive Person Search (DAPS), which aims to generalize the model from a labeled source domain to the unlabeled target domain.

Pedestrian Detection Person Re-Identification +1

Paper
Code

DialogueNeRF: Towards Realistic Avatar Face-to-Face Conversation Video Generation

no code implementations • 15 Mar 2022 • Yichao Yan, Zanwei Zhou, Zi Wang, Jingnan Gao, Xiaokang Yang

In this paper, we propose a novel unified framework based on neural radiance field (NeRF) to address this task.

Talking Head Generation Video Generation

Paper
Add Code

ActFormer: A GAN-based Transformer towards General Action-Conditioned 3D Human Motion Generation

no code implementations • ICCV 2023 • Liang Xu, Ziyang Song, Dongliang Wang, Jing Su, Zhicheng Fang, Chenjing Ding, Weihao Gan, Yichao Yan, Xin Jin, Xiaokang Yang, Wenjun Zeng, Wei Wu

We present a GAN-based Transformer for general action-conditioned 3D human motion generation, including not only single-person actions but also multi-person interactive actions.

Paper
Add Code

A Coding Framework and Benchmark towards Compressed Video Understanding

no code implementations • 6 Feb 2022 • Yuan Tian, Guo Lu, Yichao Yan, Guangtao Zhai, Li Chen, Zhiyong Gao

However, in real-world scenarios, the videos are first compressed before the transportation and then decompressed for understanding.

Video Understanding

Paper
Add Code

DFA-NeRF: Personalized Talking Head Generation via Disentangled Face Attributes Neural Rendering

no code implementations • 3 Jan 2022 • Shunyu Yao, RuiZhe Zhong, Yichao Yan, Guangtao Zhai, Xiaokang Yang

Specifically, neural radiance field takes lip movements features and personalized attributes as two disentangled conditions, where lip movements are directly predicted from the audio inputs to achieve lip-synchronized generation.

Neural Rendering Talking Head Generation

Paper
Add Code

MovieNet-PS: A Large-Scale Person Search Dataset in the Wild

1 code implementation • 5 Dec 2021 • Jie Qin, Peng Zheng, Yichao Yan, Rong Quan, Xiaogang Cheng, Bingbing Ni

Person search aims to jointly localize and identify a query person from natural, uncropped images, which has been actively studied over the past few years.

Ranked #3 on Person Search on CUHK-SYSU

Person Search

Paper
Code

TAL: Two-stream Adaptive Learning for Generalizable Person Re-identification

no code implementations • 29 Nov 2021 • Yichao Yan, Junjie Li, Shengcai Liao, Jie Qin, Bingbing Ni, Xiaokang Yang

In the meantime, we design an adaptive BN layer in the domain-invariant stream, to approximate the statistics of various unseen domains.

Domain Generalization Generalizable Person Re-identification +1

Paper
Add Code

Efficient Person Search: An Anchor-Free Approach

4 code implementations • 1 Sep 2021 • Yichao Yan, Jinpeng Li, Jie Qin, Shengcai Liao, Xiaokang Yang

Third, by investigating the advantages of both anchor-based and anchor-free models, we further augment AlignPS with an ROI-Align head, which significantly improves the robustness of re-id features while still keeping our model highly efficient.

Ranked #4 on Person Search on PRW

Person Search

166

Paper
Code

EAN: Event Adaptive Network for Enhanced Action Recognition

1 code implementation • 22 Jul 2021 • Yuan Tian, Yichao Yan, Guangtao Zhai, Guodong Guo, Zhiyong Gao

In this paper, we propose a unified action recognition framework to investigate the dynamic nature of video content by introducing the following designs.

Ranked #14 on Action Recognition on Something-Something V1

Action Recognition

Paper
Code

Local-to-Global Self-Attention in Vision Transformers

no code implementations • 10 Jul 2021 • Jinpeng Li, Yichao Yan, Shengcai Liao, Xiaokang Yang, Ling Shao

Transformers have demonstrated great potential in computer vision tasks.

Image Classification Semantic Segmentation

Paper
Add Code

Exploring Visual Context for Weakly Supervised Person Search

3 code implementations • 19 Jun 2021 • Yichao Yan, Jinpeng Li, Shengcai Liao, Jie Qin, Bingbing Ni, Xiaokang Yang, Ling Shao

This paper inventively considers weakly supervised person search with only bounding box annotations.

Clustering Pedestrian Detection +2

Paper
Code

Learning Multi-Granular Hypergraphs for Video-Based Person Re-Identification

1 code implementation • CVPR 2020 • Yichao Yan, Jie Qin1, Jiaxin Chen, Li Liu, Fan Zhu, Ying Tai, Ling Shao

In each hypergraph, different temporal granularities are captured by hyperedges that connect a set of graph nodes (i. e., part-based features) across different temporal ranges.

Ranked #6 on Person Re-Identification on iLIDS-VID

Video-Based Person Re-Identification

Paper
Code

Learning Multi-Attention Context Graph for Group-Based Re-Identification

1 code implementation • 29 Apr 2021 • Yichao Yan, Jie Qin, Bingbing Ni, Jiaxin Chen, Li Liu, Fan Zhu, Wei-Shi Zheng, Xiaokang Yang, Ling Shao

Extensive experiments on the novel dataset as well as three existing datasets clearly demonstrate the effectiveness of the proposed framework for both group-based re-id tasks.

Person Re-Identification

Paper
Code

Anchor-Free Person Search

1 code implementation • CVPR 2021 • Yichao Yan, Jinpeng Li, Jie Qin, Song Bai, Shengcai Liao, Li Liu, Fan Zhu, Ling Shao

Person search aims to simultaneously localize and identify a query person from realistic, uncropped images, which can be regarded as the unified task of pedestrian detection and person re-identification (re-id).

Ranked #10 on Person Search on CUHK-SYSU

Pedestrian Detection Person Re-Identification +1

166

Paper
Code

Learning Context Graph for Person Search

no code implementations • CVPR 2019 • Yichao Yan, Qiang Zhang, Bingbing Ni, Wendong Zhang, Minghao Xu, Xiaokang Yang

Person re-identification has achieved great progress with deep convolutional neural networks.

Graph Learning Person Re-Identification +1

Paper
Add Code

Pose Transferrable Person Re-Identification

no code implementations • CVPR 2018 • Jinxian Liu, Bingbing Ni, Yichao Yan, Peng Zhou, Shuo Cheng, Jianguo Hu

On the other hand, in addition to the conventional discriminator of GAN (i. e., to distinguish between REAL/FAKE samples), we propose a novel guider sub-network which encourages the generated sample (i. e., with novel pose) towards better satisfying the ReID loss (i. e., cross-entropy ReID loss, triplet ReID loss).

Person Re-Identification

Paper
Add Code

Skeleton-aided Articulated Motion Generation

no code implementations • 4 Jul 2017 • Yichao Yan, Jingwei Xu, Bingbing Ni, Xiaokang Yang

This work make the first attempt to generate articulated human motion sequence from a single image.

Ranked #2 on Gesture-to-Gesture Translation on NTU Hand Digit

Gesture-to-Gesture Translation Video Generation

Paper
Add Code

Image Matching via Loopy RNN

no code implementations • 10 Jun 2017 • Donghao Luo, Bingbing Ni, Yichao Yan, Xiaokang Yang

Towards this end, we propose a novel loopy recurrent neural network (Loopy RNN), which is capable of aggregating relationship information of two input images in a progressive/iterative manner and outputting the consolidated matching score in the final iteration.

Paper
Add Code

Depth Structure Preserving Scene Image Generation

no code implementations • 1 Jun 2017 • Wendong Zhang, Bingbing Ni, Yichao Yan, Jingwei Xu, Xiaokang Yang

Key to automatically generate natural scene images is to properly arrange among various spatial elements, especially in the depth direction.

Image Generation Scene Generation

Paper
Add Code

Predicting Human Interaction via Relative Attention Model

no code implementations • 26 May 2017 • Yichao Yan, Bingbing Ni, Xiaokang Yang

Predicting human interaction is challenging as the on-going activity has to be inferred based on a partially observed video.

Paper
Add Code

Person Re-Identification via Recurrent Feature Aggregation

1 code implementation • 23 Jan 2017 • Yichao Yan, Bingbing Ni, Zhichao Song, Chao Ma, Yan Yan, Xiaokang Yang

We address the person re-identification problem by effectively exploiting a globally discriminative feature representation from a sequence of tracked human regions/patches.

Patch Matching Person Re-Identification

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.