Search Results for author: Yichao Yan

Found 37 papers, 10 papers with code

IPAD: Industrial Process Anomaly Detection Dataset

no code implementations 23 Apr 2024 Jinfan Liu, Yichao Yan, Junjie Li, Weiming Zhao, Pengzhi Chu, Xingdong Sheng, Yunhui Liu, Xiaokang Yang

Video anomaly detection (VAD) is a challenging task aiming to recognize anomalies in video frames, and existing large-scale VAD research primarily focuses on road traffic and human activity scenes.

Anomaly Detection, Video Anomaly Detection

Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting

no code implementations 22 Apr 2024 Weili Zeng, Yichao Yan, Qi Zhu, Zhuo Chen, Pengzhi Chu, Weiming Zhao, Xiaokang Yang

Text-to-image (T2I) customization aims to create images that embody specific visual concepts delineated in textual descriptions.

Rethinking Clothes Changing Person ReID: Conflicts, Synthesis, and Optimization

no code implementations 19 Apr 2024 Junjie Li, Guanshuo Wang, Fufu Yu, Yichao Yan, Qiong Jia, Shouhong Ding, Xingdong Sheng, Yunhui Liu, Xiaokang Yang

However, such improvement sacrifices the performance under the standard protocol, caused by the inner conflict between standard and CC.

Clothes Changing Person Re-Identification

Inter-X: Towards Versatile Human-Human Interaction Analysis

no code implementations 26 Dec 2023 Liang Xu, Xintao Lv, Yichao Yan, Xin Jin, Shuwen Wu, Congsheng Xu, Yifan Liu, Yizhou Zhou, Fengyun Rao, Xingdong Sheng, Yunhui Liu, Wenjun Zeng, Xiaokang Yang

We also equip Inter-X with versatile annotations of more than 34K fine-grained human part-level textual descriptions, semantic interaction categories, interaction order, and the relationship and personality of the subjects.

SingingHead: A Large-scale 4D Dataset for Singing Head Animation

no code implementations 7 Dec 2023 Sijing Wu, Yunhao Li, Weitian Zhang, Jun Jia, Yucheng Zhu, Yichao Yan, Guangtao Zhai

Extensive comparative experiments with both SOTA 3D facial animation and 2D portrait animation methods demonstrate the necessity of singing-specific datasets in singing head animation tasks and the promising performance of our unified facial animation framework.

EvaSurf: Efficient View-Aware Implicit Textured Surface Reconstruction on Mobile Devices

no code implementations 16 Nov 2023 Jingnan Gao, Zhuo Chen, Yichao Yan, Bowen Pan, Zhe Wang, Jiangjing Lyu, Xiaokang Yang

In our method, we first employ an efficient surface-based model with a multi-view supervision module to ensure accurate mesh reconstruction.

3D Reconstruction, Surface Reconstruction

Generalizable Person Search on Open-world User-Generated Video Content

no code implementations 16 Oct 2023 Junjie Li, Guanshuo Wang, Yichao Yan, Fufu Yu, Qiong Jia, Jie Qin, Shouhong Ding, Xiaokang Yang

Person search is a challenging task that involves detecting and retrieving individuals from a large set of un-cropped scene images.

Domain Generalization, Person Search

Directional Texture Editing for 3D Models

no code implementations 26 Sep 2023 Shengqi Liu, Zhuo Chen, Jingnan Gao, Yichao Yan, Wenhan Zhu, Jiangjing Lyu, Xiaokang Yang

However, the inherent complexity of 3D models and the ambiguity of text descriptions make this task challenging.

3D Object Editing

HyperStyle3D: Text-Guided 3D Portrait Stylization via Hypernetworks

no code implementations 19 Apr 2023 Zhuo Chen, Xudong Xu, Yichao Yan, Ye Pan, Wenhan Zhu, Wayne Wu, Bo Dai, Xiaokang Yang

While the use of 3D-aware GANs bypasses the requirement of 3D data, we further alleviate the necessity of style images with the CLIP model being the stylization guidance.

Attribute

GANHead: Towards Generative Animatable Neural Head Avatars

no code implementations CVPR 2023 Sijing Wu, Yichao Yan, Yunhao Li, Yuhao Cheng, Wenhan Zhu, Ke Gao, Xiaobo Li, Guangtao Zhai

To bring digital avatars into people's lives, there is high demand for efficiently generating complete, realistic, and animatable head avatars.

Head3D: Complete 3D Head Generation via Tri-plane Feature Distillation

no code implementations 28 Mar 2023 Yuhao Cheng, Yichao Yan, Wenhan Zhu, Ye Pan, Bowen Pan, Xiaokang Yang

Head generation with diverse identities is an important task in computer vision and computer graphics, widely used in multimedia applications.

3D-Aware Face Swapping

no code implementations CVPR 2023 Yixuan Li, Chao Ma, Yichao Yan, Wenhan Zhu, Xiaokang Yang

To achieve this, we take advantage of the strong geometry and texture prior of 3D human faces, where the 2D faces are projected into the latent space of a 3D generative model.

Attribute, Face Swapping

Domain Adaptive Person Search

2 code implementations 25 Jul 2022 Junjie Li, Yichao Yan, Guanshuo Wang, Fufu Yu, Qiong Jia, Shouhong Ding

In this paper, we take a further step and present Domain Adaptive Person Search (DAPS), which aims to generalize the model from a labeled source domain to the unlabeled target domain.

Pedestrian Detection, Person Re-Identification

ActFormer: A GAN-based Transformer towards General Action-Conditioned 3D Human Motion Generation

no code implementations ICCV 2023 Liang Xu, Ziyang Song, Dongliang Wang, Jing Su, Zhicheng Fang, Chenjing Ding, Weihao Gan, Yichao Yan, Xin Jin, Xiaokang Yang, Wenjun Zeng, Wei Wu

We present a GAN-based Transformer for general action-conditioned 3D human motion generation, including not only single-person actions but also multi-person interactive actions.

A Coding Framework and Benchmark towards Compressed Video Understanding

no code implementations 6 Feb 2022 Yuan Tian, Guo Lu, Yichao Yan, Guangtao Zhai, Li Chen, Zhiyong Gao

However, in real-world scenarios, the videos are first compressed before the transportation and then decompressed for understanding.

Video Understanding

DFA-NeRF: Personalized Talking Head Generation via Disentangled Face Attributes Neural Rendering

no code implementations 3 Jan 2022 Shunyu Yao, RuiZhe Zhong, Yichao Yan, Guangtao Zhai, Xiaokang Yang

Specifically, the neural radiance field takes lip movement features and personalized attributes as two disentangled conditions, where lip movements are directly predicted from the audio inputs to achieve lip-synchronized generation.

Neural Rendering, Talking Head Generation

MovieNet-PS: A Large-Scale Person Search Dataset in the Wild

1 code implementation 5 Dec 2021 Jie Qin, Peng Zheng, Yichao Yan, Rong Quan, Xiaogang Cheng, Bingbing Ni

Person search aims to jointly localize and identify a query person from natural, uncropped images, which has been actively studied over the past few years.

Person Search

TAL: Two-stream Adaptive Learning for Generalizable Person Re-identification

no code implementations 29 Nov 2021 Yichao Yan, Junjie Li, Shengcai Liao, Jie Qin, Bingbing Ni, Xiaokang Yang

In the meantime, we design an adaptive BN layer in the domain-invariant stream, to approximate the statistics of various unseen domains.

Domain Generalization, Generalizable Person Re-identification

Efficient Person Search: An Anchor-Free Approach

4 code implementations 1 Sep 2021 Yichao Yan, Jinpeng Li, Jie Qin, Shengcai Liao, Xiaokang Yang

Third, by investigating the advantages of both anchor-based and anchor-free models, we further augment AlignPS with an ROI-Align head, which significantly improves the robustness of re-id features while still keeping our model highly efficient.

Person Search

EAN: Event Adaptive Network for Enhanced Action Recognition

1 code implementation 22 Jul 2021 Yuan Tian, Yichao Yan, Guangtao Zhai, Guodong Guo, Zhiyong Gao

In this paper, we propose a unified action recognition framework to investigate the dynamic nature of video content by introducing the following designs.

Action Recognition

Learning Multi-Granular Hypergraphs for Video-Based Person Re-Identification

1 code implementation CVPR 2020 Yichao Yan, Jie Qin, Jiaxin Chen, Li Liu, Fan Zhu, Ying Tai, Ling Shao

In each hypergraph, different temporal granularities are captured by hyperedges that connect a set of graph nodes (i.e., part-based features) across different temporal ranges.

Video-Based Person Re-Identification

Learning Multi-Attention Context Graph for Group-Based Re-Identification

1 code implementation 29 Apr 2021 Yichao Yan, Jie Qin, Bingbing Ni, Jiaxin Chen, Li Liu, Fan Zhu, Wei-Shi Zheng, Xiaokang Yang, Ling Shao

Extensive experiments on the novel dataset as well as three existing datasets clearly demonstrate the effectiveness of the proposed framework for both group-based re-id tasks.

Person Re-Identification

Anchor-Free Person Search

1 code implementation CVPR 2021 Yichao Yan, Jinpeng Li, Jie Qin, Song Bai, Shengcai Liao, Li Liu, Fan Zhu, Ling Shao

Person search aims to simultaneously localize and identify a query person from realistic, uncropped images, which can be regarded as the unified task of pedestrian detection and person re-identification (re-id).

Pedestrian Detection, Person Re-Identification

Pose Transferrable Person Re-Identification

no code implementations CVPR 2018 Jinxian Liu, Bingbing Ni, Yichao Yan, Peng Zhou, Shuo Cheng, Jianguo Hu

On the other hand, in addition to the conventional discriminator of GAN (i.e., to distinguish between REAL/FAKE samples), we propose a novel guider sub-network which encourages the generated sample (i.e., with novel pose) towards better satisfying the ReID loss (i.e., cross-entropy ReID loss, triplet ReID loss).

Person Re-Identification

Image Matching via Loopy RNN

no code implementations 10 Jun 2017 Donghao Luo, Bingbing Ni, Yichao Yan, Xiaokang Yang

Towards this end, we propose a novel loopy recurrent neural network (Loopy RNN), which is capable of aggregating relationship information of two input images in a progressive/iterative manner and outputting the consolidated matching score in the final iteration.

Depth Structure Preserving Scene Image Generation

no code implementations 1 Jun 2017 Wendong Zhang, Bingbing Ni, Yichao Yan, Jingwei Xu, Xiaokang Yang

The key to automatically generating natural scene images is to properly arrange the various spatial elements, especially in the depth direction.

Image Generation, Scene Generation

Predicting Human Interaction via Relative Attention Model

no code implementations 26 May 2017 Yichao Yan, Bingbing Ni, Xiaokang Yang

Predicting human interaction is challenging as the on-going activity has to be inferred based on a partially observed video.

Person Re-Identification via Recurrent Feature Aggregation

1 code implementation 23 Jan 2017 Yichao Yan, Bingbing Ni, Zhichao Song, Chao Ma, Yan Yan, Xiaokang Yang

We address the person re-identification problem by effectively exploiting a globally discriminative feature representation from a sequence of tracked human regions/patches.

Patch Matching, Person Re-Identification
