Search Results for author: Tianshui Chen

Found 41 papers, 25 papers with code

Adaptive Global-Local Representation Learning and Selection for Cross-Domain Facial Expression Recognition

1 code implementation • 20 Jan 2024 • Yuefang Gao, Yuhao Xie, Zeke Zexi Hu, Tianshui Chen, Liang Lin

Specifically, the framework consists of separate global-local adversarial learning modules that learn domain-invariant global and local features independently.

Cross-Domain Facial Expression Recognition Model Optimization +2

Paper
Code

Mimic: Speaking Style Disentanglement for Speech-Driven 3D Facial Animation

no code implementations • 18 Dec 2023 • Hui Fu, Zeqing Wang, Ke Gong, Keze Wang, Tianshui Chen, Haojie Li, Haifeng Zeng, Wenxiong Kang

Moreover, to facilitate disentangled representation learning, we introduce four well-designed constraints: an auxiliary style classifier, an auxiliary inverse classifier, a content contrastive loss, and a pair of latent cycle losses, which can effectively contribute to the construction of the identity-related style space and semantic-related content space.

Disentanglement

Paper
Add Code

MotionCtrl: A Unified and Flexible Motion Controller for Video Generation

1 code implementation • 6 Dec 2023 • Zhouxia Wang, Ziyang Yuan, Xintao Wang, Tianshui Chen, Menghan Xia, Ping Luo, Ying Shan

Therefore, this paper presents MotionCtrl, a unified and flexible motion controller for video generation designed to effectively and independently control camera and object motion.

Object Video Generation

1,064

Paper
Code

SQLNet: Scale-Modulated Query and Localization Network for Few-Shot Class-Agnostic Counting

1 code implementation • 16 Nov 2023 • Hefeng Wu, Yandong Chen, Lingbo Liu, Tianshui Chen, Keze Wang, Liang Lin

In the localization stage, the Scale-aware Multi-head Localization (SAML) module utilizes the query tensor to predict the confidence, location, and size of each potential object.

Paper
Code

Contrastive Transformer Learning with Proximity Data Generation for Text-Based Person Search

1 code implementation • 15 Nov 2023 • Hefeng Wu, Weifeng Chen, Zhibin Liu, Tianshui Chen, Zhiguang Chen, Liang Lin

Moreover, we propose a proximity data generation (PDG) module to automatically produce more diverse data for cross-modal training.

Contrastive Learning Cross-Modal Retrieval +4

Paper
Code

Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation

1 code implementation • 23 Sep 2023 • Tao Pu, Tianshui Chen, Hefeng Wu, Yongyi Lu, Liang Lin

In this work, we propose a spatial-temporal knowledge-embedded transformer (STKET) that incorporates the prior spatial-temporal knowledge into the multi-head cross-attention mechanism to learn more representative relationship representations.

Graph Generation Object +2

Paper
Code

Towards CausalGPT: A Multi-Agent Approach for Faithful Knowledge Reasoning via Promoting Causal Consistency in LLMs

2 code implementations • 23 Aug 2023 • Ziyi Tang, Ruilin Wang, Weixing Chen, Keze Wang, Yang Liu, Tianshui Chen, Liang Lin

Despite advancements in LLMs, knowledge-based reasoning remains a longstanding issue due to the fragility of knowledge recall and inference.

counterfactual Science Question Answering

110

Paper
Code

RestoreFormer++: Towards Real-World Blind Face Restoration from Undegraded Key-Value Pairs

1 code implementation • 14 Aug 2023 • Zhouxia Wang, Jiawei Zhang, Tianshui Chen, Wenping Wang, Ping Luo

In this work, we propose RestoreFormer++, which on the one hand introduces fully-spatial attention mechanisms to model the contextual information and the interplay with the priors, and on the other hand, explores an extending degrading model to help generate more realistic degraded face images to alleviate the synthetic-to-real-world gap.

Blind Face Restoration

110

Paper
Code

Perception and Semantic Aware Regularization for Sequential Confidence Calibration

1 code implementation • CVPR 2023 • Zhenghua Peng, Yu Luo, Tianshui Chen, Keke Xu, Shuangping Huang

In this work, we find tokens/sequences with high perception and semantic correlations with the target ones contain more correlated and effective information and thus facilitate more effective regularization.

Language Modelling speech-recognition +1

Paper
Code

Open-World Pose Transfer via Sequential Test-Time Adaption

no code implementations • 20 Mar 2023 • Junyang Chen, Xiaoyu Xian, Zhijing Yang, Tianshui Chen, Yongyi Lu, Yukai Shi, Jinshan Pan, Liang Lin

In open-world conditions, the pose transfer task raises various independent signals: OOD appearance and skeleton, which need to be extracted and distributed in speciality.

Motion Synthesis Person Re-Identification +1

Paper
Add Code

OccluMix: Towards De-Occlusion Virtual Try-on by Semantically-Guided Mixup

2 code implementations • 3 Jan 2023 • Zhijing Yang, Junyang Chen, Yukai Shi, Hao Li, Tianshui Chen, Liang Lin

Image Virtual try-on aims at replacing the cloth on a personal image with a garment image (in-shop clothes), which has attracted increasing attention from the multimedia and computer vision communities.

Semantic Parsing Virtual Try-on

Paper
Code

Category-Adaptive Label Discovery and Noise Rejection for Multi-label Image Recognition with Partial Positive Labels

no code implementations • 15 Nov 2022 • Tao Pu, Qianru Lao, Hefeng Wu, Tianshui Chen, Liang Lin

To reject noisy labels, recent works regard large loss samples as noise but ignore the semantic correlation different multi-label images.

Paper
Add Code

Dual-Perspective Semantic-Aware Representation Blending for Multi-Label Image Recognition with Partial Labels

1 code implementation • 26 May 2022 • Tao Pu, Tianshui Chen, Hefeng Wu, Yukai Shi, Zhijing Yang, Liang Lin

Specifically, an instance-perspective representation blending (IPRB) module is designed to blend the representations of the known labels in an image with the representations of the corresponding unknown labels in another image to complement these unknown labels.

Ranked #3 on Multi-label Image Recognition with Partial Labels on PASCAL VOC 2007

Image Classification Multi-label Image Recognition with Partial Labels

Paper
Code

Heterogeneous Semantic Transfer for Multi-label Recognition with Partial Labels

1 code implementation • 23 May 2022 • Tianshui Chen, Tao Pu, Lingbo Liu, Yukai Shi, Zhijing Yang, Liang Lin

Multi-label image recognition with partial labels (MLR-PL), in which some labels are known while others are unknown for each image, may greatly reduce the cost of annotation and thus facilitate large-scale MLR.

Ranked #4 on Multi-label Image Recognition with Partial Labels on PASCAL VOC 2007

Multi-label Image Recognition with Partial Labels

Paper
Code

Exploring Negatives in Contrastive Learning for Unpaired Image-to-Image Translation

no code implementations • 23 Apr 2022 • Yupei Lin, Sen Zhang, Tianshui Chen, Yongyi Lu, Guangping Li, Yukai Shi

Recently, contrastive learning (CL) has been used to further investigate the image correspondence in unpaired image translation by using patch-based positive/negative learning.

Contrastive Learning Image-to-Image Translation +1

Paper
Add Code

Semantic Representation and Dependency Learning for Multi-Label Image Recognition

no code implementations • 8 Apr 2022 • Tao Pu, Mingzhan Sun, Hefeng Wu, Tianshui Chen, Ling Tian, Liang Lin

We also design an object erasing (OE) module to implicitly learn semantic dependency among categories by erasing semantic-aware regions to regularize the network training.

Object object-detection +1

Paper
Add Code

Semantic-Aware Representation Blending for Multi-Label Image Recognition with Partial Labels

1 code implementation • 4 Mar 2022 • Tao Pu, Tianshui Chen, Hefeng Wu, Liang Lin

However, these algorithms depend on sufficient multi-label annotations to train the models, leading to poor performance especially with low known label proportion.

Ranked #2 on Multi-label Image Recognition with Partial Labels on Visual Genome

Multi-label Image Recognition with Partial Labels

Paper
Code

Structured Semantic Transfer for Multi-Label Recognition with Partial Labels

1 code implementation • 21 Dec 2021 • Tianshui Chen, Tao Pu, Hefeng Wu, Yuan Xie, Liang Lin

To reduce the annotation cost, we propose a structured semantic transfer (SST) framework that enables training multi-label recognition models with partial labels, i. e., merely some labels are known while other labels are missing (also called unknown labels) per image.

Ranked #6 on Multi-label Image Recognition with Partial Labels on PASCAL VOC 2007

Multi-label Image Recognition with Partial Labels

Paper
Code

Aerial Images Meet Crowdsourced Trajectories: A New Approach to Robust Road Extraction

no code implementations • 30 Nov 2021 • Lingbo Liu, Zewei Yang, Guanbin Li, Kuo Wang, Tianshui Chen, Liang Lin

Land remote sensing analysis is a crucial research in earth science.

Representation Learning

Paper
Add Code

AU-Expression Knowledge Constrained Representation Learning for Facial Expression Recognition

1 code implementation • 29 Dec 2020 • Tao Pu, Tianshui Chen, Yuan Xie, Hefeng Wu, Liang Lin

In this work, we explore the correlations among the action units and facial expressions, and devise an AU-Expression Knowledge Constrained Representation Learning (AUE-CRL) framework to learn the AU representations without AU annotations and adaptively use representations to facilitate facial expression recognition.

Facial Expression Recognition Facial Expression Recognition (FER) +1

Paper
Code

Knowledge-Guided Multi-Label Few-Shot Learning for General Image Recognition

no code implementations • 20 Sep 2020 • Tianshui Chen, Liang Lin, Riquan Chen, Xiaolu Hui, Hefeng Wu

The framework exploits prior knowledge to guide adaptive information propagation among different categories to facilitate multi-label analysis and reduce the dependency of training samples.

Few-Shot Learning Multi-label Image Recognition with Partial Labels

Paper
Add Code

Cross-Domain Facial Expression Recognition: A Unified Evaluation Benchmark and Adversarial Graph Learning

1 code implementation • 3 Aug 2020 • Tianshui Chen, Tao Pu, Hefeng Wu, Yuan Xie, Lingbo Liu, Liang Lin

Although each declares to achieve superior performance, fair comparisons are lacking due to the inconsistent choices of the source/target datasets and feature extractors.

Ranked #1 on Cross-Domain Facial Expression Recognition on Source: AFE, Target: CK+, JAFFE, SFEW2.0, FER2013, ExpW

Cross-Domain Facial Expression Recognition Domain Adaptation +3

105

Paper
Code

Adversarial Graph Representation Adaptation for Cross-Domain Facial Expression Recognition

1 code implementation • 3 Aug 2020 • Yuan Xie, Tianshui Chen, Tao Pu, Hefeng Wu, Liang Lin

However, most of these works focus on holistic feature adaptation, and they ignore local features that are more transferable across different datasets.

Cross-Domain Facial Expression Recognition Facial Expression Recognition (FER)

105

Paper
Code

Fine-Grained Image Captioning with Global-Local Discriminative Objective

1 code implementation • 21 Jul 2020 • Jie Wu, Tianshui Chen, Hefeng Wu, Zhi Yang, Guangchun Luo, Liang Lin

This is primarily due to (i) the conservative characteristic of traditional training objectives that drives the model to generate correct but hardly discriminative captions for similar images and (ii) the uneven word distribution of the ground-truth captions, which encourages generating highly frequent words/phrases while suppressing the less frequent but more concrete ones.

Descriptive Image Captioning +2

Paper
Code

Efficient Crowd Counting via Structured Knowledge Transfer

2 code implementations • 23 Mar 2020 • Lingbo Liu, Jiaqi Chen, Hefeng Wu, Tianshui Chen, Guanbin Li, Liang Lin

Crowd counting is an application-oriented task and its inference efficiency is crucial for real-world applications.

Crowd Counting Transfer Learning

Paper
Code

Knowledge Graph Transfer Network for Few-Shot Recognition

1 code implementation • 21 Nov 2019 • Riquan Chen, Tianshui Chen, Xiaolu Hui, Hefeng Wu, Guanbin Li, Liang Lin

In this work, we represent the semantic correlations in the form of structured knowledge graph and integrate this graph into deep neural networks to promote few-shot learning by a novel Knowledge Graph Transfer Network (KGTN).

Ranked #1 on Few-Shot Image Classification on ImageNet-FS (10-shot, all)

Few-Shot Image Classification Few-Shot Learning +2

Paper
Code

Learning Semantic-Specific Graph Representation for Multi-Label Image Recognition

2 code implementations • ICCV 2019 • Tianshui Chen, Muxin Xu, Xiaolu Hui, Hefeng Wu, Liang Lin

Recognizing multiple labels of images is a practical and challenging task, and significant progress has been made by searching semantic-aware regions and modeling label dependency.

Ranked #8 on Multi-Label Classification on PASCAL VOC 2007

Graph Representation Learning Multi-Label Classification +1

158

Paper
Code

Semi-Supervised Video Salient Object Detection Using Pseudo-Labels

1 code implementation • ICCV 2019 • Pengxiang Yan, Guanbin Li, Yuan Xie, Zhen Li, Chuan Wang, Tianshui Chen, Liang Lin

Specifically, we present an effective video saliency detector that consists of a spatial refinement network and a spatiotemporal module.

Ranked #1 on Video Salient Object Detection on VOS-T (using extra training data)

object-detection Salient Object Detection +2

Paper
Code

Knowledge-Embedded Routing Network for Scene Graph Generation

3 code implementations • CVPR 2019 • Tianshui Chen, Weihao Yu, Riquan Chen, Liang Lin

More specifically, we show that the statistical correlations between objects appearing in images and their relationships, can be explicitly represented by a structured knowledge graph, and a routing mechanism is learned to propagate messages through the graph to explore their interactions.

Ranked #9 on Scene Graph Generation on Visual Genome

Graph Generation Scene Graph Generation

119

Paper
Code

Neural Task Planning with And-Or Graph Representations

no code implementations • 25 Aug 2018 • Tianshui Chen, Riquan Chen, Lin Nie, Xiaonan Luo, Xiaobai Liu, Liang Lin

This paper focuses on semantic task planning, i. e., predicting a sequence of actions toward accomplishing a specific task under a certain scene, which is a new problem in computer vision research.

Common Sense Reasoning valid

Paper
Add Code

Fine-Grained Representation Learning and Recognition by Exploiting Hierarchical Semantic Embedding

1 code implementation • 14 Aug 2018 • Tianshui Chen, Wenxi Wu, Yuefang Gao, Le Dong, Xiaonan Luo, Liang Lin

In this work, we investigate simultaneously predicting categories of different levels in the hierarchy and integrating this structured correlation information into the deep neural network by developing a novel Hierarchical Semantic Embedding (HSE) framework.

Ranked #52 on Fine-Grained Image Classification on CUB-200-2011

Fine-Grained Image Classification Fine-Grained Image Recognition +1

Paper
Code

Knowledge-Embedded Representation Learning for Fine-Grained Image Recognition

no code implementations • 2 Jul 2018 • Tianshui Chen, Liang Lin, Riquan Chen, Yang Wu, Xiaonan Luo

Humans can naturally understand an image in depth with the aid of rich knowledge accumulated from daily lives or professions.

Fine-Grained Image Classification Fine-Grained Image Recognition +1

Paper
Add Code

Deep Reasoning with Knowledge Graph for Social Relationship Understanding

1 code implementation • 2 Jul 2018 • Zhouxia Wang, Tianshui Chen, Jimmy Ren, Weihao Yu, Hui Cheng, Liang Lin

And this structured knowledge can be efficiently integrated into the deep neural network architecture to promote social relationship understanding by an end-to-end trainable Graph Reasoning Model (GRM), in which a propagation mechanism is learned to propagate node message through the graph to explore the interaction between persons of interest and the contextual objects.

Ranked #2 on Visual Social Relationship Recognition on PIPA

Visual Social Relationship Recognition

Paper
Code

Recurrent Attentional Reinforcement Learning for Multi-label Image Recognition

no code implementations • 20 Dec 2017 • Tianshui Chen, Zhouxia Wang, Guanbin Li, Liang Lin

Recognizing multiple labels of images is a fundamental but challenging task in computer vision, and remarkable progress has been attained by localizing semantic-aware image regions and predicting their labels with deep convolutional neural networks.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Learning a Wavelet-like Auto-Encoder to Accelerate Deep Neural Networks

2 code implementations • 20 Dec 2017 • Tianshui Chen, Liang Lin, WangMeng Zuo, Xiaonan Luo, Lei Zhang

In this work, aiming at a general and comprehensive way for neural network acceleration, we develop a Wavelet-like Auto-Encoder (WAE) that decomposes the original input image into two low-resolution channels (sub-images) and incorporate the WAE into the classification neural networks for joint training.

Classification General Classification +1

Paper
Code

Multi-label Image Recognition by Recurrently Discovering Attentional Regions

no code implementations • ICCV 2017 • Zhouxia Wang, Tianshui Chen, Guanbin Li, Ruijia Xu, Liang Lin

This paper proposes a novel deep architecture to address multi-label image recognition, a fundamental and practical task towards general visual understanding.

General Classification Multi-Label Image Classification +1

Paper
Add Code

Content-Adaptive Sketch Portrait Generation by Decompositional Representation Learning

no code implementations • 4 Oct 2017 • Dongyu Zhang, Liang Lin, Tianshui Chen, Xian Wu, Wenwei Tan, Ebroul Izquierdo

Sketch portrait generation benefits a wide range of applications such as digital entertainment and law enforcement.

Representation Learning

Paper
Add Code

Knowledge-Guided Recurrent Neural Network Learning for Task-Oriented Action Prediction

no code implementations • 15 Jul 2017 • Liang Lin, Lili Huang, Tianshui Chen, Yukang Gan, Hui Cheng

This paper aims at task-oriented action prediction, i. e., predicting a sequence of actions towards accomplishing a specific task under a certain scene, which is a new problem in computer vision research.

Common Sense Reasoning valid

Paper
Add Code

Learning to Segment Object Candidates via Recursive Neural Networks

no code implementations • 4 Dec 2016 • Tianshui Chen, Liang Lin, Xian Wu, Nong Xiao, Xiaonan Luo

To avoid the exhaustive search over locations and scales, current state-of-the-art object detection systems usually involve a crucial component generating a batch of candidate object proposals from images.

Object object-detection +1

Paper
Add Code

Character Proposal Network for Robust Text Extraction

no code implementations • 13 Feb 2016 • Shuye Zhang, Mude Lin, Tianshui Chen, Lianwen Jin, Liang Lin

Maximally stable extremal regions (MSER), which is a popular method to generate character proposals/candidates, has shown superior performance in scene text detection.

Scene Text Detection Text Detection

Paper
Add Code

DISC: Deep Image Saliency Computing via Progressive Representation Learning

no code implementations • 13 Nov 2015 • Tianshui Chen, Liang Lin, Lingbo Liu, Xiaonan Luo, Xuelong. Li

Our DISC framework is capable of uniformly highlighting the objects-of-interest from complex background while preserving well object details.

object-detection Representation Learning +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.