Search Results for author: Wanqing Li

Found 52 papers, 7 papers with code

Sub-action Prototype Learning for Point-level Weakly-supervised Temporal Action Localization

no code implementations16 Sep 2023 Yueyang Li, Yonghong Hou, Wanqing Li

Point-level weakly-supervised temporal action localization (PWTAL) aims to localize actions with only a single timestamp annotation for each action instance.

Pseudo Label Weakly-supervised Temporal Action Localization +1

Focal and Global Spatial-Temporal Transformer for Skeleton-based Action Recognition

no code implementations6 Oct 2022 Zhimin Gao, Peitao Wang, Pei Lv, Xiaoheng Jiang, Qidong Liu, Pichao Wang, Mingliang Xu, Wanqing Li

Besides, these methods directly calculate the pair-wise global self-attention equally for all the joints in both the spatial and temporal dimensions, undervaluing the effect of discriminative local joints and the short-range temporal dynamics.

Action Recognition Skeleton Based Action Recognition

Deep Stereo Image Compression via Bi-Directional Coding

no code implementations CVPR 2022 Jianjun Lei, Xiangrui Liu, Bo Peng, Dengchao Jin, Wanqing Li, Jingxiao Gu

Existing learning-based stereo compression methods usually adopt a unidirectional approach to encoding one image independently and the other image conditioned upon the first.

Image Compression

Multi-View Stereo with Transformer

no code implementations1 Dec 2021 Jie Zhu, Bo Peng, Wanqing Li, Haifeng Shen, Zhe Zhang, Jianjun Lei

It is built upon Transformer and is capable of extracting dense features with global context and 3D consistency, which are crucial to achieving reliable matching for MVS.

A Central Difference Graph Convolutional Operator for Skeleton-Based Action Recognition

1 code implementation13 Nov 2021 Shuangyan Miao, Yonghong Hou, Zhimin Gao, Mingliang Xu, Wanqing Li

This paper proposes a new graph convolutional operator called central difference graph convolution (CDGC) for skeleton based action recognition.

Action Recognition Skeleton Based Action Recognition

Novel View Synthesis from a Single Image via Unsupervised learning

no code implementations29 Oct 2021 Bingzheng Liu, Jianjun Lei, Bo Peng, Chuanbo Yu, Wanqing Li, Nam Ling

In particular, the network consists of a token transformation module (TTM) that facilities the transformation of the features extracted from a source viewpoint image into an intrinsic representation with respect to a pre-defined reference pose and a view generation module (VGM) that synthesizes an arbitrary view from the representation.

Novel View Synthesis

Context-guided Triple Matching for Multiple Choice Question Answering

no code implementations27 Sep 2021 Xun Yao, Junlong Ma, Xinrong Hu, Junping Liu, Jie Yang, Wanqing Li

The task of multiple choice question answering (MCQA) refers to identifying a suitable answer from multiple candidates, by estimating the matching score among the triple of the passage, question and answer.

Benchmarking Multiple-choice +1

Regression on Deep Visual Features using Artificial Neural Networks (ANNs) to Predict Hydraulic Blockage at Culverts

no code implementations25 Apr 2021 Umair Iqbal, Johan Barthelemy, Wanqing Li, Pascal Perez

Positive value of $R^{2}$ score indicated the presence of correlation between visual features and hydraulic blockage and suggested that both can be interrelated with each other.

regression

Automating Visual Blockage Classification of Culverts with Deep Learning

no code implementations21 Apr 2021 Umair Iqbal, Johan Barthelemy, Wanqing Li, Pascal Perez

Blockage of culverts by transported debris materials is reported as main contributor in originating urban flash floods.

Classification General Classification

Prediction of Hydraulic Blockage at Cross Drainage Structures using Regression Analysis

no code implementations6 Mar 2021 Umair Iqbal, Johan Barthelemy, Pascal Perez, Wanqing Li

Hydraulic blockage of cross-drainage structures such as culverts is considered one of main contributor in triggering urban flash floods.

regression

A Two-stream Neural Network for Pose-based Hand Gesture Recognition

no code implementations22 Jan 2021 Chuankun Li, Shuai Li, Yanbo Gao, Xiang Zhang, Wanqing Li

The self-attention based graph convolutional network has a dynamic self-attention mechanism to adaptively exploit the relationships of all hand joints in addition to the fixed topology and local feature extraction in the GCN.

Action Recognition Hand Gesture Recognition +2

Trear: Transformer-based RGB-D Egocentric Action Recognition

no code implementations5 Jan 2021 Xiangyu Li, Yonghong Hou, Pichao Wang, Zhimin Gao, Mingliang Xu, Wanqing Li

In this paper, we propose a \textbf{Tr}ansformer-based RGB-D \textbf{e}gocentric \textbf{a}ction \textbf{r}ecognition framework, called Trear.

Action Recognition Optical Flow Estimation

Transformer Guided Geometry Model for Flow-Based Unsupervised Visual Odometry

no code implementations8 Dec 2020 Xiangyu Li, Yonghong Hou, Pichao Wang, Zhimin Gao, Mingliang Xu, Wanqing Li

In this paper, we propose a method consisting of two camera pose estimators that deal with the information from pairwise images and a short sequence of images respectively.

Visual Odometry

A Framework of Combining Short-Term Spatial/Frequency Feature Extraction and Long-Term IndRNN for Activity Recognition

no code implementations1 Nov 2020 Beidi Zhao, Shuai Li, Yanbo Gao, Chuankun Li, Wanqing Li

Smartphone sensors based human activity recognition is attracting increasing interests nowadays with the popularization of smartphones.

Human Activity Recognition

SAR-NAS: Skeleton-based Action Recognition via Neural Architecture Searching

no code implementations29 Oct 2020 Haoyuan Zhang, Yonghong Hou, Pichao Wang, Zihui Guo, Wanqing Li

The recently developed DARTS (Differentiable Architecture Search) is adopted to search for an effective network architecture that is built upon the two types of cells.

Action Recognition Skeleton Based Action Recognition

Unsupervised Domain Expansion from Multiple Sources

no code implementations26 May 2020 Jing Zhang, Wanqing Li, Lu Sheng, Chang Tang, Philip Ogunbona

Given an existing system learned from previous source domains, it is desirable to adapt the system to new domains without accessing and forgetting all the previous domains in some applications.

Domain Adaptation Unsupervised Domain Expansion

Deep Independently Recurrent Neural Network (IndRNN)

1 code implementation11 Oct 2019 Shuai Li, Wanqing Li, Chris Cook, Yanbo Gao

Recurrent neural networks (RNNs) are known to be difficult to train due to the gradient vanishing and exploding problems and thus difficult to learn long-term patterns and construct deep networks.

Language Modelling Sequential Image Classification +1

A Fusion Framework for Camouflaged Moving Foreground Detection in the Wavelet Domain

no code implementations16 Apr 2018 Shuai Li, Dinei Florencio, Wanqing Li, Yaqin Zhao, Chris Cook

Conventional methods cannot distinguish the foreground from background due to the small differences between them and thus suffer from under-detection of the camouflaged foreground objects.

Importance Weighted Adversarial Nets for Partial Domain Adaptation

1 code implementation CVPR 2018 Jing Zhang, Zewei Ding, Wanqing Li, Philip Ogunbona

This paper proposes an importance weighted adversarial nets-based method for unsupervised domain adaptation, specific for partial domain adaptation where the target domain has less number of classes compared to the source domain.

Partial Domain Adaptation Transfer Learning +2

Depth Pooling Based Large-scale 3D Action Recognition with Convolutional Neural Networks

no code implementations17 Mar 2018 Pichao Wang, Wanqing Li, Zhimin Gao, Chang Tang, Philip Ogunbona

This paper proposes three simple, compact yet effective representations of depth sequences, referred to respectively as Dynamic Depth Images (DDI), Dynamic Depth Normal Images (DDNI) and Dynamic Depth Motion Normal Images (DDMNI), for both isolated and continuous action recognition.

3D Action Recognition Gesture Recognition

Independently Recurrent Neural Network (IndRNN): Building A Longer and Deeper RNN

11 code implementations CVPR 2018 Shuai Li, Wanqing Li, Chris Cook, Ce Zhu, Yanbo Gao

Experimental results have shown that the proposed IndRNN is able to process very long sequences (over 5000 time steps), can be used to construct very deep networks (21 layers used in the experiment) and still be trained robustly.

Language Modelling Sequential Image Classification +1

Cooperative Training of Deep Aggregation Networks for RGB-D Action Recognition

no code implementations5 Dec 2017 Pichao Wang, Wanqing Li, Jun Wan, Philip Ogunbona, Xinwang Liu

Differently from the conventional ConvNet that learns the deep separable features for homogeneous modality-based classification with only one softmax loss function, the c-ConvNet enhances the discriminative power of the deeply learned features and weakens the undesired modality discrepancy by jointly optimizing a ranking loss and a softmax loss for both homogeneous and heterogeneous modalities.

Action Recognition Temporal Action Localization

RGB-D-based Human Motion Recognition with Deep Learning: A Survey

no code implementations31 Oct 2017 Pichao Wang, Wanqing Li, Philip Ogunbona, Jun Wan, Sergio Escalera

Specifically, deep learning methods based on the CNN and RNN architectures have been adopted for motion recognition using RGB-D data.

Foreground Detection in Camouflaged Scenes

no code implementations11 Jul 2017 Shuai Li, Dinei Florencio, Yaqin Zhao, Chris Cook, Wanqing Li

This paper proposes a texture guided weighted voting (TGWV) method which can efficiently detect foreground objects in camouflaged scenes.

Skeleton-based Action Recognition Using LSTM and CNN

no code implementations6 Jul 2017 Chuankun Li, Pichao Wang, Shuang Wang, Yonghong Hou, Wanqing Li

Recent methods based on 3D skeleton data have achieved outstanding performance due to its conciseness, robustness, and view-independent representation.

Action Analysis Action Recognition +2

A Fully Trainable Network with RNN-based Pooling

no code implementations16 Jun 2017 Shuai Li, Wanqing Li, Chris Cook, Ce Zhu, Yanbo Gao

Such a network with learnable pooling function is referred to as a fully trainable network (FTN).

Recent Advances in Transfer Learning for Cross-Dataset Visual Recognition: A Problem-Oriented Perspective

no code implementations11 May 2017 Jing Zhang, Wanqing Li, Philip Ogunbona, Dong Xu

This paper takes a problem-oriented perspective and presents a comprehensive review of transfer learning methods, both shallow and deep, for cross-dataset visual recognition.

Transfer Learning

Scene Flow to Action Map: A New Representation for RGB-D based Action Recognition with Convolutional Neural Networks

no code implementations CVPR 2017 Pichao Wang, Wanqing Li, Zhimin Gao, Yuyao Zhang, Chang Tang, Philip Ogunbona

Based on the scene flow vectors, we propose a new representation, namely, Scene Flow to Action Map (SFAM), that describes several long term spatio-temporal dynamics for action recognition.

3D Action Recognition

Large-scale Isolated Gesture Recognition Using Convolutional Neural Networks

no code implementations7 Jan 2017 Pichao Wang, Wanqing Li, Song Liu, Zhimin Gao, Chang Tang, Philip Ogunbona

This paper proposes three simple, compact yet effective representations of depth sequences, referred to respectively as Dynamic Depth Images (DDI), Dynamic Depth Normal Images (DDNI) and Dynamic Depth Motion Normal Images (DDMNI).

General Classification Gesture Recognition

Action Recognition Based on Joint Trajectory Maps with Convolutional Neural Networks

no code implementations30 Dec 2016 Pichao Wang, Wanqing Li, Chuankun Li, Yonghong Hou

Convolutional Neural Networks (ConvNets) have recently shown promising performance in many computer vision tasks, especially image-based recognition.

Action Recognition Skeleton Based Action Recognition +1

Action Recognition Based on Joint Trajectory Maps Using Convolutional Neural Networks

no code implementations8 Nov 2016 Pichao Wang, Zhaoyang Li, Yonghong Hou, Wanqing Li

Recently, Convolutional Neural Networks (ConvNets) have shown promising performances in many computer vision tasks, especially image-based recognition.

Action Recognition Temporal Action Localization

Exploiting Structure Sparsity for Covariance-based Visual Representation

no code implementations27 Oct 2016 Jianjia Zhang, Lei Wang, Luping Zhou, Wanqing Li

A variety of methods have been proposed to boost its efficacy, with some recent ones resorting to nonlinear kernel technique.

Action Recognition Temporal Action Localization

Large-scale Continuous Gesture Recognition Using Convolutional Neural Networks

no code implementations22 Aug 2016 Pichao Wang, Wanqing Li, Song Liu, Yuyao Zhang, Zhimin Gao, Philip Ogunbona

This paper addresses the problem of continuous gesture recognition from sequences of depth maps using convolutional neutral networks (ConvNets).

General Classification Gesture Recognition

Learning a Pose Lexicon for Semantic Action Recognition

no code implementations1 Apr 2016 Lijuan Zhou, Wanqing Li, Philip Ogunbona

This paper presents a novel method for learning a pose lexicon comprising semantic poses defined by textual instructions and their associated visual poses defined by visual features.

Action Recognition Temporal Action Localization +2

Creating Simplified 3D Models with High Quality Textures

no code implementations22 Feb 2016 Song Liu, Wanqing Li, Philip Ogunbona, Yang-Wai Chow

This paper presents an extension to the KinectFusion algorithm which allows creating simplified 3D models with high quality RGB textures.

Vocal Bursts Intensity Prediction

Planogram Compliance Checking Based on Detection of Recurring Patterns

no code implementations22 Feb 2016 Song Liu, Wanqing Li, Stephen Davis, Christian Ritz, Hongda Tian

Product layout is extracted from an input image by means of unsupervised recurring pattern detection and matched via graph matching with the expected product layout specified by a planogram to measure the level of compliance.

Graph Matching

Combining ConvNets with Hand-Crafted Features for Action Recognition Based on an HMM-SVM Classifier

no code implementations1 Feb 2016 Pichao Wang, Zhaoyang Li, Yonghong Hou, Wanqing Li

This paper proposes a new framework for RGB-D-based action recognition that takes advantages of hand-designed features from skeleton data and deeply learned features from depth maps, and exploits effectively both the local and global temporal information.

Action Recognition Temporal Action Localization

RGB-D-based Action Recognition Datasets: A Survey

no code implementations21 Jan 2016 Jing Zhang, Wanqing Li, Philip O. Ogunbona, Pichao Wang, Chang Tang

Human action recognition from RGB-D (Red, Green, Blue and Depth) data has attracted increasing attention since the first work reported in 2010.

Action Recognition Temporal Action Localization

Beyond Covariance: Feature Representation With Nonlinear Kernel Matrices

no code implementations ICCV 2015 Lei Wang, Jianjia Zhang, Luping Zhou, Chang Tang, Wanqing Li

It proposes an open framework to use the kernel matrix over feature dimensions as a generic representation and discusses its properties and advantages.

Action Recognition Temporal Action Localization

Online Action Recognition based on Incremental Learning of Weighted Covariance Descriptors

no code implementations10 Nov 2015 Chang Tang, Pichao Wang, Wanqing Li

This paper presents a fast yet effective method to recognize actions from stream of noisy skeleton data, and a novel weighted covariance descriptor is adopted to accumulate evidence.

Action Recognition Incremental Learning +1

Deep Convolutional Neural Networks for Action Recognition Using Depth Map Sequences

no code implementations20 Jan 2015 Pichao Wang, Wanqing Li, Zhimin Gao, Jing Zhang, Chang Tang, Philip Ogunbona

The results show that our approach can achieve state-of-the-art results on the individual datasets and without dramatical performance degradation on the Combined Dataset.

Action Recognition Temporal Action Localization

Learning Discriminative Stein Kernel for SPD Matrices and Its Applications

1 code implementation8 Jul 2014 Jianjia Zhang, Lei Wang, Luping Zhou, Wanqing Li

A comprehensive experimental study is conducted on a variety of image classification tasks to compare our proposed discriminative Stein kernel with the original Stein kernel and other commonly used methods for evaluating the similarity between SPD matrices.

Classification General Classification +1

A Fast Approximate AIB Algorithm for Distributional Word Clustering

no code implementations CVPR 2013 Lei Wang, Jianjia Zhang, Luping Zhou, Wanqing Li

Distributional word clustering merges the words having similar probability distributions to attain reliable parameter estimation, compact classification models and even better classification performance.

Clustering Computational Efficiency +4

Cannot find the paper you are looking for? You can Submit a new open access paper.