Search Results for author: Jun Liu

Found 275 papers, 62 papers with code

Learning Progressive Joint Propagation for Human Motion Prediction

no code implementations ECCV 2020 Yujun Cai, Lin Huang, Yiwei Wang, Tat-Jen Cham, Jianfei Cai, Junsong Yuan, Jun Liu, Xu Yang, Yiheng Zhu, Xiaohui Shen, Ding Liu, Jing Liu, Nadia Magnenat Thalmann

Last, in order to incorporate a general motion space for high-quality prediction, we build a memory-based dictionary, which aims to preserve the global motion patterns in training data to guide the predictions.

Human motion prediction motion prediction

HARD-Net: Hardness-AwaRe Discrimination Network for 3D Early Activity Prediction

no code implementations ECCV 2020 Tianjiao Li, Jun Liu, Wei zhang, Ling-Yu Duan

In this paper, we propose a novel Hardness-AwaRe Discrimination Network (HARD-Net) to specifically investigate the relationships between the similar activity pairs that are hard to be discriminated.

Activity Prediction Skeleton Based Action Recognition

OID: Outlier Identifying and Discarding in Blind Image Deblurring

no code implementations ECCV 2020 Liang Chen, Faming Fang, Jiawei Zhang, Jun Liu, Guixu Zhang

Even a small amount of outliers can dramatically degrade the quality of the estimated blur kernel, because the outliers are not conforming to the linear formation of the blurring process.

Blind Image Deblurring Image Deblurring

Collaborative Learning of Gesture Recognition and 3D Hand Pose Estimation with Multi-Order Feature Analysis

no code implementations ECCV 2020 Siyuan Yang, Jun Liu, Shijian Lu, Meng Hwa Er, Alex C. Kot

The proposed network exploits joint-aware features that are crucial for both tasks, with which gesture recognition and 3D hand pose estimation boost each other to learn highly discriminative features and models.

3D Hand Pose Estimation Gesture Recognition

Adaptive Computationally Efficient Network for Monocular 3D Hand Pose Estimation

no code implementations ECCV 2020 Zhipeng Fan, Jun Liu, Yao Wang

A novel model, called Adaptive Computationally Efficient (ACE) network, is proposed, which takes advantage of a Gaussian kernel based Gate Module to dynamically switch the computation between a light model and a heavy network for feature extraction.

3D Hand Pose Estimation

Look, Listen, and Answer: Overcoming Biases for Audio-Visual Question Answering

no code implementations18 Apr 2024 Jie Ma, Min Hu, Pinghui Wang, Wangchun Sun, Lingyun Song, Hongbin Pei, Jun Liu, Youtian Du

The former leads to a large, diverse test space, while the latter results in a comprehensive robustness evaluation on rare, frequent, and overall questions.

An Interpretable Power System Transient Stability Assessment Method with Expert Guiding Neural-Regression-Tree

no code implementations3 Apr 2024 Hanxuan Wang, Na Lu, Zixuan Wang, Jiacheng Liu, Jun Liu

TSA-ENRT utilizes an expert guiding nonlinear regression tree to approximate the neural network prediction and the neural network can be explained by the interpretive rules generated by the tree model.

regression

Action Detection via an Image Diffusion Process

no code implementations1 Apr 2024 Lin Geng Foo, Tianjiao Li, Hossein Rahmani, Jun Liu

Action detection aims to localize the starting and ending points of action instances in untrimmed videos, and predict the classes of those instances.

Action Detection Image Generation

LLMs are Good Sign Language Translators

no code implementations1 Apr 2024 Jia Gong, Lin Geng Foo, Yixuan He, Hossein Rahmani, Jun Liu

Sign Language Translation (SLT) is a challenging task that aims to translate sign videos into spoken language.

Sign Language Translation Translation

LLMs are Good Action Recognizers

no code implementations31 Mar 2024 Haoxuan Qu, Yujun Cai, Jun Liu

Motivated by this, we propose a novel LLM-AR framework, in which we investigate treating the Large Language Model as an Action Recognizer.

Action Recognition Language Modelling +3

Learning Piecewise Residuals of Control Barrier Functions for Safety of Switching Systems using Multi-Output Gaussian Processes

no code implementations26 Mar 2024 Mohammad Aali, Jun Liu

This uncertainty results in piecewise residuals for each switching surface, impacting the CLF and CBF constraints.

Gaussian Processes

Manifold-Guided Lyapunov Control with Diffusion Models

1 code implementation26 Mar 2024 Amartya Mukherjee, Thanin Quartz, Jun Liu

This paper presents a novel approach to generating stabilizing controllers for a large class of dynamical systems using diffusion models.

GPT-Connect: Interaction between Text-Driven Human Motion Generator and 3D Scenes in a Training-free Manner

no code implementations22 Mar 2024 Haoxuan Qu, Ziyan Guo, Jun Liu

Recently, while text-driven human motion generation has received massive research attention, most existing text-driven motion generators are generally only designed to generate motion sequences in a blank background.

SoftPatch: Unsupervised Anomaly Detection with Noisy Data

1 code implementation NeurIPS 2022 Xi Jiang, Ying Chen, Qiang Nie, Yong liu, Jianlin Liu, Bin-Bin Gao, Jun Liu, Chengjie Wang, Feng Zheng

Noise discriminators are utilized to generate outlier scores for patch-level noise elimination before coreset construction.

Unsupervised Anomaly Detection

Efficient Pruning of Large Language Model with Adaptive Estimation Fusion

no code implementations16 Mar 2024 Jun Liu, Chao Wu, Changdi Yang, Hao Tang, Haoye Dong, Zhenglun Kong, Geng Yuan, Wei Niu, Dong Huang, Yanzhi Wang

Large language models (LLMs) have become crucial for many generative downstream tasks, leading to an inevitable trend and significant challenge to deploy them efficiently on resource-constrained devices.

Language Modelling Large Language Model

Enhancing Human-Centered Dynamic Scene Understanding via Multiple LLMs Collaborated Reasoning

no code implementations15 Mar 2024 Hang Zhang, Wenxiao Zhang, Haoxuan Qu, Jun Liu

Human-centered dynamic scene understanding plays a pivotal role in enhancing the capability of robotic and autonomous systems, in which Video-based Human-Object Interaction (V-HOI) detection is a crucial task in semantic scene understanding, aimed at comprehensively understanding HOI relationships within a video to benefit the behavioral decisions of mobile robots and autonomous driving systems.

Autonomous Driving Human-Object Interaction Detection +2

LyZNet: A Lightweight Python Tool for Learning and Verifying Neural Lyapunov Functions and Regions of Attraction

no code implementations15 Mar 2024 Jun Liu, Yiming Meng, Maxwell Fitzsimmons, Ruikun Zhou

In this paper, we describe a lightweight Python framework that provides integrated learning and verification of neural Lyapunov functions for stability analysis.

Compositionally Verifiable Vector Neural Lyapunov Functions for Stability Analysis of Interconnected Nonlinear Systems

1 code implementation15 Mar 2024 Jun Liu, Yiming Meng, Maxwell Fitzsimmons, Ruikun Zhou

While there has been increasing interest in using neural networks to compute Lyapunov functions, verifying that these functions satisfy the Lyapunov conditions and certifying stability regions remain challenging due to the curse of dimensionality.

LTGC: Long-tail Recognition via Leveraging LLMs-driven Generated Content

no code implementations9 Mar 2024 QiHao Zhao, Yalun Dai, Hao Li, Wei Hu, Fan Zhang, Jun Liu

Long-tail recognition is challenging because it requires the model to learn good representations from tail categories and address imbalances across all categories.

InstructGIE: Towards Generalizable Image Editing

no code implementations8 Mar 2024 Zichong Meng, Changdi Yang, Jun Liu, Hao Tang, Pu Zhao, Yanzhi Wang

In response to this challenge, our study introduces a novel image editing framework with enhanced generalization robustness by boosting in-context learning capability and unifying language instruction.

Denoising In-Context Learning

IBD: Alleviating Hallucinations in Large Vision-Language Models via Image-Biased Decoding

no code implementations28 Feb 2024 Lanyun Zhu, Deyi Ji, Tianrun Chen, Peng Xu, Jieping Ye, Jun Liu

Despite achieving rapid developments and with widespread applications, Large Vision-Language Models (LVLMs) confront a serious challenge of being prone to generating hallucinations.

An inexact Bregman proximal point method and its acceleration version for unbalanced optimal transport

no code implementations26 Feb 2024 Xiang Chen, Faqiang Wang, Jun Liu, Li Cui

The algorithm (1) converges to the true solution of UOT, (2) has theoretical guarantees and robust regularization parameter selection, (3) mitigates numerical stability issues, and (4) can achieve comparable computational complexity to the Scaling algorithm in specific practice.

Physics-Informed Neural Network Policy Iteration: Algorithms, Convergence, and Verification

no code implementations15 Feb 2024 Yiming Meng, Ruikun Zhou, Amartya Mukherjee, Maxwell Fitzsimmons, Christopher Song, Jun Liu

We provide a theoretical analysis of both algorithms in terms of convergence of neural approximations towards the true optimal solutions in a general setting.

Denoising Diffusion Restoration Tackles Forward and Inverse Problems for the Laplace Operator

no code implementations13 Feb 2024 Amartya Mukherjee, Melissa M. Stadt, Lena Podina, Mohammad Kohandel, Jun Liu

Equivalently, we present an approach to restore the solution and the parameters in the Poisson equation by exploiting the eigenvalues and the eigenfunctions of the Laplacian operator.

Denoising

AoSRNet: All-in-One Scene Recovery Networks via Multi-knowledge Integration

1 code implementation6 Feb 2024 Yuxu Lu, Dong Yang, Yuan Gao, Ryan Wen Liu, Jun Liu, Yu Guo

Additionally, we suggest a multi-receptive field extraction module (MEM) to attenuate the loss of image texture details caused by GC nonlinear and OLS linear transformations.

Autonomous Vehicles

Source-free Domain Adaptive Object Detection in Remote Sensing Images

no code implementations31 Jan 2024 Weixing Liu, Jun Liu, Xin Su, Han Nie, Bin Luo

To address this challenge, we propose a practical source-free object detection (SFOD) setting for RS images, which aims to perform target domain adaptation using only the source pre-trained model.

Domain Adaptation object-detection +1

Left-right Discrepancy for Adversarial Attack on Stereo Networks

no code implementations14 Jan 2024 Pengfei Wang, Xiaofei Hui, Beijia Lu, Nimrod Lilith, Jun Liu, Sameer Alam

Stereo matching neural networks often involve a Siamese structure to extract intermediate features from left and right images.

Adversarial Attack Disparity Estimation +1

Learning from Semi-Factuals: A Debiased and Semantic-Aware Framework for Generalized Relation Discovery

no code implementations12 Jan 2024 Jiaxin Wang, Lingling Zhang, Jun Liu, Tianlin Guo, Wenjun Wu

The key challenges of GRD are how to mitigate the serious model biases caused by labeled pre-defined relations to learn effective relational representations and how to determine the specific semantics of novel relations during classifying or clustering unlabeled instances.

Relation Relation Extraction +2

FlightLLM: Efficient Large Language Model Inference with a Complete Mapping Flow on FPGAs

no code implementations8 Jan 2024 Shulin Zeng, Jun Liu, Guohao Dai, Xinhao Yang, Tianyu Fu, Hongyi Wang, Wenheng Ma, Hanbo Sun, Shiyao Li, Zixiao Huang, Yadong Dai, Jintao Li, Zehao Wang, Ruoyu Zhang, Kairui Wen, Xuefei Ning, Yu Wang

However, existing GPU and transformer-based accelerators cannot efficiently process compressed LLMs, due to the following unresolved challenges: low computational efficiency, underutilized memory bandwidth, and large compilation overheads.

Computational Efficiency Language Modelling +2

Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports

1 code implementation3 Jan 2024 Haopeng Li, Andong Deng, Qiuhong Ke, Jun Liu, Hossein Rahmani, Yulan Guo, Bernt Schiele, Chen Chen

Reasoning over sports videos for question answering is an important task with numerous applications, such as player training and information retrieval.

Action Understanding counterfactual +4

Double-well Net for Image Segmentation

no code implementations31 Dec 2023 Hao liu, Jun Liu, Raymond Chan, Xue-Cheng Tai

In this study, our goal is to integrate classical mathematical models with deep neural networks by introducing two novel deep neural network models for image segmentation known as Double-well Nets.

Image Segmentation Segmentation +1

6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation

no code implementations29 Dec 2023 Li Xu, Haoxuan Qu, Yujun Cai, Jun Liu

Estimating the 6D object pose from a single RGB image often involves noise and indeterminacy due to challenges such as occlusions and cluttered backgrounds.

6D Pose Estimation using RGB Denoising +1

MatchDet: A Collaborative Framework for Image Matching and Object Detection

no code implementations18 Dec 2023 Jinxiang Lai, Wenlong Wu, Bin-Bin Gao, Jun Liu, Jiawei Zhan, Congchong Nie, Yi Zeng, Chengjie Wang

Image matching and object detection are two fundamental and challenging tasks, while many related applications consider them two individual tasks (i. e. task-individual).

object-detection Object Detection

Physics-Informed Neural Network Lyapunov Functions: PDE Characterization, Learning, and Verification

no code implementations14 Dec 2023 Jun Liu, Yiming Meng, Maxwell Fitzsimmons, Ruikun Zhou

We provide a systematic investigation of using physics-informed neural networks to compute Lyapunov functions.

LLaFS: When Large Language Models Meet Few-Shot Segmentation

no code implementations28 Nov 2023 Lanyun Zhu, Tianrun Chen, Deyi Ji, Jieping Ye, Jun Liu

This paper proposes LLaFS, the first attempt to leverage large language models (LLMs) in few-shot segmentation.

Attribute Segmentation

Enabling Fast 2-bit LLM on GPUs: Memory Alignment and Asynchronous Dequantization

no code implementations28 Nov 2023 Jinhao Li, Shiyao Li, Jiaming Xu, Shan Huang, Yaoxiu Lian, Jun Liu, Yu Wang, Guohao Dai

Weights are quantized by groups, while the ranges of weights are large in some groups, resulting in large quantization errors and nonnegligible accuracy loss (e. g. >3% for Llama2-7b with 2-bit quantization in GPTQ and Greenbit).

Quantization

ZeroPS: High-quality Cross-modal Knowledge Transfer for Zero-Shot 3D Part Segmentation

no code implementations24 Nov 2023 Yuheng Xue, Nenglun Chen, Jun Liu, Wenyun Sun

The main idea of our approach is to explore the natural relationship between multi-view correspondences and the prompt mechanism of foundational models and build bridges on it.

3D Part Segmentation Transfer Learning +1

Trustworthy Large Models in Vision: A Survey

no code implementations16 Nov 2023 Ziyan Guo, Li Xu, Jun Liu

The rapid progress of Large Models (LMs) has recently revolutionized various fields of deep learning with remarkable grades, ranging from Natural Language Processing (NLP) to Computer Vision (CV).

Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models

no code implementations15 Nov 2023 Fangzhi Xu, Zhiyong Wu, Qiushi Sun, Siyu Ren, Fei Yuan, Shuai Yuan, Qika Lin, Yu Qiao, Jun Liu

Although Large Language Models (LLMs) demonstrate remarkable ability in processing and generating human-like text, they do have limitations when it comes to comprehending and expressing world knowledge that extends beyond the boundaries of natural language(e. g., chemical molecular formula).

World Knowledge

FlashDecoding++: Faster Large Language Model Inference on GPUs

no code implementations2 Nov 2023 Ke Hong, Guohao Dai, Jiaming Xu, Qiuli Mao, Xiuhong Li, Jun Liu, Kangdi Chen, Yuhan Dong, Yu Wang

A single and static dataflow may lead to a 50. 25% performance loss for GEMMs of different shapes in LLM inference.

Language Modelling Large Language Model

FuXi-Extreme: Improving extreme rainfall and wind forecasts with diffusion model

no code implementations25 Oct 2023 Xiaohui Zhong, Lei Chen, Jun Liu, Chensen Lin, Yuan Qi, Hao Li

State-of-the-art ML-based weather forecast models, such as FuXi, have demonstrated superior statistical forecast performance in comparison to the high-resolution forecasts (HRES) of the European Centre for Medium-Range Weather Forecasts (ECMWF).

Denoising Weather Forecasting

Recoverable Privacy-Preserving Image Classification through Noise-like Adversarial Examples

1 code implementation19 Oct 2023 Jun Liu, Jiantao Zhou, Jinyu Tian, Weiwei Sun

Extensive experiments demonstrate that 1) the classification accuracy of the classifier trained in the plaintext domain remains the same in both the ciphertext and plaintext domains; 2) the encrypted images can be recovered into their original form with an average PSNR of up to 51+ dB for the SVHN dataset and 48+ dB for the VGGFace2 dataset; 3) our system exhibits satisfactory generalization capability on the encryption, decryption and classification tasks across datasets that are different from the training one; and 4) a high-level of security is achieved against three potential threat models.

Cloud Computing Image Classification +1

Generating Robust Adversarial Examples against Online Social Networks (OSNs)

1 code implementation19 Oct 2023 Jun Liu, Jiantao Zhou, Haiwei Wu, Weiwei Sun, Jinyu Tian

In this work, we aim to design a new framework for generating robust AEs that can survive the OSN transmission; namely, the AEs before and after the OSN transmission both possess strong attack capabilities.

Learning Comprehensive Representations with Richer Self for Text-to-Image Person Re-Identification

no code implementations17 Oct 2023 Shuanglin Yan, Neng Dong, Jun Liu, Liyan Zhang, Jinhui Tang

Since the support set is unavailable during inference, we propose to distill the knowledge learned by the "richer" model into a lightweight model for inference with a single image/text as input.

Image Retrieval Image-text matching +2

MAC: ModAlity Calibration for Object Detection

no code implementations14 Oct 2023 Yutian Lei, Jun Liu, Dong Huang

The flourishing success of Deep Neural Networks(DNNs) on RGB-input perception tasks has opened unbounded possibilities for non-RGB-input perception tasks, such as object detection from wireless signals, lidar scans, and infrared images.

Object object-detection +1

Harmonic Control Lyapunov Barrier Functions for Constrained Optimal Control with Reach-Avoid Specifications

no code implementations4 Oct 2023 Amartya Mukherjee, Ruikun Zhou, Haocheng Chang, Jun Liu

This paper introduces harmonic control Lyapunov barrier functions (harmonic CLBF) that aid in constrained control problems such as reach-avoid problems.

Revisiting Cephalometric Landmark Detection from the view of Human Pose Estimation with Lightweight Super-Resolution Head

1 code implementation29 Sep 2023 Qian Wu, Si Yong Yeo, Yufei Chen, Jun Liu

Accurate localization of cephalometric landmarks holds great importance in the fields of orthodontics and orthognathics due to its potential for automating key point labeling.

Pose Estimation Quantization +1

Deep Learning Overloaded Vehicle Identification for Long Span Bridges Based on Structural Health Monitoring Data

no code implementations4 Sep 2023 Yuqin Li, Jun Liu, Shengliang Zhong, Licheng Zhou, Shoubin Dong, Zejia Liu, Liqun Tang

In this paper, a deep learning based overloaded vehicle identification approach (DOVI) is proposed, with the purpose of overloaded vehicle identification for long-span bridges by the use of structural health monitoring data.

AI-Generated Content (AIGC) for Various Data Modalities: A Survey

no code implementations27 Aug 2023 Lin Geng Foo, Hossein Rahmani, Jun Liu

Due to its wide range of applications and the demonstrated potential of recent works, AIGC developments have been attracting lots of attention recently, and AIGC methods have been developed for various data modalities, such as image, video, text, 3D shape (as voxels, point clouds, meshes, and neural implicit fields), 3D scene, 3D human avatar (body and head), 3D motion, and audio -- each presenting different characteristics and challenges.

Unsupervised Domain Adaptation via Domain-Adaptive Diffusion

no code implementations26 Aug 2023 Duo Peng, Qiuhong Ke, Yinjie Lei, Jun Liu

Unsupervised Domain Adaptation (UDA) is quite challenging due to the large distribution discrepancy between the source domain and the target domain.

Unsupervised Domain Adaptation

DISGO: Automatic End-to-End Evaluation for Scene Text OCR

no code implementations25 Aug 2023 Mei-Yuh Hwang, Yangyang Shi, Ankit Ramchandani, Guan Pang, Praveen Krishnan, Lucas Kabela, Frank Seide, Samyak Datta, Jun Liu

This paper discusses the challenges of optical character recognition (OCR) on natural scenes, which is harder than OCR on documents due to the wild content and various image backgrounds.

Machine Translation Optical Character Recognition +2

Distribution-Aligned Diffusion for Human Mesh Recovery

no code implementations ICCV 2023 Lin Geng Foo, Jia Gong, Hossein Rahmani, Jun Liu

Inspired by their capability, we explore a diffusion-based approach for human mesh recovery, and propose a Human Mesh Diffusion (HMDiff) framework which frames mesh recovery as a reverse diffusion process.

Denoising Human Mesh Recovery

Diffusion-based Image Translation with Label Guidance for Domain Adaptive Semantic Segmentation

no code implementations ICCV 2023 Duo Peng, Ping Hu, Qiuhong Ke, Jun Liu

Translating images from a source domain to a target domain for learning target models is one of the most common strategies in domain adaptive semantic segmentation (DASS).

Denoising Semantic Segmentation +1

MDCS: More Diverse Experts with Consistency Self-distillation for Long-tailed Recognition

1 code implementation ICCV 2023 QiHao Zhao, Chen Jiang, Wei Hu, Fan Zhang, Jun Liu

In the analysis and ablation study, we demonstrate that our method compared with previous work can effectively increase the diversity of experts, significantly reduce the variance of the model, and improve recognition accuracy.

Long-tail Learning

Learning Gabor Texture Features for Fine-Grained Recognition

no code implementations ICCV 2023 Lanyun Zhu, Tianrun Chen, Jianxiong Yin, Simon See, Jun Liu

We innovatively utilize Gabor filters as a powerful extractor to exploit texture features, motivated by the capability of Gabor filters in effectively capturing multi-frequency features and detailed local information.

M$^3$Net: Multi-view Encoding, Matching, and Fusion for Few-shot Fine-grained Action Recognition

no code implementations6 Aug 2023 Hao Tang, Jun Liu, Shuanglin Yan, Rui Yan, Zechao Li, Jinhui Tang

Due to the scarcity of manually annotated data required for fine-grained video understanding, few-shot fine-grained (FS-FG) action recognition has gained significant attention, with the aim of classifying novel fine-grained action categories with only a few labeled instances.

Decision Making Fine-grained Action Recognition +1

A Theoretically Guaranteed Quaternion Weighted Schatten p-norm Minimization Method for Color Image Restoration

1 code implementation24 Jul 2023 Qing-Hua Zhang, Liang-Tian He, Yi-Lun Wang, Liang-Jian Deng, Jun Liu

Very recently, a quaternion-based WNNM approach (QWNNM) has been developed to mitigate this issue, which is capable of representing the color image as a whole in the quaternion domain and preserving the inherent correlation among the three color channels.

Color Image Denoising Deblurring +2

Robust Visual Question Answering: Datasets, Methods, and Future Challenges

no code implementations21 Jul 2023 Jie Ma, Pinghui Wang, Dechen Kong, Zewei Wang, Jun Liu, Hongbin Pei, Junzhou Zhao

Specifically, we first provide an overview of the development process of datasets from in-distribution and out-of-distribution perspectives.

Question Answering Visual Question Answering

One-Shot Action Recognition via Multi-Scale Spatial-Temporal Skeleton Matching

no code implementations14 Jul 2023 Siyuan Yang, Jun Liu, Shijian Lu, Er Meng Hwa, Alex C. Kot

The first is multi-scale matching which captures the scale-wise semantic relevance of skeleton data at multiple spatial and temporal scales simultaneously.

Action Recognition

Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation and Beyond

1 code implementation16 Jun 2023 Fangzhi Xu, Qika Lin, Jiawei Han, Tianzhe Zhao, Jun Liu, Erik Cambria

Firstly, to offer systematic evaluations, we select fifteen typical logical reasoning datasets and organize them into deductive, inductive, abductive and mixed-form reasoning settings.

Benchmarking Evidence Selection +2

Artificial General Intelligence for Medical Imaging

no code implementations8 Jun 2023 Xiang Li, Lu Zhang, Zihao Wu, Zhengliang Liu, Lin Zhao, Yixuan Yuan, Jun Liu, Gang Li, Dajiang Zhu, Pingkun Yan, Quanzheng Li, Wei Liu, Tianming Liu, Dinggang Shen

In this review, we explore the potential applications of Artificial General Intelligence (AGI) models in healthcare, focusing on foundational Large Language Models (LLMs), Large Vision Models, and Large Multimodal Models.

Actor-Critic Methods using Physics-Informed Neural Networks: Control of a 1D PDE Model for Fluid-Cooled Battery Packs

1 code implementation18 May 2023 Amartya Mukherjee, Jun Liu

The Hamilton-Jacobi-Bellman (HJB) equation is a PDE that evaluates the optimality of the value function and determines an optimal controller.

Adaptive loose optimization for robust question answering

1 code implementation6 May 2023 Jie Ma, Pinghui Wang, Zewei Wang, Dechen Kong, Min Hu, Ting Han, Jun Liu

Question answering methods are well-known for leveraging data bias, such as the language prior in visual question answering and the position bias in machine reading comprehension (extractive question answering).

Extractive Question-Answering Machine Reading Comprehension +2

Self-Supervised 3D Action Representation Learning with Skeleton Cloud Colorization

no code implementations18 Apr 2023 Siyuan Yang, Jun Liu, Shijian Lu, Er Meng Hwa, Yongjian Hu, Alex C. Kot

We investigate self-supervised representation learning and design a novel skeleton cloud colorization technique that is capable of learning spatial and temporal skeleton representations from unlabeled skeleton sequence data.

Colorization Representation Learning +2

Meta Compositional Referring Expression Segmentation

no code implementations CVPR 2023 Li Xu, Mark He Huang, Xindi Shang, Zehuan Yuan, Ying Sun, Jun Liu

Then, following a novel meta optimization scheme to optimize the model to obtain good testing performance on the virtual testing sets after training on the virtual training set, our framework can effectively drive the model to better capture semantics and visual representations of individual concepts, and thus obtain robust generalization performance even when handling novel compositions.

Meta-Learning Referring Expression +2

Token Boosting for Robust Self-Supervised Visual Transformer Pre-training

no code implementations CVPR 2023 Tianjiao Li, Lin Geng Foo, Ping Hu, Xindi Shang, Hossein Rahmani, Zehuan Yuan, Jun Liu

Pre-training VTs on such corrupted data can be challenging, especially when we pre-train via the masked autoencoding approach, where both the inputs and masked ``ground truth" targets can potentially be unreliable in this case.

Progressive Channel-Shrinking Network

no code implementations1 Apr 2023 Jianhong Pan, Siyuan Yang, Lin Geng Foo, Qiuhong Ke, Hossein Rahmani, Zhipeng Fan, Jun Liu

Currently, salience-based channel pruning makes continuous breakthroughs in network compression.

GradMDM: Adversarial Attack on Dynamic Networks

no code implementations1 Apr 2023 Jianhong Pan, Lin Geng Foo, Qichen Zheng, Zhipeng Fan, Hossein Rahmani, Qiuhong Ke, Jun Liu

Dynamic neural networks can greatly reduce computation redundancy without compromising accuracy by adapting their structures based on the input.

Adversarial Attack

Optimal Mixed-ADC arrangement for DOA Estimation via CRB using ULA

no code implementations27 Mar 2023 Xinnan Zhang, Yuanbo Cheng, Xiaolei Shang, Jun Liu

Simulation results show the validity of the asymptotic CRB and better performance under the optimal mixed-precision arrangement.

SpatialFormer: Semantic and Target Aware Attentions for Few-Shot Learning

1 code implementation15 Mar 2023 Jinxiang Lai, Siqian Yang, Wenlong Wu, Tao Wu, Guannan Jiang, Xi Wang, Jun Liu, Bin-Bin Gao, Wei zhang, Yuan Xie, Chengjie Wang

Then we derive two specific attention modules, named SpatialFormer Semantic Attention (SFSA) and SpatialFormer Target Attention (SFTA), to enhance the target object regions while reduce the background distraction.

Few-Shot Learning

Precise Facial Landmark Detection by Reference Heatmap Transformer

no code implementations14 Mar 2023 Jun Wan, Jun Liu, Jie zhou, Zhihui Lai, Linlin Shen, Hang Sun, Ping Xiong, Wenwen Min

Most facial landmark detection methods predict landmarks by mapping the input facial appearance features to landmark heatmaps and have achieved promising results.

Facial Landmark Detection

Robustly Complete Finite-State Abstractions for Control Synthesis of Stochastic Systems

no code implementations8 Mar 2023 Yiming Meng, Jun Liu

The essential step of abstraction-based control synthesis for nonlinear systems to satisfy a given specification is to obtain a finite-state abstraction of the original systems.

Almost Sure Saddle Avoidance of Stochastic Gradient Methods without the Bounded Gradient Assumption

no code implementations15 Feb 2023 Jun Liu, Ye Yuan

We prove that various stochastic gradient descent methods, including the stochastic gradient descent (SGD), stochastic heavy-ball (SHB), and stochastic Nesterov's accelerated gradient (SNAG) methods, almost surely avoid any strict saddle manifold.

Generalized Video Anomaly Event Detection: Systematic Taxonomy and Comparison of Deep Models

1 code implementation10 Feb 2023 Yang Liu, Dingkang Yang, Yan Wang, Jing Liu, Jun Liu, Azzedine Boukerche, Peng Sun, Liang Song

Video Anomaly Detection (VAD) serves as a pivotal technology in the intelligent surveillance systems, enabling the temporal or spatial identification of anomalous events within videos.

Anomaly Detection Event Detection +1

Bridging Physics-Informed Neural Networks with Reinforcement Learning: Hamilton-Jacobi-Bellman Proximal Policy Optimization (HJBPPO)

no code implementations1 Feb 2023 Amartya Mukherjee, Jun Liu

The Proximal Policy Optimization (PPO)-Clipped algorithm is improvised with this implementation as it uses a value network to compute the objective function for its policy network.

reinforcement-learning Reinforcement Learning (RL)

Modeling Uncertain Feature Representation for Domain Generalization

1 code implementation16 Jan 2023 Xiaotong Li, Zixuan Hu, Jun Liu, Yixiao Ge, Yongxing Dai, Ling-Yu Duan

In this paper, we improve the network generalization ability by modeling domain shifts with uncertainty (DSU), i. e., characterizing the feature statistics as uncertain distributions during training.

Domain Generalization Image Classification +3

STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition

no code implementations ICCV 2023 Ming Li, Xiangyu Xu, Hehe Fan, Pan Zhou, Jun Liu, Jia-Wei Liu, Jiahe Li, Jussi Keppo, Mike Zheng Shou, Shuicheng Yan

For the first time, we introduce vision Transformers into PPAR by treating a video as a tubelet sequence, and accordingly design two complementary mechanisms, i. e., sparsification and anonymization, to remove privacy from a spatio-temporal perspective.

Action Recognition Facial Expression Recognition (FER) +2

Rethinking Gradient Projection Continual Learning: Stability / Plasticity Feature Space Decoupling

no code implementations CVPR 2023 Zhen Zhao, Zhizhong Zhang, Xin Tan, Jun Liu, Yanyun Qu, Yuan Xie, Lizhuang Ma

In this paper, we propose a space decoupling (SD) algorithm to decouple the feature space into a pair of complementary subspaces, i. e., the stability space I, and the plasticity space R. I is established by conducting space intersection between the historic and current feature space, and thus I contains more task-shared bases.

Continual Learning

Bi-Directional Feature Fusion Generative Adversarial Network for Ultra-High Resolution Pathological Image Virtual Re-Staining

no code implementations CVPR 2023 Kexin Sun, Zhineng Chen, Gongwei Wang, Jun Liu, Xiongjun Ye, Yu-Gang Jiang

In order to eliminate the square effect, we design a bi-directional feature fusion generative adversarial network (BFF-GAN) with a global branch and a local branch.

Generative Adversarial Network

A Characteristic Function-Based Method for Bottom-Up Human Pose Estimation

no code implementations CVPR 2023 Haoxuan Qu, Yujun Cai, Lin Geng Foo, Ajay Kumar, Jun Liu

Therefore, via minimizing the distance between the two characteristic functions, we can optimize the model to provide a more accurate localization result for the body joints in different sub-regions of the predicted heatmap.

Pose Estimation

Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events

1 code implementation ICCV 2023 Kian Eng Ong, Xun Long Ng, Yanchao Li, Wenjie Ai, Kuangyi Zhao, Si Yong Yeo, Jun Liu

Understanding and analyzing human behaviors (actions and interactions of people), voices, and sounds in chaotic events is crucial in many applications, e. g., crowd management, emergency response services.

Action Localization Pathfinder

Unified Pose Sequence Modeling

no code implementations CVPR 2023 Lin Geng Foo, Tianjiao Li, Hossein Rahmani, Qiuhong Ke, Jun Liu

We propose a Unified Pose Sequence Modeling approach to unify heterogeneous human behavior understanding tasks based on pose data, e. g., action recognition, 3D pose estimation and 3D early action prediction.

3D Pose Estimation Action Recognition +1

Heterogeneous Diversity Driven Active Learning for Multi-Object Tracking

no code implementations ICCV 2023 Rui Li, Baopeng Zhang, Jun Liu, Wei Liu, Jian Zhao, Zhu Teng

HD-AMOT defines the diversified informative representation by encoding the geometric and semantic information, and formulates the frame inference strategy as a Markov decision process to learn an optimal sampling policy based on the designed informative representation.

Active Learning Multi-Object Tracking

GPTR: Gestalt-Perception Transformer for Diagram Object Detection

no code implementations29 Dec 2022 Xin Hu, Lingling Zhang, Jun Liu, Jinfu Fan, Yang You, Yaqiang Wu

These lead to the fact that traditional data-driven detection model is not suitable for diagrams.

Object object-detection +2

HDNet: A Hierarchically Decoupled Network for Crowd Counting

no code implementations12 Dec 2022 Chenliang Gu, Changan Wang, Bin-Bin Gao, Jun Liu, Tianliang Zhang

Recently, density map regression-based methods have dominated in crowd counting owing to their excellent fitting ability on density distribution.

Crowd Counting Density Estimation +1

DiffPose: Toward More Reliable 3D Pose Estimation

1 code implementation CVPR 2023 Jia Gong, Lin Geng Foo, Zhipeng Fan, Qiuhong Ke, Hossein Rahmani, Jun Liu

Monocular 3D human pose estimation is quite challenging due to the inherent ambiguity and occlusion, which often lead to high uncertainty and indeterminacy.

3D Pose Estimation Monocular 3D Human Pose Estimation

Global Meets Local: Effective Multi-Label Image Classification via Category-Aware Weak Supervision

no code implementations23 Nov 2022 Jiawei Zhan, Jun Liu, Wei Tang, Guannan Jiang, Xi Wang, Bin-Bin Gao, Tianliang Zhang, Wenlong Wu, Wei zhang, Chengjie Wang, Yuan Xie

This paper builds a unified framework to perform effective noisy-proposal suppression and to interact between global and local features for robust feature learning.

Feature Correlation Multi-Label Image Classification

Rethinking the Metric in Few-shot Learning: From an Adaptive Multi-Distance Perspective

no code implementations2 Nov 2022 Jinxiang Lai, Siqian Yang, Guannan Jiang, Xi Wang, Yuxi Li, Zihui Jia, Xiaochen Chen, Jun Liu, Bin-Bin Gao, Wei zhang, Yuan Xie, Chengjie Wang

In this paper, for the first time, we investigate the contributions of different distance metrics, and propose an adaptive fusion scheme, bringing significant improvements in few-shot classification.

Few-Shot Learning

tSF: Transformer-based Semantic Filter for Few-Shot Learning

1 code implementation2 Nov 2022 Jinxiang Lai, Siqian Yang, Wenlong Liu, Yi Zeng, Zhongyi Huang, Wenlong Wu, Jun Liu, Bin-Bin Gao, Chengjie Wang

Few-Shot Learning (FSL) alleviates the data shortage challenge via embedding discriminative target-aware features among plenty seen (base) and few unseen (novel) labeled samples.

Few-Shot Learning object-detection +1

Improving the Reliability for Confidence Estimation

no code implementations13 Oct 2022 Haoxuan Qu, Yanchao Li, Lin Geng Foo, Jason Kuen, Jiuxiang Gu, Jun Liu

Confidence estimation, a task that aims to evaluate the trustworthiness of the model's prediction output during deployment, has received lots of research attention recently, due to its importance for the safe deployment of deep models.

Image Classification Meta-Learning +1

Uncertainty-Aware Unsupervised Image Deblurring with Deep Residual Prior

no code implementations CVPR 2023 Xiaole Tang, XiLe Zhao, Jun Liu, Jianli Wang, Yuchun Miao, Tieyong Zeng

To address this challenge, we suggest a dataset-free deep residual prior for the kernel induced error (termed as residual) expressed by a customized untrained deep neural network, which allows us to flexibly adapt to different blurs and images in real scenarios.

Deblurring Image Deblurring

Heatmap Distribution Matching for Human Pose Estimation

no code implementations3 Oct 2022 Haoxuan Qu, Li Xu, Yujun Cai, Lin Geng Foo, Jun Liu

In this paper, we show that optimizing the heatmap prediction in such a way, the model performance of body joint localization, which is the intrinsic objective of this task, may not be consistently improved during the optimization process of the heatmap prediction.

2D Human Pose Estimation Pose Estimation

Downlink Compression Improves TopK Sparsification

no code implementations30 Sep 2022 William Zou, Hans De Sterck, Jun Liu

One of the largest bottlenecks in distributed training is communicating gradients across different nodes.

Multiple Control Barrier Functions: An Application to Reactive Obstacle Avoidance for a Multi-steering Tractor-trailer System

no code implementations12 Sep 2022 Mohammad Aali, Jun Liu

We develop a control structure based on a multiple CBFs scheme for a multi-steering tractor-trailer system to ensure a collision-free maneuver for both the tractor and trailer in the presence of several obstacles.

Model Predictive Control

nVFNet-RDC: Replay and Non-Local Distillation Collaboration for Continual Object Detection

no code implementations8 Sep 2022 Jinxiang Lai, Wenlong Liu, Jun Liu

Continual Learning (CL) focuses on developing algorithms with the ability to adapt to new environments and learn new skills.

Continual Learning object-detection +1

Dynamic Spatio-Temporal Specialization Learning for Fine-Grained Action Recognition

no code implementations3 Sep 2022 Tianjiao Li, Lin Geng Foo, Qiuhong Ke, Hossein Rahmani, Anran Wang, Jinghua Wang, Jun Liu

We design a novel Dynamic Spatio-Temporal Specialization (DSTS) module, which consists of specialized neurons that are only activated for a subset of samples that are highly similar.

Fine-grained Action Recognition

Incremental Few-Shot Semantic Segmentation via Embedding Adaptive-Update and Hyper-class Representation

no code implementations26 Jul 2022 Guangchen Shi, Yirui Wu, Jun Liu, Shaohua Wan, Wenhai Wang, Tong Lu

Second, to resist overfitting issues caused by few training samples, a hyper-class embedding is learned by clustering all category embeddings for initialization and aligned with category embedding of the new class for enhancement, where learned knowledge assists to learn new knowledge, thus alleviating performance dependence on training data scale.

Few-Shot Semantic Segmentation Segmentation +1

Meta Spatio-Temporal Debiasing for Video Scene Graph Generation

no code implementations23 Jul 2022 Li Xu, Haoxuan Qu, Jason Kuen, Jiuxiang Gu, Jun Liu

Video scene graph generation (VidSGG) aims to parse the video content into scene graphs, which involves modeling the spatio-temporal contextual information in the video.

Graph Generation Meta-Learning +2

ERA: Expert Retrieval and Assembly for Early Action Prediction

no code implementations20 Jul 2022 Lin Geng Foo, Tianjiao Li, Hossein Rahmani, Qiuhong Ke, Jun Liu

Early action prediction aims to successfully predict the class label of an action before it is completely performed.

Early Action Prediction Retrieval

Position-prior Clustering-based Self-attention Module for Knee Cartilage Segmentation

1 code implementation21 Jun 2022 Dong Liang, Jun Liu, Kuanquan Wang, Gongning Luo, Wei Wang, Shuo Li

The morphological changes in knee cartilage (especially femoral and tibial cartilages) are closely related to the progression of knee osteoarthritis, which is expressed by magnetic resonance (MR) images and assessed on the cartilage segmentation results.

Clustering Position +1

Neural Lyapunov Control of Unknown Nonlinear Systems with Stability Guarantees

1 code implementation4 Jun 2022 Ruikun Zhou, Thanin Quartz, Hans De Sterck, Jun Liu

This paper proposes a learning framework to simultaneously stabilize an unknown nonlinear system with a neural controller and learn a neural Lyapunov function to certify a region of attraction (ROA) for the closed-loop system.

valid

Data-Driven Learning of Safety-Critical Control with Stochastic Control Barrier Functions

no code implementations22 May 2022 Chuanzheng Wang, Yiming Meng, Stephen L. Smith, Jun Liu

More specifically, we propose a data-driven stochastic control barrier function (DDSCBF) framework and use supervised learning to learn the unknown stochastic dynamics via the DDSCBF scheme.

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

2 code implementations11 May 2022 Yawei Li, Kai Zhang, Radu Timofte, Luc van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Yanbo Wang, Xiaozhong Ji, Chuming Lin, Donghao Luo, Ying Tai, Chengjie Wang, Zhizhong Zhang, Yuan Xie, Shen Cheng, Ziwei Luo, Lei Yu, Zhihong Wen, Qi Wu1, Youwei Li, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Yuanfei Huang, Meiguang Jin, Hua Huang, Jing Liu, Xinjian Zhang, Yan Wang, Lingshun Long, Gen Li, Yuanfan Zhang, Zuowei Cao, Lei Sun, Panaetov Alexander, Yucong Wang, Minjie Cai, Li Wang, Lu Tian, Zheyuan Wang, Hongbing Ma, Jie Liu, Chao Chen, Yidong Cai, Jie Tang, Gangshan Wu, Weiran Wang, Shirui Huang, Honglei Lu, Huan Liu, Keyan Wang, Jun Chen, Shi Chen, Yuchun Miao, Zimo Huang, Lefei Zhang, Mustafa Ayazoğlu, Wei Xiong, Chengyi Xiong, Fei Wang, Hao Li, Ruimian Wen, Zhijing Yang, Wenbin Zou, Weixin Zheng, Tian Ye, Yuncheng Zhang, Xiangzhen Kong, Aditya Arora, Syed Waqas Zamir, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Dandan Gaoand Dengwen Zhouand Qian Ning, Jingzhu Tang, Han Huang, YuFei Wang, Zhangheng Peng, Haobo Li, Wenxue Guan, Shenghua Gong, Xin Li, Jun Liu, Wanjun Wang, Dengwen Zhou, Kun Zeng, Hanjiang Lin, Xinyu Chen, Jinsheng Fang

The aim was to design a network for single image super-resolution that achieved improvement of efficiency measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption while at least maintaining the PSNR of 29. 00dB on DIV2K validation set.

Image Super-Resolution

Surface Representation for Point Clouds

1 code implementation CVPR 2022 Haoxi Ran, Jun Liu, Chengjie Wang

Based on a simple baseline of PointNet++ (SSG version), Umbrella RepSurf surpasses the previous state-of-the-art by a large margin for classification, segmentation and detection on various benchmarks in terms of performance and efficiency.

3D Object Detection 3D Point Cloud Classification +2

Deeply Supervised Skin Lesions Diagnosis with Stage and Branch Attention

2 code implementations9 May 2022 Wei Dai, Rui Liu, Tianyi Wu, Min Wang, Jianqin Yin, Jun Liu

Visual features of skin lesions vary significantly because the images are collected from patients with different lesion colours and morphologies by using dissimilar imaging equipment.

Classification

Logiformer: A Two-Branch Graph Transformer Network for Interpretable Logical Reasoning

1 code implementation2 May 2022 Fangzhi Xu, Jun Liu, Qika Lin, Yudai Pan, Lingling Zhang

Firstly, we introduce different extraction strategies to split the text into two sets of logical units, and construct the logical graph and the syntax graph respectively.

Logical Reasoning Machine Reading Comprehension +1

Animal Kingdom: A Large and Diverse Dataset for Animal Behavior Understanding

1 code implementation CVPR 2022 Xun Long Ng, Kian Eng Ong, Qichen Zheng, Yun Ni, Si Yong Yeo, Jun Liu

More specifically, our dataset contains 50 hours of annotated videos to localize relevant animal behavior segments in long videos for the video grounding task, 30K video sequences for the fine-grained multi-label action recognition task, and 33K frames for the pose estimation task, which correspond to a diverse range of animals with 850 species across 6 major animal classes.

Animal Action Recognition Animal Pose Estimation +1

Prompt-based Generative Approach towards Multi-Hierarchical Medical Dialogue State Tracking

no code implementations18 Mar 2022 Jun Liu, Tong Ruan, Haofen Wang, Huanhuan Zhang

The dialogue state tracking (DST) module in the medical dialogue system which interprets utterances into the machine-readable structure for downstream tasks is particularly challenging.

Dialogue State Tracking

Cluster Head Detection for Hierarchical UAV Swarm With Graph Self-supervised Learning

no code implementations8 Mar 2022 Zhiyu Mou, Jun Liu, Xiang Yun, Feifei Gao, Qihui Wu

We first propose a graph attention self-supervised learning algorithm (GASSL) to detect the HUAVs of a single UAV cluster, where the GASSL can fit the IFS at the same time.

Clustering Graph Attention +3

Bridging the Source-to-target Gap for Cross-domain Person Re-Identification with Intermediate Domains

1 code implementation3 Mar 2022 Yongxing Dai, Yifan Sun, Jun Liu, Zekun Tong, Yi Yang, Ling-Yu Duan

Instead of directly aligning the source and target domains against each other, we propose to align the source and target domains against their intermediate domains for a smooth knowledge transfer.

Domain Generalization Person Re-Identification +1

Towards Class-agnostic Tracking Using Feature Decorrelation in Point Clouds

no code implementations28 Feb 2022 Shengjing Tian, Jun Liu, Xiuping Liu

In this work, we investigate a more challenging task in the LiDAR point clouds, class-agnostic tracking, where a general model is supposed to be learned for any specified targets of both observed and unseen categories.

Benchmarking Object Tracking

Theoretical Analysis of Deep Neural Networks in Physical Layer Communication

no code implementations21 Feb 2022 Jun Liu, Haitao Zhao, Dongtang Ma, Kai Mei, Jibo Wei

In this paper, we aim to quantitatively analyze why DNNs can achieve comparable performance in the physical layer comparing with traditional techniques, and also drive their cost in terms of computational complexity.

Intelligent Communication

On Almost Sure Convergence Rates of Stochastic Gradient Methods

no code implementations9 Feb 2022 Jun Liu, Ye Yuan

We further provide last-iterate almost sure convergence rates analysis for stochastic gradient methods on weakly convex smooth functions, in contrast with most existing results in the literature that only provide convergence in expectation for a weighted average of the iterates.

Uncertainty Modeling for Out-of-Distribution Generalization

1 code implementation ICLR 2022 Xiaotong Li, Yongxing Dai, Yixiao Ge, Jun Liu, Ying Shan, Ling-Yu Duan

In this paper, we improve the network generalization ability by modeling the uncertainty of domain shifts with synthesized feature statistics during training.

Image Classification Out-of-Distribution Generalization +2

En-Compactness: Self-Distillation Embedding & Contrastive Generation for Generalized Zero-Shot Learning

no code implementations CVPR 2022 Xia Kong, Zuodong Gao, Xiaofan Li, Ming Hong, Jun Liu, Chengjie Wang, Yuan Xie, Yanyun Qu

Our ICCE promotes intra-class compactness with inter-class separability on both seen and unseen classes in the embedding space and visual feature space.

Generalized Zero-Shot Learning

Meta Agent Teaming Active Learning for Pose Estimation

no code implementations CVPR 2022 Jia Gong, Zhipeng Fan, Qiuhong Ke, Hossein Rahmani, Jun Liu

The existing pose estimation approaches often require a large number of annotated images to attain good estimation performance, which are laborious to acquire.

Active Learning Pose Estimation

Under-Approximate Reachability Analysis for a Class of Linear Systems with Inputs

no code implementations27 Dec 2021 Mohamed Serry, Jun Liu

Under-approximations of reachable sets and tubes have been receiving growing research attention due to their important roles in control synthesis and verification.

3D Skeleton-based Few-shot Action Recognition with JEANIE is not so Naïve

no code implementations23 Dec 2021 Lei Wang, Jun Liu, Piotr Koniusz

In this paper, we propose a Few-shot Learning pipeline for 3D skeleton-based action recognition by Joint tEmporal and cAmera viewpoiNt alIgnmEnt (JEANIE).

Dynamic Time Warping Few-Shot action recognition +3

Predicting Axillary Lymph Node Metastasis in Early Breast Cancer Using Deep Learning on Primary Tumor Biopsy Slides

1 code implementation4 Dec 2021 Feng Xu, Chuang Zhu, Wenqi Tang, Ying Wang, Yu Zhang, Jie Li, Hongchuan Jiang, Zhongyue Shi, Jun Liu, Mulan Jin

Conclusion: Our study provides a novel DL-based biomarker on primary tumor CNB slides to predict the metastatic status of ALN preoperatively for patients with EBC.

Multiple Instance Learning Specificity +1

Learning First-Order Rules with Relational Path Contrast for Inductive Relation Reasoning

no code implementations17 Oct 2021 Yudai Pan, Jun Liu, Lingling Zhang, Xin Hu, Tianzhe Zhao, Qika Lin

Relation reasoning in knowledge graphs (KGs) aims at predicting missing relations in incomplete triples, whereas the dominant paradigm is learning the embeddings of relations and entities, which is limited to a transductive setting and has restriction on processing unseen entities in an inductive situation.

Knowledge Graphs Relation

Recent Advances of Continual Learning in Computer Vision: An Overview

no code implementations23 Sep 2021 Haoxuan Qu, Hossein Rahmani, Li Xu, Bryan Williams, Jun Liu

In contrast to batch learning where all training data is available at once, continual learning represents a family of methods that accumulate knowledge and learn continuously with data available in sequential order.

Continual Learning Knowledge Distillation

Cross-Site Severity Assessment of COVID-19 from CT Images via Domain Adaptation

no code implementations8 Sep 2021 Geng-Xin Xu, Chen Liu, Jun Liu, Zhongxiang Ding, Feng Shi, Man Guo, Wei Zhao, Xiaoming Li, Ying WEI, Yaozong Gao, Chuan-Xian Ren, Dinggang Shen

Particularly, we propose a domain translator and align the heterogeneous data to the estimated class prototypes (i. e., class centers) in a hyper-sphere manifold.

Computed Tomography (CT) Domain Adaptation +1

RiWNet: A moving object instance segmentation Network being Robust in adverse Weather conditions

no code implementations4 Sep 2021 Chenjie Wang, Chengyuan Li, Bin Luo, Wei Wang, Jun Liu

Then we extend SOLOV2 to capture temporal information in video to learn motion information, and propose a moving object instance segmentation network with RiWFPN called RiWNet.

Instance Segmentation Segmentation +1

Learning Inner-Group Relations on Point Clouds

1 code implementation ICCV 2021 Haoxi Ran, Wei Zhuo, Jun Liu, Li Lu

We further verify the expandability of RPNet, in terms of both depth and width, on the tasks of classification and segmentation.

3D Classification 3D Point Cloud Classification +4

Seirios: Leveraging Multiple Channels for LoRaWAN Indoor and Outdoor Localization

no code implementations16 Aug 2021 Jun Liu, Jiayao Gao, Sanjay Jha, Wen Hu

By exploiting both the original and the conjugate of the physical layer, Seirios can resolve the direct path from multiple reflectors in both indoor and outdoor environments.

Outdoor Localization Super-Resolution

Dual-Tuning: Joint Prototype Transfer and Structure Regularization for Compatible Feature Learning

1 code implementation6 Aug 2021 Yan Bai, Jile Jiao, Shengsen Wu, Yihang Lou, Jun Liu, Xuetao Feng, Ling-Yu Duan

It is a heavy workload to re-extract features of the whole database every time. Feature compatibility enables the learned new visual features to be directly compared with the old features stored in the database.

Retrieval

Global and Local Texture Randomization for Synthetic-to-Real Semantic Segmentation

no code implementations5 Aug 2021 Duo Peng, Yinjie Lei, Lingqiao Liu, Pingping Zhang, Jun Liu

In this work, we propose two simple yet effective texture randomization mechanisms, Global Texture Randomization (GTR) and Local Texture Randomization (LTR), for Domain Generalization based SRSS.

Domain Generalization Segmentation +1

IDM: An Intermediate Domain Module for Domain Adaptive Person Re-ID

3 code implementations ICCV 2021 Yongxing Dai, Jun Liu, Yifan Sun, Zekun Tong, Chi Zhang, Ling-Yu Duan

To ensure these two properties to better characterize appropriate intermediate domains, we enforce the bridge losses on intermediate domains' prediction space and feature space, and enforce a diversity loss on the two domain factors.

Domain Adaptive Person Re-Identification Person Re-Identification

Skeleton Cloud Colorization for Unsupervised 3D Action Representation Learning

no code implementations ICCV 2021 Siyuan Yang, Jun Liu, Shijian Lu, Meng Hwa Er, Alex C. Kot

We investigate unsupervised representation learning for skeleton action recognition, and design a novel skeleton cloud colorization technique that is capable of learning skeleton representations from unlabeled skeleton sequence data.

3D Action Recognition Colorization +1

DeceFL: A Principled Decentralized Federated Learning Framework

1 code implementation15 Jul 2021 Ye Yuan, Jun Liu, Dou Jin, Zuogong Yue, Ruijuan Chen, Maolin Wang, Chuan Sun, Lei Xu, Feng Hua, Xin He, Xinlei Yi, Tao Yang, Hai-Tao Zhang, Shaochun Sui, Han Ding

Although there has been a joint effort in tackling such a critical issue by proposing privacy-preserving machine learning frameworks, such as federated learning, most state-of-the-art frameworks are built still in a centralized way, in which a central client is needed for collecting and distributing model information (instead of data itself) from every other client, leading to high communication pressure and high vulnerability when there exists a failure at or attack on the central client.

Federated Learning Privacy Preserving

A Low Complexity Learning-based Channel Estimation for OFDM Systems with Online Training

no code implementations14 Jul 2021 Kai Mei, Jun Liu, Xiaoying Zhang, Kuo Cao, Nandana Rajatheva, Jibo Wei

Besides, a training data construction approach utilizing least square (LS) estimation results is proposed so that the training data can be collected during the data transmission.

BIG-bench Machine Learning

Resilient UAV Swarm Communications with Graph Convolutional Neural Network

1 code implementation30 Jun 2021 Zhiyu Mou, Feifei Gao, Jun Liu, Qihui Wu

Numerical results show that the proposed algorithms can rebuild the communication connectivity of the USNET more quickly than the existing algorithms under both one-off UEDs and general UEDs.

Meta-Learning Trajectory Planning

Interventional Video Grounding with Dual Contrastive Learning

1 code implementation CVPR 2021 Guoshun Nan, Rui Qiao, Yao Xiao, Jun Liu, Sicong Leng, Hao Zhang, Wei Lu

2) Meanwhile, we introduce a dual contrastive learning approach (DCL) to better align the text and video by maximizing the mutual information (MI) between query and video clips, and the MI between start/end frames of a target moment and the others within a video to learn more informative visual representations.

Causal Inference Contrastive Learning +2

QFCNN: Quantum Fourier Convolutional Neural Network

no code implementations19 Jun 2021 Feihong Shen, Jun Liu

The neural network and quantum computing are both significant and appealing fields, with their interactive disciplines promising for large-scale computing tasks that are untackled by conventional computers.

Image Classification Traffic Prediction

Conterfactual Generative Zero-Shot Semantic Segmentation

no code implementations11 Jun 2021 Feihong Shen, Jun Liu, Ping Hu

In this work, we consider counterfactual methods to avoid the confounder in the original model.

Causal Inference counterfactual +4

Knowledge-aware Deep Framework for Collaborative Skin Lesion Segmentation and Melanoma Recognition

no code implementations7 Jun 2021 XiaoHong Wang, Xudong Jiang, Henghui Ding, Yuqian Zhao, Jun Liu

In this paper, we propose a novel knowledge-aware deep framework that incorporates some clinical knowledge into collaborative learning of two important melanoma diagnosis tasks, i. e., skin lesion segmentation and melanoma recognition.

Clinical Knowledge Lesion Segmentation +3

Opening the Black Box of Deep Neural Networks in Physical Layer Communication

no code implementations2 Jun 2021 Jun Liu, Haitao Zhao, Dongtang Ma, Kai Mei, Jibo Wei

Deep Neural Network (DNN)-based physical layer techniques are attracting considerable interest due to their potential to enhance communication systems.

Generalizable Person Re-identification with Relevance-aware Mixture of Experts

no code implementations CVPR 2021 Yongxing Dai, Xiaotong Li, Jun Liu, Zekun Tong, Ling-Yu Duan

Specifically, we propose a decorrelation loss to make the source domain networks (experts) keep the diversity and discriminability of individual domains' characteristics.

Generalizable Person Re-identification

Safety-Critical Control of Stochastic Systems using Stochastic Control Barrier Functions

no code implementations6 Apr 2021 Chuanzheng Wang, Yiming Meng, Stephen L. Smith, Jun Liu

We propose a notion of stochastic control barrier functions (SCBFs)and show that SCBFs can significantly reduce the control efforts, especially in the presence of noise, compared to stochastic reciprocal control barrier functions (SRCBFs), and offer a less conservative estimation of safety probability, compared to stochastic zeroing control barrier functions (SZCBFs).

A Specification-Guided Framework for Temporal Logic Control of Nonlinear Systems

no code implementations3 Apr 2021 Yinan Li, Zhibing Sun, Jun Liu

We show that the proposed algorithm is sound for full LTL specifications, and robustly complete for specifications recognizable by deterministic B\"uchi automata (DBA), the latter in the sense that control strategies can be found whenever the given specification can be satisfied with additional bounded disturbances.

Motion Planning

UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicles

2 code implementations CVPR 2021 Tianjiao Li, Jun Liu, Wei zhang, Yun Ni, Wenqian Wang, Zhiheng Li

Human behavior understanding with unmanned aerial vehicles (UAVs) is of great significance for a wide range of applications, which simultaneously brings an urgent demand of large, challenging, and comprehensive benchmarks for the development and evaluation of UAV-based models.

Action Recognition Attribute +3

Rank-One Prior: Toward Real-Time Scene Recovery

no code implementations CVPR 2021 Jun Liu, Ryan Wen Liu, Jianing Sun, Tieyong Zeng

To improve visual quality under different weather/imaging conditions, we propose a real-time light correction method to recover the degraded scenes in the cases of sandstorms, underwater, and haze.

Autonomous Vehicles

SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events

3 code implementations CVPR 2021 Li Xu, He Huang, Jun Liu

In this paper, we create a novel dataset, SUTD-TrafficQA (Traffic Question Answering), which takes the form of video QA based on the collected 10, 080 in-the-wild videos and annotated 62, 535 QA pairs, for benchmarking the cognitive capability of causal inference and event understanding models in complex traffic scenarios.

Autonomous Vehicles Benchmarking +4

Constrained Radar Waveform Design for Range Profiling

no code implementations18 Mar 2021 Bo Tang, Jun Liu, Hai Wang, Yihua Hu

Range profiling refers to the measurement of target response along the radar slant range.

Radar waveform design

RL-CSDia: Representation Learning of Computer Science Diagrams

no code implementations10 Mar 2021 Shaowei Wang, Lingling Zhang, Xuan Luo, Yi Yang, Xin Hu, Jun Liu

Another type of diagrams such as from Computer Science is composed of graphics containing complex topologies and relations, and research on this type of diagrams is still blank.

Question Answering Representation Learning +1

A Unified 3D Human Motion Synthesis Model via Conditional Variational Auto-Encoder

no code implementations ICCV 2021 Yujun Cai, Yiwei Wang, Yiheng Zhu, Tat-Jen Cham, Jianfei Cai, Junsong Yuan, Jun Liu, Chuanxia Zheng, Sijie Yan, Henghui Ding, Xiaohui Shen, Ding Liu, Nadia Magnenat Thalmann

Notably, by considering this problem as a conditional generation process, we estimate a parametric distribution of the missing regions based on the input conditions, from which to sample and synthesize the full motion series.

motion prediction Motion Synthesis

Interaction via Bi-Directional Graph of Semantic Region Affinity for Scene Parsing

no code implementations ICCV 2021 Henghui Ding, HUI ZHANG, Jun Liu, Jiaxin Li, Zijian Feng, Xudong Jiang

In this work, we treat each respective region in an image as a whole, and capture the structure topology as well as the affinity among different regions.

Scene Parsing

Motion Adaptive Pose Estimation From Compressed Videos

no code implementations ICCV 2021 Zhipeng Fan, Jun Liu, Yao Wang

A novel model, called Motion Adaptive Pose Net is proposed to exploit the compressed streams to efficiently decode pose sequences from videos.

Motion Compensation Pose Estimation

Dual-Refinement: Joint Label and Feature Refinement for Unsupervised Domain Adaptive Person Re-Identification

1 code implementation26 Dec 2020 Yongxing Dai, Jun Liu, Yan Bai, Zekun Tong, Ling-Yu Duan

To this end, we propose a novel approach, called Dual-Refinement, that jointly refines pseudo labels at the off-line clustering phase and features at the on-line training phase, to alternatively boost the label purity and feature discriminability in the target domain for more reliable re-ID.

Clustering Domain Adaptive Person Re-Identification +1

Object Detection based on OcSaFPN in Aerial Images with Noise

no code implementations18 Dec 2020 Chengyuan Li, Jun Liu, Hailong Hong, Wenju Mao, Chenjie Wang, Chudi Hu, Xin Su, Bin Luo

On the basis of this, a novel octave convolution-based semantic attention feature pyramid network (OcSaFPN) is proposed to get higher accuracy in object detection with noise.

Attribute Denoising +2

A ROM-accelerated parallel-in-time preconditioner for solving all-at-once systems from evolutionary PDEs

no code implementations16 Dec 2020 Jun Liu, Zhu Wang

In this paper we propose to use model reduction techniques for speeding up the diagonalization-based parallel-in-time (ParaDIAG) preconditioner, for iteratively solving all-at-once systems from evolutionary PDEs.

Numerical Analysis Numerical Analysis Dynamical Systems

XTQA: Span-Level Explanations of the Textbook Question Answering

1 code implementation25 Nov 2020 Jie Ma, Qi Chai, Jun Liu, Qingyu Yin, Pinghui Wang, Qinghua Zheng

Textbook Question Answering (TQA) is a task that one should answer a diagram/non-diagram question given a large multi-modal context consisting of abundant essays and diagrams.

Question Answering

Skeleton-based Relational Reasoning for Group Activity Analysis

no code implementations11 Nov 2020 Mauricio Perez, Jun Liu, Alex C. Kot

In this paper, we leverage the skeleton information to learn the interactions between the individuals straight from it.

Group Activity Recognition Optical Flow Estimation +1

Multiscale Attention Guided Network for COVID-19 Diagnosis Using Chest X-ray Images

1 code implementation11 Nov 2020 Jingxiong Li, Yaqi Wang, Shuai Wang, Jun Wang, Jun Liu, Qun Jin, Lingling Sun

Moreover, massive data collection is impractical for a newly emerged disease, which limited the performance of data thirsty deep learning models.

Classification COVID-19 Diagnosis +1

Deep Learning based Monocular Depth Prediction: Datasets, Methods and Applications

no code implementations9 Nov 2020 Qing Li, Jiasong Zhu, Jun Liu, Rui Cao, Qingquan Li, Sen Jia, Guoping Qiu

Despite the rapid progress in this topic, there are lacking of a comprehensive review, which is needed to summarize the current progress and provide the future directions.

Depth Prediction Indoor Localization +2

Benchmarking LF-MMI, CTC and RNN-T Criteria for Streaming ASR

no code implementations9 Nov 2020 Xiaohui Zhang, Frank Zhang, Chunxi Liu, Kjell Schubert, Julian Chan, Pradyot Prakash, Jun Liu, Ching-Feng Yeh, Fuchun Peng, Yatharth Saraf, Geoffrey Zweig

In this work, to measure the accuracy and efficiency for a latency-controlled streaming automatic speech recognition (ASR) application, we perform comprehensive evaluations on three popular training criteria: LF-MMI, CTC and RNN-T.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Exact Phase Transitions of Model RB with Slower-Growing Domains

no code implementations5 Nov 2020 Jun Liu, Ke Xu, Guangyan Zhou

The second moment method has always been an effective tool to lower bound the satisfiability threshold of many random constraint satisfaction problems.

Robust Face Alignment by Multi-order High-precision Hourglass Network

no code implementations17 Oct 2020 Jun Wan, Zhihui Lai, Jun Liu, Jie zhou, Can Gao

Heatmap regression (HR) has become one of the mainstream approaches for face alignment and has obtained promising results under constrained environments.

Face Alignment regression +2

Novel Parameter Estimation and Radar Detection Approaches for Multiple Point-like Targets: Designs and Comparisons

no code implementations17 Sep 2020 Pia Addabbo, Jun Liu, Danilo Orlando, Giuseppe Ricci

In this work, we develop and compare two innovative strategies for parameter estimation and radar detection of multiple point-like targets.

Smooth Converse Lyapunov-Barrier Theorems for Asymptotic Stability with Safety Constraints and Reach-Avoid-Stay Specifications

no code implementations9 Sep 2020 Yiming Meng, Yinan Li, Maxwell Fitzsimmons, Jun Liu

While the converse Lyapunov-barrier theorems are not constructive, as with classical converse Lyapunov theorems, we believe that the unified necessary and sufficient conditions with a single Lyapunov-barrier function are of theoretical interest and can hopefully shed some light on computational approaches.

Radar Adaptive Detection Architectures for Heterogeneous Environments

no code implementations4 Aug 2020 Jun Liu, Davide Massaro, Danilo Orlando, Alfonso Farina

In this paper, four adaptive radar architectures for target detection in heterogeneous Gaussian environments are devised.

Cannot find the paper you are looking for? You can Submit a new open access paper.