Search Results for author: Antoni B. Chan

Found 57 papers, 18 papers with code

Learning Tracking Representations from Single Point Annotations

no code implementations15 Apr 2024 Qiangqiang Wu, Antoni B. Chan

In this paper, we propose to learn tracking representations from single point annotations (i. e., 4. 5x faster to annotate than the traditional bounding box) in a weakly supervised manner.

Contrastive Learning Visual Tracking

A Fixed-Point Approach to Unified Prompt-Based Counting

no code implementations15 Mar 2024 Wei Lin, Antoni B. Chan

Additionally, a contrastive training scheme is implemented to mitigate dataset bias inherent in current class-agnostic counting datasets, a strategy whose effectiveness is confirmed by our ablation study.

Robust Unsupervised Crowd Counting and Localization with Adaptive Resolution SAM

no code implementations27 Feb 2024 Jia Wan, Qiangqiang Wu, Wei Lin, Antoni B. Chan

The existing crowd counting models require extensive training data, which is time-consuming to annotate.

Crowd Counting

Edit Temporal-Consistent Videos with Image Diffusion Model

no code implementations17 Aug 2023 Yuanzhi Wang, Yong Li, Xiaoya Zhang, Xin Liu, Anbo Dai, Antoni B. Chan, Zhen Cui

In addition to the utilization of a pretrained T2I 2D Unet for spatial content manipulation, we establish a dedicated temporal Unet architecture to faithfully capture the temporal coherence of the input video sequences.

Video Temporal Consistency

Human Attention-Guided Explainable Artificial Intelligence for Computer Vision Models

no code implementations5 May 2023 Guoyang Liu, Jindi Zhang, Antoni B. Chan, Janet H. Hsiao

We examined whether embedding human attention knowledge into saliency-based explainable AI (XAI) methods for computer vision models could enhance their plausibility and faithfulness.

Classification Explainable artificial intelligence +5

ODAM: Gradient-based instance-specific visual explanations for object detection

no code implementations13 Apr 2023 Chenyang Zhao, Antoni B. Chan

We propose the gradient-weighted Object Detector Activation Maps (ODAM), a visualized explanation technique for interpreting the predictions of object detectors.

Attribute Object +2

DropMAE: Masked Autoencoders with Spatial-Attention Dropout for Tracking Tasks

1 code implementation CVPR 2023 Qiangqiang Wu, Tianyu Yang, Ziquan Liu, Baoyuan Wu, Ying Shan, Antoni B. Chan

However, we find that this simple baseline heavily relies on spatial cues while ignoring temporal relations for frame reconstruction, thus leading to sub-optimal temporal matching representations for VOT and VOS.

 Ranked #1 on Visual Object Tracking on TrackingNet (AUC metric)

Semantic Segmentation Video Object Segmentation +2

TWINS: A Fine-Tuning Framework for Improved Transferability of Adversarial Robustness and Generalization

1 code implementation CVPR 2023 Ziquan Liu, Yi Xu, Xiangyang Ji, Antoni B. Chan

To better exploit the potential of pre-trained models in adversarial robustness, this paper focuses on the fine-tuning of an adversarially pre-trained model in various classification tasks.

Adversarial Robustness Image Classification

Optimal Transport Minimization: Crowd Localization on Density Maps for Semi-Supervised Counting

1 code implementation CVPR 2023 Wei Lin, Antoni B. Chan

In this paper, we propose the optimal transport minimization (OT-M) algorithm for crowd localization with density maps.

Crowd Counting

Boosting Adversarial Robustness From The Perspective of Effective Margin Regularization

no code implementations11 Oct 2022 Ziquan Liu, Antoni B. Chan

Our empirical study on feedforward DNNs demonstrates that the proposed effective margin regularization (EMR) learns large effective margins and boosts the adversarial robustness in both standard and adversarial training.

Adversarial Defense Adversarial Robustness

Pareto Optimization for Active Learning under Out-of-Distribution Data Scenarios

no code implementations4 Jul 2022 Xueying Zhan, Zeyu Dai, Qingzhong Wang, Qing Li, Haoyi Xiong, Dejing Dou, Antoni B. Chan

In this paper, we propose a sampling scheme, Monte-Carlo Pareto Optimization for Active Learning (POAL), which selects optimal subsets of unlabeled samples with fixed batch size from the unlabeled data pool.

Active Learning

An Empirical Study on Distribution Shift Robustness From the Perspective of Pre-Training and Data Augmentation

no code implementations25 May 2022 Ziquan Liu, Yi Xu, Yuanhong Xu, Qi Qian, Hao Li, Rong Jin, Xiangyang Ji, Antoni B. Chan

With our empirical result obtained from 1, 330 models, we provide the following main observations: 1) ERM combined with data augmentation can achieve state-of-the-art performance if we choose a proper pre-trained model respecting the data property; 2) specialized algorithms further improve the robustness on top of ERM when handling a specific type of distribution shift, e. g., GroupDRO for spurious correlation and CORAL for large-scale out-of-distribution data; 3) Comparing different pre-training modes, architectures and data sizes, we provide novel observations about pre-training on distribution shift, which sheds light on designing or selecting pre-training strategy for different kinds of distribution shifts.

Data Augmentation

Cross-View Cross-Scene Multi-View Crowd Counting

no code implementations CVPR 2021 Qi Zhang, Wei Lin, Antoni B. Chan

Multi-view crowd counting has been previously proposed to utilize multi-cameras to extend the field-of-view of a single camera, capturing more people in the scene, and improve counting performance for occluded people or those in low resolution.

Camera Calibration Crowd Counting

On Distinctive Image Captioning via Comparing and Reweighting

no code implementations8 Apr 2022 Jiuniu Wang, Wenjia Xu, Qingzhong Wang, Antoni B. Chan

First, we propose a distinctiveness metric -- between-set CIDEr (CIDErBtw) to evaluate the distinctiveness of a caption with respect to those of similar images.

Image Captioning Retrieval +1

A Comparative Survey of Deep Active Learning

1 code implementation25 Mar 2022 Xueying Zhan, Qingzhong Wang, Kuan-Hao Huang, Haoyi Xiong, Dejing Dou, Antoni B. Chan

In this work, We construct a DAL toolkit, DeepAL+, by re-implementing 19 highly-cited DAL methods.

Active Learning

A Lightweight and Detector-free 3D Single Object Tracker on Point Clouds

1 code implementation8 Mar 2022 Yan Xia, Qiangqiang Wu, Wei Li, Antoni B. Chan, Uwe Stilla

Recent works on 3D single object tracking treat the task as a target-specific 3D detection task, where an off-the-shelf 3D detector is commonly employed for the tracking.

3D Single Object Tracking motion prediction +1

Crowd Counting in the Frequency Domain

1 code implementation CVPR 2022 Weibo Shu, Jia Wan, Kay Chen Tan, Sam Kwong, Antoni B. Chan

By transforming the density map into the frequency domain and using the nice properties of the characteristic function, we propose a novel method that is simple, effective, and efficient.

Crowd Counting

BEV-Net: Assessing Social Distancing Compliance by Joint People Localization and Geometric Reasoning

1 code implementation ICCV 2021 Zhirui Dai, Yuepeng Jiang, Yi Li, Bo Liu, Antoni B. Chan, Nuno Vasconcelos

A dataset of crowd scenes with people annotations under a bird's eye view (BEV) and ground truth for metric distances is introduced, and several measures for the evaluation of social distance detection systems are proposed.

Pose Estimation

Group-based Distinctive Image Captioning with Memory Attention

no code implementations20 Aug 2021 Jiuniu Wang, Wenjia Xu, Qingzhong Wang, Antoni B. Chan

In particular, we propose a group-based memory attention (GMA) module, which stores object features that are unique among the image group (i. e., with low similarity to objects in other images).

Contrastive Learning Image Captioning +1

Multiple-criteria Based Active Learning with Fixed-size Determinantal Point Processes

no code implementations4 Jul 2021 Xueying Zhan, Qing Li, Antoni B. Chan

In this paper, we introduce a multiple-criteria based active learning algorithm, which incorporates three complementary criteria, i. e., informativeness, representativeness and diversity, to make appropriate selections in the active learning rounds under different data types.

Active Learning Informativeness +1

A Generalized Loss Function for Crowd Counting and Localization

no code implementations CVPR 2021 Jia Wan, Ziquan Liu, Antoni B. Chan

In this paper, we investigate learning the density map representation through an unbalanced optimal transport problem, and propose a generalized loss function to learn density maps for crowd counting and localization.

Crowd Counting

Progressive Unsupervised Learning for Visual Object Tracking

no code implementations CVPR 2021 Qiangqiang Wu, Jia Wan, Antoni B. Chan

In this paper, we propose a progressive unsupervised learning (PUL) framework, which entirely removes the need for annotated training videos in visual tracking.

Contrastive Learning Object +2

Weight Rescaling: Effective and Robust Regularization for Deep Neural Networks with Batch Normalization

no code implementations6 Feb 2021 Ziquan Liu, Yufei Cui, Jia Wan, Yu Mao, Antoni B. Chan

On the one hand, when the non-adaptive learning rate e. g. SGD with momentum is used, the effective learning rate continues to increase even after the initial training stage, which leads to an overfitting effect in many neural architectures.

Crowd Counting Image Classification +3

Variational Nested Dropout

1 code implementation CVPR 2021 Yufei Cui, Yu Mao, Ziquan Liu, Qiao Li, Antoni B. Chan, Xue Liu, Tei-Wei Kuo, Chun Jason Xue

Nested dropout is a variant of dropout operation that is able to order network parameters or features based on the pre-defined importance during training.

Representation Learning

Wide-Area Crowd Counting: Multi-View Fusion Networks for Counting in Large Scenes

1 code implementation2 Dec 2020 Qi Zhang, Antoni B. Chan

We consider three versions of the fusion framework: the late fusion model fuses camera-view density map; the naive early fusion model fuses camera-view feature maps; and the multi-view multi-scale early fusion model ensures that features aligned to the same ground-plane point have consistent scales.

Crowd Counting

Improve Generalization and Robustness of Neural Networks via Weight Scale Shifting Invariant Regularizations

no code implementations ICML Workshop AML 2021 Ziquan Liu, Yufei Cui, Antoni B. Chan

The derived regularizer is an upper bound for the input gradient of the network so minimizing the improved regularizer also benefits the adversarial robustness.

Adversarial Robustness

Tracking-by-Counting: Using Network Flows on Crowd Density Maps for Tracking Multiple Targets

no code implementations18 Jul 2020 Weihong Ren, Xinchao Wang, Jiandong Tian, Yandong Tang, Antoni B. Chan

State-of-the-art multi-object tracking~(MOT) methods follow the tracking-by-detection paradigm, where object trajectories are obtained by associating per-frame outputs of object detectors.

Cell Tracking Multi-Object Tracking +1

Compare and Reweight: Distinctive Image Captioning Using Similar Images Sets

no code implementations ECCV 2020 Jiuniu Wang, Wenjia Xu, Qingzhong Wang, Antoni B. Chan

A wide range of image captioning models has been developed, achieving significant improvement based on popular metrics, such as BLEU, CIDEr, and SPICE.

Image Captioning Retrieval

Fine-Grained Crowd Counting

no code implementations13 Jul 2020 Jia Wan, Nikil Senthil Kumar, Antoni B. Chan

Second, we propose a complementary attention model to share information between the two branches.

Crowd Counting Management +1

Single-Frame based Deep View Synchronization for Unsynchronized Multi-Camera Surveillance

no code implementations8 Jul 2020 Qi Zhang, Antoni B. Chan

To handle the issue of unsynchronized multi-cameras, in this paper, we propose a synchronization model that works in conjunction with existing DNN-based multi-view models, thus avoiding the redesign of the whole model.

3D Pose Estimation 3D Reconstruction

Over-crowdedness Alert! Forecasting the Future Crowd Distribution

no code implementations9 Jun 2020 Yuzhen Niu, Weifeng Shi, Wenxi Liu, Shengfeng He, Jia Pan, Antoni B. Chan

In this paper, we formulate a novel crowd analysis problem, in which we aim to predict the crowd distribution in the near future given sequential frames of a crowd video without any identity annotations.

3D Crowd Counting via Multi-View Fusion with 3D Gaussian Kernels

no code implementations18 Mar 2020 Qi Zhang, Antoni B. Chan

Unlike MVMS, we propose to solve the multi-view crowd counting task through 3D feature fusion with 3D scene-level density maps, instead of the 2D ground-plane ones.

Crowd Counting

Towards Diverse and Accurate Image Captions via Reinforcing Determinantal Point Process

1 code implementation14 Aug 2019 Qingzhong Wang, Antoni B. Chan

Although significant progress has been made in the field of automatic image captioning, it is still a challenging task.

Image Captioning Reinforcement Learning (RL)

ROAM: Recurrently Optimizing Tracking Model

no code implementations CVPR 2020 Tianyu Yang, Pengfei Xu, Runbo Hu, Hua Chai, Antoni B. Chan

In this paper, we design a tracking model consisting of response generation and bounding box regression, where the first component produces a heat map to indicate the presence of the object at different positions and the second part regresses the relative bounding box shifts to anchors mounted on sliding-window locations.

Meta-Learning Response Generation

Visual Tracking via Dynamic Memory Networks

no code implementations12 Jul 2019 Tianyu Yang, Antoni B. Chan

The reading and writing process of the external memory is controlled by an LSTM network with the search feature map as input.

Template Matching Visual Tracking

Accelerating Monte Carlo Bayesian Inference via Approximating Predictive Uncertainty over Simplex

1 code implementation29 May 2019 Yufei Cui, Wuguannan Yao, Qiao Li, Antoni B. Chan, Chun Jason Xue

In this work, assuming that the exact posterior or a decent approximation is obtained, we propose a generic framework to approximate the output probability distribution induced by model posterior with a parameterized model and in an amortized fashion.

Adversarial Attack Bayesian Inference +2

Describing like humans: on diversity in image captioning

1 code implementation CVPR 2019 Qingzhong Wang, Antoni B. Chan

We find that there is still a large gap between the model and human performance in terms of both accuracy and diversity and the models that have optimized accuracy (CIDEr) have low diversity.

Image Captioning

A Fully Bayesian Infinite Generative Model for Dynamic Texture Segmentation

no code implementations13 Jan 2019 Sahar Yousefi, M. T. Manzuri Shalmani, Antoni B. Chan

A major limitation of these models concerns the automatic selection of a proper number of DTs.

Segmentation

Gated Hierarchical Attention for Image Captioning

1 code implementation30 Oct 2018 Qingzhong Wang, Antoni B. Chan

Attention modules connecting encoder and decoders have been widely applied in the field of object recognition, image captioning, visual question answering and neural machine translation, and significantly improves the performance.

Image Captioning Reinforcement Learning (RL) +2

EMHMM Simulation Study

no code implementations17 Oct 2018 Antoni B. Chan, Janet H. Hsiao

Eye Movement analysis with Hidden Markov Models (EMHMM) is a method for modeling eye fixation sequences using hidden Markov models (HMMs).

Bayesian Inference

Fusing Crowd Density Maps and Visual Object Trackers for People Tracking in Crowd Scenes

no code implementations CVPR 2018 Weihong Ren, Di Kang, Yandong Tang, Antoni B. Chan

While people tracking has been greatly improved over the recent years, crowd scenes remain particularly challenging for people tracking due to heavy occlusions, high crowd density, and significant appearance variation.

CNN+CNN: Convolutional Decoders for Image Captioning

1 code implementation23 May 2018 Qingzhong Wang, Antoni B. Chan

We also test our model on the paragraph annotation dataset, and get higher CIDEr score compared with hierarchical LSTMs

Image Captioning Sentence

Learning Dynamic Memory Networks for Object Tracking

1 code implementation ECCV 2018 Tianyu Yang, Antoni B. Chan

In this paper, we propose a dynamic memory network to adapt the template to the target's appearance variations during tracking.

Object Object Tracking +2

Recurrent Filter Learning for Visual Tracking

1 code implementation13 Aug 2017 Tianyu Yang, Antoni B. Chan

Recently using convolutional neural networks (CNNs) has gained popularity in visual tracking, due to its robust feature representation of images.

Object Visual Tracking

Beyond Counting: Comparisons of Density Maps for Crowd Analysis Tasks - Counting, Detection, and Tracking

1 code implementation29 May 2017 Di Kang, Zheng Ma, Antoni B. Chan

The goal of this paper is to evaluate density maps generated by density estimation methods on a variety of crowd analysis tasks, including counting, detection, and tracking.

Density Estimation regression

Color Orchestra: Ordering Color Palettes for Interpolation and Prediction

no code implementations17 Mar 2017 Huy Q. Phan, Hongbo Fu, Antoni B. Chan

As artists often use their personal color themes in their paintings, making these palettes appear frequently in the dataset, we employed density estimation to capture the characteristics of palette data.

Density Estimation

Crowd Counting by Adapting Convolutional Neural Networks with Side Information

no code implementations21 Nov 2016 Di Kang, Debarun Dhar, Antoni B. Chan

In order to incorporate the available side information, we propose an adaptive convolutional neural network (ACNN), where the convolutional filter weights adapt to the current scene context via the side information.

Crowd Counting Image Deconvolution

Small Instance Detection by Integer Programming on Object Density Maps

no code implementations CVPR 2015 Zheng Ma, Lei Yu, Antoni B. Chan

For each region, a sliding window (ROI) is passed over the density map to calculate the instance count within each ROI.

Novel Object Detection Object +2

Heterogeneous Multi-task Learning for Human Pose Estimation with Deep Convolutional Neural Network

no code implementations13 Jun 2014 Sijin Li, Zhi-Qiang Liu, Antoni B. Chan

We propose an heterogeneous multi-task learning framework for human pose estimation from monocular image with deep convolutional neural network.

Multi-Task Learning Pose Estimation

Leveraging Long-Term Predictions and Online-Learning in Agent-based Multiple Person Tracking

no code implementations10 Feb 2014 Wenxi Liu, Antoni B. Chan, Rynson W. H. Lau, Dinesh Manocha

We present a multiple-person tracking algorithm, based on combining particle filters and RVO, an agent-based crowd model that infers collision-free velocities so as to predict pedestrian's motion.

Position

On Approximate Inference for Generalized Gaussian Process Models

no code implementations25 Nov 2013 Lifeng Shang, Antoni B. Chan

In this paper, we consider efficient algorithms for approximate inference on GGPMs using the general form of the EFD.

Multivariate Generalized Gaussian Process Models

no code implementations2 Nov 2013 Antoni B. Chan

We propose a family of multivariate Gaussian process models for correlated outputs, based on assuming that the likelihood function takes the generic form of the multivariate exponential family distribution (EFD).

regression

Crossing the Line: Crowd Counting by Integer Programming with Local Features

no code implementations CVPR 2013 Zheng Ma, Antoni B. Chan

Next, the number of people is estimated in a set of overlapping sliding windows on the temporal slice image, using a regression function that maps from local features to a count.

Crowd Counting

The variational hierarchical EM algorithm for clustering hidden Markov models

no code implementations NeurIPS 2012 Emanuele Coviello, Gert R. Lanckriet, Antoni B. Chan

In this paper, we derive a novel algorithm to cluster hidden Markov models (HMMs) according to their probability distributions.

Clustering Music Tagging

Cannot find the paper you are looking for? You can Submit a new open access paper.