Search Results for author: Chenhongyi Yang

Found 12 papers, 9 papers with code

PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition

1 code implementation 26 Mar 2024 Chenhongyi Yang, Zehui Chen, Miguel Espinosa, Linus Ericsson, Zhenyu Wang, Jiaming Liu, Elliot J. Crowley

In this paper, we further adapt the selective scanning process of Mamba to the visual domain, enhancing its ability to learn features from two-dimensional images by (i) a continuous 2D scanning process that improves spatial continuity by ensuring adjacency of tokens in the scanning sequence, and (ii) direction-aware updating which enables the model to discern the spatial relations of tokens by encoding directional information.

Image Classification Instance Segmentation +3
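
The "continuous 2D scanning" idea in the PlainMamba abstract above can be illustrated with a small sketch. This is not the authors' implementation: it assumes a simple serpentine (snake) traversal as one possible continuous order, and the function names `snake_scan_order`, `raster_scan_order`, and `is_spatially_continuous` are hypothetical. It only shows why a row-by-row raster order breaks token adjacency while a snake order preserves it; direction-aware updating is not covered.

```python
def raster_scan_order(height, width):
    """Plain row-major order: every row is read left to right."""
    return [(row, col) for row in range(height) for col in range(width)]


def snake_scan_order(height, width):
    """Serpentine order (an assumed example of a continuous scan):
    even rows run left to right, odd rows run right to left."""
    order = []
    for row in range(height):
        cols = range(width) if row % 2 == 0 else range(width - 1, -1, -1)
        order.extend((row, col) for col in cols)
    return order


def is_spatially_continuous(order):
    """True if every pair of consecutive tokens is adjacent on the 2D grid
    (Manhattan distance of exactly 1)."""
    return all(
        abs(r1 - r2) + abs(c1 - c2) == 1
        for (r1, c1), (r2, c2) in zip(order, order[1:])
    )


print(snake_scan_order(2, 3))
# [(0, 0), (0, 1), (0, 2), (1, 2), (1, 1), (1, 0)]
print(is_spatially_continuous(snake_scan_order(3, 4)))   # True
print(is_spatially_continuous(raster_scan_order(3, 4)))  # False
```

The raster order fails the check because the last token of each row is followed by the first token of the next row, which lies on the opposite side of the image.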

EgoPoseFormer: A Simple Baseline for Egocentric 3D Human Pose Estimation

no code implementations 26 Mar 2024 Chenhongyi Yang, Anastasia Tkach, Shreyas Hampali, Linguang Zhang, Elliot J. Crowley, Cem Keskin

We also show that our method can be seamlessly extended to monocular settings, which achieves state-of-the-art performance on the SceneEgo dataset, improving MPJPE by 25.5mm (21% improvement) compared to the best existing method with only 60.7% model parameters and 36.4% FLOPs.

Egocentric Pose Estimation

WidthFormer: Toward Efficient Transformer-based BEV View Transformation

1 code implementation 8 Jan 2024 Chenhongyi Yang, Tianwei Lin, Lichao Huang, Elliot J. Crowley

In this work, we present WidthFormer, a novel transformer-based Bird's-Eye-View (BEV) 3D detection method tailored for real-time autonomous-driving applications.

3D Object Detection Autonomous Driving +3

GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation

2 code implementations 13 Dec 2022 Chenhongyi Yang, Jiarui Xu, Shalini De Mello, Elliot J. Crowley, Xiaolong Wang

In each GP Block, features are first grouped together by a fixed number of learnable group tokens; we then perform Group Propagation where global information is exchanged between the grouped features; finally, global information in the updated grouped features is returned back to the image features through a transformer decoder.

Image Classification Instance Segmentation +5
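
The three steps of the GP Block described in the GPViT abstract above (group, propagate, return) can be sketched in plain Python. This is a rough illustration under strong assumptions, not the paper's architecture: attention here has no learned projections, keys double as values, the propagation step is assumed to be self-attention among the grouped features (the abstract does not say how information is exchanged), and the final step is reduced to a single cross-attention with a residual add rather than a full transformer decoder. The names `attend` and `gp_block` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(queries, keys_values):
    """Scaled dot-product attention without learned projections.
    Each query returns a weighted average of keys_values."""
    dim = len(keys_values[0])
    outputs = []
    for q in queries:
        scores = [
            sum(a * b for a, b in zip(q, kv)) / math.sqrt(dim)
            for kv in keys_values
        ]
        weights = softmax(scores)
        outputs.append(
            [sum(w * kv[j] for w, kv in zip(weights, keys_values)) for j in range(dim)]
        )
    return outputs


def gp_block(image_feats, group_tokens):
    # 1. Grouping: a fixed number of group tokens gather image features.
    grouped = attend(group_tokens, image_feats)
    # 2. Propagation (assumed mechanism): groups exchange information
    #    with each other through self-attention.
    propagated = attend(grouped, grouped)
    # 3. Return: image features query the updated groups; the result is
    #    added back as a residual.
    update = attend(image_feats, propagated)
    return [[x + u for x, u in zip(f, upd)] for f, upd in zip(image_feats, update)]


feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]   # 3 image tokens, dim 2
groups = [[0.5, 0.5], [1.0, -1.0]]             # 2 group tokens (learnable in practice)
out = gp_block(feats, groups)
print(len(out), len(out[0]))  # 3 2
```

Because the number of group tokens is fixed, the grouping and return steps scale linearly with the number of image tokens, which is what makes this kind of design attractive at high resolution.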

Plug and Play Active Learning for Object Detection

1 code implementation 21 Nov 2022 Chenhongyi Yang, Lichao Huang, Elliot J. Crowley

To overcome this challenge, we introduce Plug and Play Active Learning (PPAL), a simple and effective AL strategy for object detection.

Active Learning Image Classification +3

Prediction-Guided Distillation for Dense Object Detection

1 code implementation 10 Mar 2022 Chenhongyi Yang, Mateusz Ochal, Amos Storkey, Elliot J. Crowley

Based on this, we propose Prediction-Guided Distillation (PGD), which focuses distillation on these key predictive regions of the teacher and yields considerable gains in performance over many existing KD baselines.

Dense Object Detection Knowledge Distillation +2

Contrastive Object-level Pre-training with Spatial Noise Curriculum Learning

1 code implementation 26 Nov 2021 Chenhongyi Yang, Lichao Huang, Elliot J. Crowley

The goal of contrastive learning based pre-training is to leverage large quantities of unlabeled data to produce a model that can be readily adapted downstream.

Contrastive Learning Instance Segmentation +2

Disentangle Your Dense Object Detector

2 code implementations 7 Jul 2021 Zehui Chen, Chenhongyi Yang, Qiaofei Li, Feng Zhao, Zheng-Jun Zha, Feng Wu

Extensive experiments on the MS COCO benchmark show that our approach can lead to 2.0 mAP, 2.4 mAP and 2.2 mAP absolute improvements on RetinaNet, FCOS, and ATSS baselines with negligible extra overhead.

Disentanglement Object +2

QueryDet: Cascaded Sparse Query for Accelerating High-Resolution Small Object Detection

1 code implementation CVPR 2022 Chenhongyi Yang, Zehao Huang, Naiyan Wang

On the popular COCO dataset, the proposed method improves the detection mAP by 1.0 and mAP-small by 2.0, and the high-resolution inference speed is improved to 3.0x on average.

object-detection Small Object Detection +1

Consistency Regularization with High-dimensional Non-adversarial Source-guided Perturbation for Unsupervised Domain Adaptation in Segmentation

no code implementations 18 Sep 2020 Kaihong Wang, Chenhongyi Yang, Margrit Betke

Unsupervised domain adaptation for semantic segmentation has been intensively studied due to the low cost of the pixel-level annotation for synthetic data.

Semantic Segmentation Style Transfer +1

Learning to Separate: Detecting Heavily-Occluded Objects in Urban Scenes

1 code implementation ECCV 2020 Chenhongyi Yang, Vitaly Ablavsky, Kaihong Wang, Qi Feng, Margrit Betke

While visual object detection with deep learning has received much attention in the past decade, cases when heavy intra-class occlusions occur have not been studied thoroughly.

object-detection Object Detection +1
