1 code implementation • 14 Mar 2024 • Jiaqing Zhang, Mingxiang Cao, Xue Yang, Weiying Xie, Jie Lei, Daixun Li, Geng Yang, Wenbo Huang, Yunsong Li
Multimodal image fusion and object detection play a vital role in autonomous driving.
2 code implementations • 29 Feb 2024 • Zi-Kai Xiao, Guo-Ye Yang, Xue Yang, Tai-Jiang Mu, Junchi Yan, Shi-Min Hu
Considerable efforts have been devoted to Oriented Object Detection (OOD).
no code implementations • 27 Feb 2024 • Xue Yang, Changchun Bao, Jing Zhou, Xianhong Chen
These weighting matrices reflect the similarity among different frames of the T-F representations and are further employed to obtain the consistent T-F representations of the enrollment.
no code implementations • 14 Dec 2023 • Hao Li, Xue Yang, Zhaokai Wang, Xizhou Zhu, Jie zhou, Yu Qiao, Xiaogang Wang, Hongsheng Li, Lewei Lu, Jifeng Dai
Many reinforcement learning environments (e. g., Minecraft) provide only sparse rewards that indicate task completion or failure with binary values.
no code implementations • 8 Dec 2023 • Tongkun Guan, Wei Shen, Xue Yang, Xuehui Wang, Xiaokang Yang
Existing scene text detection methods typically rely on extensive real data for training.
no code implementations • 3 Dec 2023 • Xue Yang, Enda Howley, Micheal Schukat
In this paper, we model thresholding in anomaly detection as a Markov Decision Process and propose an agent-based dynamic thresholding (ADT) framework based on a deep Q-network.
1 code implementation • 23 Nov 2023 • Junwei Luo, Xue Yang, Yi Yu, Qingyun Li, Junchi Yan, Yansheng Li
Single point-supervised object detection is gaining attention due to its cost-effectiveness.
2 code implementations • 23 Nov 2023 • Yi Yu, Xue Yang, Qingyun Li, Feipeng Da, Jifeng Dai, Yu Qiao, Junchi Yan
To our best knowledge, Point2RBox is the first end-to-end solution for point-supervised OOD.
no code implementations • 22 Nov 2023 • Guangming Cao, Xuehui Yu, Wenwen Yu, Xumeng Han, Xue Yang, Guorong Li, Jianbin Jiao, Zhenjun Han
In this study, we introduce the P2RBox network, which leverages point annotations and a mask generator to create mask proposals, followed by filtration through our Inspector Module and Constrainer Module.
no code implementations • 20 Nov 2023 • Yan Li, Weiwei Guo, Xue Yang, Ning Liao, Dunyun He, Jiaqi Zhou, Wenxian Yu
In this paper, we aim to develop open-vocabulary object detection (OVD) technique in aerial images that scales up object vocabulary size beyond training data.
no code implementations • 21 Jun 2023 • Cheng Yang, Xue Yang, Dongxian Wu, Xiaohu Tang
Then the server aggregates all the proxy datasets to form a central dummy dataset, which is used to finetune aggregated global model.
no code implementations • 15 Jun 2023 • Xue Yang, Zifeng Liu, Xiaohu Tang, Rongxing Lu, Bo Liu
With the emergence of privacy leaks in federated learning, secure aggregation protocols that mainly adopt either homomorphic encryption or threshold secret sharing have been widely developed for federated learning to protect the privacy of the local training data of each client.
2 code implementations • 9 May 2023 • Zhaoyang Liu, Yinan He, Wenhai Wang, Weiyun Wang, Yi Wang, Shoufa Chen, Qinglong Zhang, Zeqiang Lai, Yang Yang, Qingyun Li, Jiashuo Yu, Kunchang Li, Zhe Chen, Xue Yang, Xizhou Zhu, Yali Wang, LiMin Wang, Ping Luo, Jifeng Dai, Yu Qiao
Different from existing interactive systems that rely on pure language, by incorporating pointing instructions, the proposed iGPT significantly improves the efficiency of communication between users and chatbots, as well as the accuracy of chatbots in vision-centric tasks, especially in complicated visual scenarios where the number of objects is greater than 2.
1 code implementation • 9 Mar 2023 • Ying Zeng, Yushi Chen, Xue Yang, Qingyun Li, Junchi Yan
Existing oriented object detection methods commonly use metric AP$_{50}$ to measure the performance of the model.
1 code implementation • 6 Feb 2023 • Qinrou Wen, Jirui Yang, Xue Yang, Kewei Liang
To further refine masks obtained by compressed vectors, we propose for the first time a compressed vector based multi-stage refinement framework.
1 code implementation • ICCV 2023 • Tongkun Guan, Wei Shen, Xue Yang, Qi Feng, Zekun Jiang, Xiaokang Yang
Therefore, exploring the robust text feature representations on unlabeled real images by self-supervised learning is a good solution.
3 code implementations • 13 Oct 2022 • Xue Yang, Gefan Zhang, Wentong Li, Xuehui Wang, Yue Zhou, Junchi Yan
Oriented object detection emerges in many applications from aerial images to autonomous driving, while many existing detection benchmarks are annotated with horizontal bounding box only which is also less costive than fine-grained rotated box, leading to a gap between the readily available training corpus and the rising demand for oriented object detection.
1 code implementation • 22 Sep 2022 • Xue Yang, Gefan Zhang, Xiaojiang Yang, Yue Zhou, Wentao Wang, Jin Tang, Tao He, Junchi Yan
Existing detection methods commonly use a parameterized bounding box (BBox) to model and detect (horizontal) objects and an additional rotation angle parameter is used for rotated objects.
1 code implementation • 4 Aug 2022 • Yiming Li, Mingyan Zhu, Xue Yang, Yong Jiang, Tao Wei, Shu-Tao Xia
The rapid development of DNNs has benefited from the existence of some high-quality datasets ($e. g.$, ImageNet), which allow researchers and developers to easily verify the performance of their methods.
1 code implementation • 24 May 2022 • Liping Hou, Ke Lu, Xue Yang, Yuqiu Li, Jian Xue
To go further, in this paper, we propose a unified Gaussian representation called G-Rep to construct Gaussian distributions for OBB, QBB, and PointSet, which achieves a unified solution to various representations and problems.
1 code implementation • 28 Apr 2022 • Yue Zhou, Xue Yang, Gefan Zhang, Jiabao Wang, Yanyi Liu, Liping Hou, Xue Jiang, Xingzhao Liu, Junchi Yan, Chengqi Lyu, Wenwei Zhang, Kai Chen
We present an open-source toolbox, named MMRotate, which provides a coherent algorithm framework of training, inferring, and evaluation for the popular rotated object detection algorithm based on deep learning.
no code implementations • 25 Mar 2022 • Xue Yang, Changchun Bao
Various network architectures, from traditional convolutional neural network (CNN) and recurrent neural network (RNN) to advanced transformer, have been designed sophistically to improve separation performance.
1 code implementation • CVPR 2023 • Tongkun Guan, Chaochen Gu, Jingzheng Tu, Xue Yang, Qi Feng, Yudi Zhao, Xiaokang Yang, Wei Shen
Supervised attention can alleviate the above issue, but it is character category-specific, which requires extra laborious character-level bounding box annotations and would be memory-intensive when handling languages with larger character categories.
Ranked #2 on Scene Text Recognition on ICDAR 2003
3 code implementations • 29 Jan 2022 • Xue Yang, Yue Zhou, Gefan Zhang, Jirui Yang, Wentao Wang, Junchi Yan, Xiaopeng Zhang, Qi Tian
This is in contrast to recent Gaussian modeling based rotation detectors e. g. GWD loss and KLD loss that involve a human-specified distribution distance metric which require additional hyperparameter tuning that vary across datasets and detectors.
no code implementations • CVPR 2022 • Wentao Wang, Li Niu, Jianfu Zhang, Xue Yang, Liqing Zhang
Different from feed-forward methods, they seek for a closest latent code to the corrupted image and feed it to a pretrained generator.
1 code implementation • 12 Nov 2021 • Xue Yang, Yue Zhou, Junchi Yan
AlphaRotate is an open-source Tensorflow benchmark for performing scalable rotation detection on various datasets.
1 code implementation • 24 Sep 2021 • Wen Qian, Xue Yang, Silong Peng, Junchi Yan, Xiujuan Zhang
We classify the discontinuity of loss in both five-param and eight-param rotated object detection methods as rotation sensitivity error (RSE) which will result in performance degeneration.
no code implementations • 10 Jun 2021 • Mengyuan Fang, Luliang Tang, Zihan Kan, Xue Yang, Tao Pei, Qingquan Li, Chaokui Li
As an important spatial analysis approach, the clustering methods of point events have been extended to OD flows to identify the dominant trends and spatial structures of urban mobility.
2 code implementations • NeurIPS 2021 • Xue Yang, Xiaojiang Yang, Jirui Yang, Qi Ming, Wentao Wang, Qi Tian, Junchi Yan
Taking the perspective that horizontal detection is a special case for rotated object detection, in this paper, we are motivated to change the design of rotation regression loss from induction paradigm to deduction methodology, in terms of the relation between rotation and horizontal detection.
Ranked #14 on Object Detection In Aerial Images on DOTA (using extra training data)
1 code implementation • 22 Mar 2021 • Qi Ming, Lingjuan Miao, Zhiqiang Zhou, Xue Yang, Yunpeng Dong
In this paper, we propose a Representation Invariance Loss (RIL) to optimize the bounding box regression for the rotating objects.
Ranked #27 on Object Detection In Aerial Images on DOTA (using extra training data)
2 code implementations • 28 Jan 2021 • Xue Yang, Junchi Yan, Qi Ming, Wentao Wang, Xiaopeng Zhang, Qi Tian
Boundary discontinuity and its inconsistency to the final detection metric have been the bottleneck for rotating detection regression loss design.
Ranked #16 on Object Detection In Aerial Images on DOTA (using extra training data)
no code implementations • ICCV 2021 • Wentao Wang, Jianfu Zhang, Li Niu, Haoyu Ling, Xue Yang, Liqing Zhang
Conventional deep image inpainting methods are based on auto-encoder architecture, in which the spatial details of images will be lost in the down-sampling process, leading to the degradation of generated results.
3 code implementations • CVPR 2021 • Xue Yang, Liping Hou, Yue Zhou, Wentao Wang, Junchi Yan
Rotation detection serves as a fundamental building block in many visual applications involving aerial image, scene text, and face etc.
Ranked #29 on Object Detection In Aerial Images on DOTA (using extra training data)
no code implementations • 21 Aug 2020 • Yiming Li, Jiawang Bai, Jiawei Li, Xue Yang, Yong Jiang, Shu-Tao Xia
Interpretability and effectiveness are two essential and indispensable requirements for adopting machine learning methods in reality.
5 code implementations • 28 Apr 2020 • Xue Yang, Junchi Yan, Wenlong Liao, Xiaokang Yang, Jin Tang, Tao He
Instance-level denoising on the feature map is performed to enhance the detection to small and cluttered objects.
Ranked #33 on Object Detection In Aerial Images on DOTA (using extra training data)
4 code implementations • ECCV 2020 • Xue Yang, Junchi Yan
For the resulting circularly distributed angle classification problem, we first devise a Circular Smooth Label technique to handle the periodicity of angle and increase the error tolerance to adjacent angles.
Ranked #37 on Object Detection In Aerial Images on DOTA (using extra training data)
1 code implementation • 23 Feb 2020 • Xue Yang, Yan Feng, Weijun Fang, Jun Shao, Xiaohu Tang, Shu-Tao Xia, Rongxing Lu
However, the strong defence ability and high learning accuracy of these schemes cannot be ensured at the same time, which will impede the wide application of FL in practice (especially for medical or financial institutions that require both high accuracy and strong privacy guarantee).
2 code implementations • 19 Nov 2019 • Wen Qian, Xue Yang, Silong Peng, Yue Guo, Junchi Yan
Popular rotated detection methods usually use five parameters (coordinates of the central point, width, height, and rotation angle) to describe the rotated bounding box and l1-loss as the loss function.
Ranked #43 on Object Detection In Aerial Images on DOTA (using extra training data)
no code implementations • 17 Sep 2019 • Yi Liu, Chao Pang, Zongqian Zhan, Xiaomeng Zhang, Xue Yang
In recent years, building change detection methods have made great progress by introducing deep learning, but they still suffer from the problem of the extracted features not being discriminative enough, resulting in incomplete regions and irregular boundaries.
Building change detection for remote sensing images Change Detection +3
10 code implementations • 15 Aug 2019 • Xue Yang, Junchi Yan, Ziming Feng, Tao He
Considering the shortcoming of feature misalignment in existing refined single-stage detector, we design a feature refinement module to improve detection performance by getting more accurate features.
no code implementations • 27 Jul 2019 • Ang Li, Xue Yang, Chongyang Zhang
We extend the state-of-the-art Cascade R-CNN with a simple feature sharing mechanism.
no code implementations • 4 Apr 2019 • Tengfei Zhang, Yue Zhang, Xian Sun, Hao Sun, Menglong Yan, Xue Yang, Kun fu
A two-stage detector for OSCD is introduced to compare the extracted query and target features with the learnable metric to approach the optimized non-linear conditional probability.
no code implementations • 10 Mar 2019 • Yiming Li, Jiawang Bai, Jiawei Li, Xue Yang, Yong Jiang, Chun Li, Shu-Tao Xia
Despite the impressive performance of random forests (RF), its theoretical properties have not been thoroughly understood.
3 code implementations • ICCV 2019 • Xue Yang, Jirui Yang, Junchi Yan, Yue Zhang, Tengfei Zhang, Zhi Guo, Sun Xian, Kun fu
Specifically, a sampling fusion network is devised which fuses multi-layer feature with effective anchor sampling, to improve the sensitivity to small objects.
Ranked #47 on Object Detection In Aerial Images on DOTA (using extra training data)
3 code implementations • 13 Jun 2018 • Xue Yang, Hao Sun, Xian Sun, Menglong Yan, Zhi Guo, Kun fu
The complexity of application scenarios, the redundancy of detection region, and the difficulty of dense ship detection are all the main obstacles that limit the successful operation of traditional methods in ship detection.
4 code implementations • 12 Jun 2018 • Xue Yang, Hao Sun, Kun fu, Jirui Yang, Xian Sun, Menglong Yan, Zhi Guo
Additionally, in the case of ship rotation and dense arrangement, we design a rotation anchor strategy to predict the minimum circumscribed rectangle of the object so as to reduce the redundant detection region and improve the recall.
no code implementations • 24 Feb 2017 • Fei Han, Xue Yang, Yu Zhang, Hao Zhang
Apprenticeship learning has recently attracted a wide attention due to its capability of allowing robots to learn physical tasks directly from demonstrations provided by human experts.
no code implementations • 24 Feb 2017 • Fei Han, Xue Yang, Christopher Reardon, Yu Zhang, Hao Zhang
We formulate FABL as a regression-like optimization problem with structured sparsity-inducing norms to model interrelationships of body parts and features.
no code implementations • 30 Apr 2016 • Xue Yang, Fei Han, Hua Wang, Hao Zhang
Sparse representation has been widely studied in visual tracking, which has shown promising tracking performance.