Search Results for author: Xue Yang

Found 49 papers, 29 papers with code

EfficientMFD: Towards More Efficient Multimodal Synchronous Fusion Detection

1 code implementation • 14 Mar 2024 • Jiaqing Zhang, Mingxiang Cao, Xue Yang, Weiying Xie, Jie Lei, Daixun Li, Geng Yang, Wenbo Huang, Yunsong Li

Multimodal image fusion and object detection play a vital role in autonomous driving.

Autonomous Driving object-detection +1

Paper
Code

Theoretically Achieving Continuous Representation of Oriented Bounding Boxes

2 code implementations • 29 Feb 2024 • Zi-Kai Xiao, Guo-Ye Yang, Xue Yang, Tai-Jiang Mu, Junchi Yan, Shi-Min Hu

Considerable efforts have been devoted to Oriented Object Detection (OOD).

Fairness object-detection +2

167

Paper
Code

Target Speaker Extraction by Directly Exploiting Contextual Information in the Time-Frequency Domain

no code implementations • 27 Feb 2024 • Xue Yang, Changchun Bao, Jing Zhou, Xianhong Chen

These weighting matrices reflect the similarity among different frames of the T-F representations and are further employed to obtain the consistent T-F representations of the enrollment.

Target Speaker Extraction

Paper
Add Code

Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft

no code implementations • 14 Dec 2023 • Hao Li, Xue Yang, Zhaokai Wang, Xizhou Zhu, Jie zhou, Yu Qiao, Xiaogang Wang, Hongsheng Li, Lewei Lu, Jifeng Dai

Many reinforcement learning environments (e. g., Minecraft) provide only sparse rewards that indicate task completion or failure with binary values.

reinforcement-learning

Paper
Add Code

Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors

no code implementations • 8 Dec 2023 • Tongkun Guan, Wei Shen, Xue Yang, Xuehui Wang, Xiaokang Yang

Existing scene text detection methods typically rely on extensive real data for training.

Scene Text Detection Text Detection

Paper
Add Code

ADT: Agent-based Dynamic Thresholding for Anomaly Detection

no code implementations • 3 Dec 2023 • Xue Yang, Enda Howley, Micheal Schukat

In this paper, we model thresholding in anomaly detection as a Markov Decision Process and propose an agent-based dynamic thresholding (ADT) framework based on a deep Q-network.

Anomaly Detection

Paper
Add Code

PointOBB: Learning Oriented Object Detection via Single Point Supervision

1 code implementation • 23 Nov 2023 • Junwei Luo, Xue Yang, Yi Yu, Qingyun Li, Junchi Yan, Yansheng Li

Single point-supervised object detection is gaining attention due to its cost-effectiveness.

Object object-detection +3

Paper
Code

Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-end Oriented Object Detection with Single Point Supervision

2 code implementations • 23 Nov 2023 • Yi Yu, Xue Yang, Qingyun Li, Feipeng Da, Jifeng Dai, Yu Qiao, Junchi Yan

To our best knowledge, Point2RBox is the first end-to-end solution for point-supervised OOD.

Object object-detection +2

1,726

Paper
Code

P2RBox: A Single Point is All You Need for Oriented Object Detection

no code implementations • 22 Nov 2023 • Guangming Cao, Xuehui Yu, Wenwen Yu, Xumeng Han, Xue Yang, Guorong Li, Jianbin Jiao, Zhenjun Han

In this study, we introduce the P2RBox network, which leverages point annotations and a mask generator to create mask proposals, followed by filtration through our Inspector Module and Constrainer Module.

Object object-detection +2

Paper
Add Code

Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning

no code implementations • 20 Nov 2023 • Yan Li, Weiwei Guo, Xue Yang, Ning Liao, Dunyun He, Jiaqi Zhou, Wenxian Yu

In this paper, we aim to develop open-vocabulary object detection (OVD) technique in aerial images that scales up object vocabulary size beyond training data.

Object object-detection +3

Paper
Add Code

An Efficient Virtual Data Generation Method for Reducing Communication in Federated Learning

no code implementations • 21 Jun 2023 • Cheng Yang, Xue Yang, Dongxian Wu, Xiaohu Tang

Then the server aggregates all the proxy datasets to form a central dummy dataset, which is used to finetune aggregated global model.

Federated Learning

Paper
Add Code

An Efficient and Multi-private Key Secure Aggregation for Federated Learning

no code implementations • 15 Jun 2023 • Xue Yang, Zifeng Liu, Xiaohu Tang, Rongxing Lu, Bo Liu

With the emergence of privacy leaks in federated learning, secure aggregation protocols that mainly adopt either homomorphic encryption or threshold secret sharing have been widely developed for federated learning to protect the privacy of the local training data of each client.

Federated Learning

Paper
Add Code

InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language

2 code implementations • 9 May 2023 • Zhaoyang Liu, Yinan He, Wenhai Wang, Weiyun Wang, Yi Wang, Shoufa Chen, Qinglong Zhang, Zeqiang Lai, Yang Yang, Qingyun Li, Jiashuo Yu, Kunchang Li, Zhe Chen, Xue Yang, Xizhou Zhu, Yali Wang, LiMin Wang, Ping Luo, Jifeng Dai, Yu Qiao

Different from existing interactive systems that rely on pure language, by incorporating pointing instructions, the proposed iGPT significantly improves the efficiency of communication between users and chatbots, as well as the accuracy of chatbots in vision-centric tasks, especially in complicated visual scenarios where the number of objects is greater than 2.

Language Modelling

3,122

Paper
Code

ARS-DETR: Aspect Ratio-Sensitive Detection Transformer for Aerial Oriented Object Detection

1 code implementation • 9 Mar 2023 • Ying Zeng, Yushi Chen, Xue Yang, Qingyun Li, Junchi Yan

Existing oriented object detection methods commonly use metric AP$_{50}$ to measure the performance of the model.

Object object-detection +2

Paper
Code

PatchDCT: Patch Refinement for High Quality Instance Segmentation

1 code implementation • 6 Feb 2023 • Qinrou Wen, Jirui Yang, Xue Yang, Kewei Liang

To further refine masks obtained by compressed vectors, we propose for the first time a compressed vector based multi-stage refinement framework.

Instance Segmentation Semantic Segmentation +1

Paper
Code

Self-supervised Character-to-Character Distillation for Text Recognition

1 code implementation • ICCV 2023 • Tongkun Guan, Wei Shen, Xue Yang, Qi Feng, Zekun Jiang, Xiaokang Yang

Therefore, exploring the robust text feature representations on unlabeled real images by self-supervised learning is a good solution.

Ranked #1 on self-supervised scene text recognition on Scene Text Recognition Benchmarks

Data Augmentation Representation Learning +7

130

Paper
Code

H2RBox: Horizontal Box Annotation is All You Need for Oriented Object Detection

3 code implementations • 13 Oct 2022 • Xue Yang, Gefan Zhang, Wentong Li, Xuehui Wang, Yue Zhou, Junchi Yan

Oriented object detection emerges in many applications from aerial images to autonomous driving, while many existing detection benchmarks are annotated with horizontal bounding box only which is also less costive than fine-grained rotated box, leading to a gap between the readily available training corpus and the rising demand for oriented object detection.

Autonomous Driving Box-supervised Instance Segmentation +6

Paper
Code

Detecting Rotated Objects as Gaussian Distributions and Its 3-D Generalization

1 code implementation • 22 Sep 2022 • Xue Yang, Gefan Zhang, Xiaojiang Yang, Yue Zhou, Wentao Wang, Jin Tang, Tao He, Junchi Yan

Existing detection methods commonly use a parameterized bounding box (BBox) to model and detect (horizontal) objects and an additional rotation angle parameter is used for rotated objects.

regression

1,056

Paper
Code

Black-box Dataset Ownership Verification via Backdoor Watermarking

1 code implementation • 4 Aug 2022 • Yiming Li, Mingyan Zhu, Xue Yang, Yong Jiang, Tao Wei, Shu-Tao Xia

The rapid development of DNNs has benefited from the existence of some high-quality datasets ($e. g.$, ImageNet), which allow researchers and developers to easily verify the performance of their methods.

Paper
Code

G-Rep: Gaussian Representation for Arbitrary-Oriented Object Detection

1 code implementation • 24 May 2022 • Liping Hou, Ke Lu, Xue Yang, Yuqiu Li, Jian Xue

To go further, in this paper, we propose a unified Gaussian representation called G-Rep to construct Gaussian distributions for OBB, QBB, and PointSet, which achieves a unified solution to various representations and problems.

Object object-detection +3

1,726

Paper
Code

MMRotate: A Rotated Object Detection Benchmark using PyTorch

1 code implementation • 28 Apr 2022 • Yue Zhou, Xue Yang, Gefan Zhang, Jiabao Wang, Yanyi Liu, Liping Hou, Xue Jiang, Xingzhao Liu, Junchi Yan, Chengqi Lyu, Wenwei Zhang, Kai Chen

We present an open-source toolbox, named MMRotate, which provides a coherent algorithm framework of training, inferring, and evaluation for the popular rotated object detection algorithm based on deep learning.

Object object-detection +1

1,726

Paper
Code

Embedding Recurrent Layers with Dual-Path Strategy in a Variant of Convolutional Network for Speaker-Independent Speech Separation

no code implementations • 25 Mar 2022 • Xue Yang, Changchun Bao

Various network architectures, from traditional convolutional neural network (CNN) and recurrent neural network (RNN) to advanced transformer, have been designed sophistically to improve separation performance.

Computational Efficiency Speech Separation

Paper
Add Code

Self-supervised Implicit Glyph Attention for Text Recognition

1 code implementation • CVPR 2023 • Tongkun Guan, Chaochen Gu, Jingzheng Tu, Xue Yang, Qi Feng, Yudi Zhao, Xiaokang Yang, Wei Shen

Supervised attention can alleviate the above issue, but it is character category-specific, which requires extra laborious character-level bounding box annotations and would be memory-intensive when handling languages with larger character categories.

Ranked #2 on Scene Text Recognition on ICDAR 2003

Scene Text Recognition Text Segmentation

101

Paper
Code

The KFIoU Loss for Rotated Object Detection

3 code implementations • 29 Jan 2022 • Xue Yang, Yue Zhou, Gefan Zhang, Jirui Yang, Wentao Wang, Junchi Yan, Xiaopeng Zhang, Qi Tian

This is in contrast to recent Gaussian modeling based rotation detectors e. g. GWD loss and KLD loss that involve a human-specified distribution distance metric which require additional hyperparameter tuning that vary across datasets and detectors.

Object object-detection +1

1,726

Paper
Code

Dual-Path Image Inpainting With Auxiliary GAN Inversion

no code implementations • CVPR 2022 • Wentao Wang, Li Niu, Jianfu Zhang, Xue Yang, Liqing Zhang

Different from feed-forward methods, they seek for a closest latent code to the corrupted image and feed it to a pretrained generator.

Image Inpainting

Paper
Add Code

AlphaRotate: A Rotation Detection Benchmark using TensorFlow

1 code implementation • 12 Nov 2021 • Xue Yang, Yue Zhou, Junchi Yan

AlphaRotate is an open-source Tensorflow benchmark for performing scalable rotation detection on various datasets.

1,056

Paper
Code

RSDet++: Point-based Modulated Loss for More Accurate Rotated Object Detection

1 code implementation • 24 Sep 2021 • Wen Qian, Xue Yang, Silong Peng, Junchi Yan, Xiujuan Zhang

We classify the discontinuity of loss in both five-param and eight-param rotated object detection methods as rotation sensitivity error (RSE) which will result in performance degeneration.

Object object-detection +1

1,056

Paper
Code

An adaptive Origin-Destination flows cluster-detecting method to identify urban mobility trends

no code implementations • 10 Jun 2021 • Mengyuan Fang, Luliang Tang, Zihan Kan, Xue Yang, Tao Pei, Qingquan Li, Chaokui Li

As an important spatial analysis approach, the clustering methods of point events have been extended to OD flows to identify the dominant trends and spatial structures of urban mobility.

Clustering

Paper
Add Code

Learning High-Precision Bounding Box for Rotated Object Detection via Kullback-Leibler Divergence

2 code implementations • NeurIPS 2021 • Xue Yang, Xiaojiang Yang, Jirui Yang, Qi Ming, Wentao Wang, Qi Tian, Junchi Yan

Taking the perspective that horizontal detection is a special case for rotated object detection, in this paper, we are motivated to change the design of rotation regression loss from induction paradigm to deduction methodology, in terms of the relation between rotation and horizontal detection.

Ranked #14 on Object Detection In Aerial Images on DOTA (using extra training data)

object-detection Object Detection In Aerial Images +1

1,726

Paper
Code

Optimization for Arbitrary-Oriented Object Detection via Representation Invariance Loss

1 code implementation • 22 Mar 2021 • Qi Ming, Lingjuan Miao, Zhiqiang Zhou, Xue Yang, Yunpeng Dong

In this paper, we propose a Representation Invariance Loss (RIL) to optimize the bounding box regression for the rotating objects.

Ranked #27 on Object Detection In Aerial Images on DOTA (using extra training data)

Object object-detection +3

Paper
Code

Rethinking Rotated Object Detection with Gaussian Wasserstein Distance Loss

2 code implementations • 28 Jan 2021 • Xue Yang, Junchi Yan, Qi Ming, Wentao Wang, Xiaopeng Zhang, Qi Tian

Boundary discontinuity and its inconsistency to the final detection metric have been the bottleneck for rotating detection regression loss design.

Ranked #16 on Object Detection In Aerial Images on DOTA (using extra training data)

object-detection Object Detection In Aerial Images +2

1,726

Paper
Code

Parallel Multi-Resolution Fusion Network for Image Inpainting

no code implementations • ICCV 2021 • Wentao Wang, Jianfu Zhang, Li Niu, Haoyu Ling, Xue Yang, Liqing Zhang

Conventional deep image inpainting methods are based on auto-encoder architecture, in which the spatial details of images will be lost in the down-sampling process, leading to the degradation of generated results.

Image Inpainting

Paper
Add Code

Dense Label Encoding for Boundary Discontinuity Free Rotation Detection

3 code implementations • CVPR 2021 • Xue Yang, Liping Hou, Yue Zhou, Wentao Wang, Junchi Yan

Rotation detection serves as a fundamental building block in many visual applications involving aerial image, scene text, and face etc.

Ranked #29 on Object Detection In Aerial Images on DOTA (using extra training data)

Classification General Classification +2

1,056

Paper
Code

Rectified Decision Trees: Exploring the Landscape of Interpretable and Effective Machine Learning

no code implementations • 21 Aug 2020 • Yiming Li, Jiawang Bai, Jiawei Li, Xue Yang, Yong Jiang, Shu-Tao Xia

Interpretability and effectiveness are two essential and indispensable requirements for adopting machine learning methods in reality.

BIG-bench Machine Learning Knowledge Distillation

Paper
Add Code

SCRDet++: Detecting Small, Cluttered and Rotated Objects via Instance-Level Feature Denoising and Rotation Loss Smoothing

5 code implementations • 28 Apr 2020 • Xue Yang, Junchi Yan, Wenlong Liao, Xiaokang Yang, Jin Tang, Tao He

Instance-level denoising on the feature map is performed to enhance the detection to small and cluttered objects.

Ranked #33 on Object Detection In Aerial Images on DOTA (using extra training data)

Denoising object-detection +1

765

Paper
Code

On the Arbitrary-Oriented Object Detection: Classification based Approaches Revisited

4 code implementations • ECCV 2020 • Xue Yang, Junchi Yan

For the resulting circularly distributed angle classification problem, we first devise a Circular Smooth Label technique to handle the periodicity of angle and increase the error tolerance to adjacent angles.

Ranked #37 on Object Detection In Aerial Images on DOTA (using extra training data)

Classification General Classification +4

1,724

Paper
Code

An Accuracy-Lossless Perturbation Method for Defending Privacy Attacks in Federated Learning

1 code implementation • 23 Feb 2020 • Xue Yang, Yan Feng, Weijun Fang, Jun Shao, Xiaohu Tang, Shu-Tao Xia, Rongxing Lu

However, the strong defence ability and high learning accuracy of these schemes cannot be ensured at the same time, which will impede the wide application of FL in practice (especially for medical or financial institutions that require both high accuracy and strong privacy guarantee).

Federated Learning

Paper
Code

Learning Modulated Loss for Rotated Object Detection

2 code implementations • 19 Nov 2019 • Wen Qian, Xue Yang, Silong Peng, Yue Guo, Junchi Yan

Popular rotated detection methods usually use five parameters (coordinates of the central point, width, height, and rotation angle) to describe the rotated bounding box and l1-loss as the loss function.

Ranked #43 on Object Detection In Aerial Images on DOTA (using extra training data)

Object object-detection +2

Paper
Code

Building Change Detection for Remote Sensing Images Using a Dual Task Constrained Deep Siamese Convolutional Network Model

no code implementations • 17 Sep 2019 • Yi Liu, Chao Pang, Zongqian Zhan, Xiaomeng Zhang, Xue Yang

In recent years, building change detection methods have made great progress by introducing deep learning, but they still suffer from the problem of the extracted features not being discriminative enough, resulting in incomplete regions and irregular boundaries.

Building change detection for remote sensing images Change Detection +3

Paper
Add Code

R3Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object

10 code implementations • 15 Aug 2019 • Xue Yang, Junchi Yan, Ziming Feng, Tao He

Considering the shortcoming of feature misalignment in existing refined single-stage detector, we design a feature refinement module to improve detection performance by getting more accurate features.

object-detection Object Detection In Aerial Images

1,726

Paper
Code

Rethinking Classification and Localization for Cascade R-CNN

no code implementations • 27 Jul 2019 • Ang Li, Xue Yang, Chongyang Zhang

We extend the state-of-the-art Cascade R-CNN with a simple feature sharing mechanism.

Classification General Classification +2

Paper
Add Code

Comparison Network for One-Shot Conditional Object Detection

no code implementations • 4 Apr 2019 • Tengfei Zhang, Yue Zhang, Xian Sun, Hao Sun, Menglong Yan, Xue Yang, Kun fu

A two-stage detector for OSCD is introduced to compare the extracted query and target features with the learnable metric to approach the optimized non-linear conditional probability.

Object object-detection +1

Paper
Add Code

Multinomial Random Forest: Toward Consistency and Privacy-Preservation

no code implementations • 10 Mar 2019 • Yiming Li, Jiawang Bai, Jiawei Li, Xue Yang, Yong Jiang, Chun Li, Shu-Tao Xia

Despite the impressive performance of random forests (RF), its theoretical properties have not been thoroughly understood.

General Classification

Paper
Add Code

SCRDet: Towards More Robust Detection for Small, Cluttered and Rotated Objects

3 code implementations • ICCV 2019 • Xue Yang, Jirui Yang, Junchi Yan, Yue Zhang, Tengfei Zhang, Zhi Guo, Sun Xian, Kun fu

Specifically, a sampling fusion network is devised which fuses multi-layer feature with effective anchor sampling, to improve the sensitivity to small objects.

Ranked #47 on Object Detection In Aerial Images on DOTA (using extra training data)

object-detection Object Detection In Aerial Images

311

Paper
Code

Position Detection and Direction Prediction for Arbitrary-Oriented Ships via Multitask Rotation Region Convolutional Neural Network

3 code implementations • 13 Jun 2018 • Xue Yang, Hao Sun, Xian Sun, Menglong Yan, Zhi Guo, Kun fu

The complexity of application scenarios, the redundancy of detection region, and the difficulty of dense ship detection are all the main obstacles that limit the successful operation of traditional methods in ship detection.

Position

589

Paper
Code

Automatic Ship Detection of Remote Sensing Images from Google Earth in Complex Scenes Based on Multi-Scale Rotation Dense Feature Pyramid Networks

4 code implementations • 12 Jun 2018 • Xue Yang, Hao Sun, Kun fu, Jirui Yang, Xian Sun, Menglong Yan, Zhi Guo

Additionally, in the case of ship rotation and dense arrangement, we design a rotation anchor strategy to predict the minimum circumscribed rectangle of the object so as to reduce the redundant detection region and improve the recall.

object-detection Object Detection

589

Paper
Code

Sequence-based Multimodal Apprenticeship Learning For Robot Perception and Decision Making

no code implementations • 24 Feb 2017 • Fei Han, Xue Yang, Yu Zhang, Hao Zhang

Apprenticeship learning has recently attracted a wide attention due to its capability of allowing robots to learn physical tasks directly from demonstrations provided by human experts.

Decision Making

Paper
Add Code

Simultaneous Feature and Body-Part Learning for Real-Time Robot Awareness of Human Behaviors

no code implementations • 24 Feb 2017 • Fei Han, Xue Yang, Christopher Reardon, Yu Zhang, Hao Zhang

We formulate FABL as a regression-like optimization problem with structured sparsity-inducing norms to model interrelationships of body parts and features.

Paper
Add Code

Enforcing Template Representability and Temporal Consistency for Adaptive Sparse Tracking

no code implementations • 30 Apr 2016 • Xue Yang, Fei Han, Hua Wang, Hao Zhang

Sparse representation has been widely studied in visual tracking, which has shown promising tracking performance.

Descriptive Visual Tracking

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.