Search Results for author: Xun Gong

Found 19 papers, 3 papers with code

Generating Multi-Center Classifier via Conditional Gaussian Distribution

1 code implementation • 29 Jan 2024 • Zhemin Zhang, Xun Gong

Specifically, we create a conditional Gaussian distribution for each class and then sample multiple sub-centers from that distribution to extend the linear classifier.

Image Classification

Vision Big Bird: Random Sparsification for Full Attention

no code implementations • 10 Nov 2023 • Zhemin Zhang, Xun Gong

Inspired by Big Bird, one of the most successful transformer-based models for NLP, we propose a novel sparse attention mechanism for Vision Transformers (ViT).

Adversarial Driving Behavior Generation Incorporating Human Risk Cognition for Autonomous Vehicle Evaluation

no code implementations • 29 Sep 2023 • Zhen Liu, Hang Gao, Hao Ma, Shuo Cai, Yunfeng Hu, Ting Qu, Hong Chen, Xun Gong

Autonomous vehicle (AV) evaluation has been the subject of increased interest in recent years both in industry and in academia.

Reinforcement Learning (RL)

On Data-Driven Modeling and Control in Modern Power Grids Stability: Survey and Perspective

no code implementations • 7 Aug 2023 • Xun Gong, Xiaozhe Wang, Bo Cao

Modern power grids are evolving fast with increasingly volatile renewable generation, distributed energy resources (DERs), and time-varying operating conditions.

Whisper-KDQ: A Lightweight Whisper via Guided Knowledge Distillation and Quantization for Efficient ASR

no code implementations • 18 May 2023 • Hang Shao, Wei Wang, Bei Liu, Xun Gong, Haoyu Wang, Yanmin Qian

Due to the rapid development of computing hardware resources and the dramatic growth of data, pre-trained models in speech recognition, such as Whisper, have significantly improved the performance of speech recognition tasks.

Knowledge Distillation • Quantization +2

RSIR Transformer: Hierarchical Vision Transformer using Random Sampling Windows and Important Region Windows

no code implementations • 13 Apr 2023 • Zhemin Zhang, Xun Gong

Recently, Transformers have shown promising performance in various vision tasks.

LongFNT: Long-form Speech Recognition with Factorized Neural Transducer

no code implementations • 17 Nov 2022 • Xun Gong, Yu Wu, Jinyu Li, Shujie Liu, Rui Zhao, Xie Chen, Yanmin Qian

This motivates us to leverage the factorized neural transducer structure, which contains a real language model: the vocabulary predictor.

Automatic Speech Recognition (ASR) +3

BoundaryFace: A mining framework with noise label self-correction for Face Recognition

1 code implementation • 10 Oct 2022 • Shijie Wu, Xun Gong

Specifically, a closed-set noise label self-correction module is put forward, making this framework work well on datasets containing a lot of label noise.

Face Recognition

SpeechLM: Enhanced Speech Pre-Training with Unpaired Textual Data

1 code implementation • 30 Sep 2022 • Ziqiang Zhang, Sanyuan Chen, Long Zhou, Yu Wu, Shuo Ren, Shujie Liu, Zhuoyuan Yao, Xun Gong, LiRong Dai, Jinyu Li, Furu Wei

In this paper, we propose a cross-modal Speech and Language Model (SpeechLM) to explicitly align speech and text pre-training with a pre-defined unified discrete representation.

Language Modelling • Speech Recognition +1

Axially Expanded Windows for Local-Global Interaction in Vision Transformers

no code implementations • 19 Sep 2022 • Zhemin Zhang, Xun Gong

Recently, Transformers have shown promising performance in various vision tasks.

An Online Data-Driven Method for Microgrid Secondary Voltage and Frequency Control with Ensemble Koopman Modeling

no code implementations • 11 Jul 2022 • Xun Gong, Xiaozhe Wang, Geza Joos

Low inertia, nonlinearity and a high level of uncertainty (varying topologies and operating conditions) pose challenges to microgrid (MG) systemwide operation.

Event Detection

Self-Supervised Implicit Attention: Guided Attention by The Model Itself

no code implementations • 15 Jun 2022 • Jinyi Wu, Xun Gong, Zhemin Zhang

To verify the effectiveness of SSIA, we performed a particular implementation (called an SSIA block) in convolutional neural network models and validated it on several image classification datasets.

Image Classification • Self-Supervised Learning

Positional Label for Self-Supervised Vision Transformer

no code implementations • 10 Jun 2022 • Zhemin Zhang, Xun Gong

Positional encoding is important for a vision transformer (ViT) to capture the spatial structure of the input image.

Position

ReplaceBlock: An improved regularization method based on background information

no code implementations • 30 Mar 2022 • Zhemin Zhang, Xun Gong, Jinyi Wu

In this way, ReplaceBlock can effectively simulate the feature map of the occluded image.

Object

The Fixed Sub-Center: A Better Way to Capture Data Complexity

no code implementations • 24 Mar 2022 • Zhemin Zhang, Xun Gong

Specifically, the F-SC first samples a class center U_i for each class from a uniform distribution and then generates a normal distribution for each class whose mean is equal to U_i.

Image Classification
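
The sampling step described in this abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, the uniform range [-1, 1], the standard deviation `sigma`, the number of sub-centers, and the max-over-sub-centers scoring are all assumptions made for illustration.

```python
import numpy as np


def sample_fixed_sub_centers(num_classes, num_sub_centers, dim, sigma=0.1, seed=0):
    """Sample one center per class, then sub-centers around each center."""
    rng = np.random.default_rng(seed)
    # One class center per class from a uniform distribution
    # (the range [-1, 1] is an assumption, not stated in the abstract).
    centers = rng.uniform(-1.0, 1.0, size=(num_classes, dim))
    # Sub-centers from a normal distribution whose mean is the class center
    # (sigma and num_sub_centers are assumed hyperparameters).
    noise = rng.normal(0.0, sigma, size=(num_classes, num_sub_centers, dim))
    sub_centers = centers[:, None, :] + noise
    return centers, sub_centers


def class_scores(features, sub_centers):
    """Score each class by its best-matching sub-center (assumed pooling rule)."""
    # features: (batch, dim); sub_centers: (classes, sub_centers, dim)
    sims = np.einsum("bd,ckd->bck", features, sub_centers)
    return sims.max(axis=2)  # shape: (batch, classes)
```

For example, `class_scores(x, sample_fixed_sub_centers(10, 3, 64)[1])` returns one score per class for each row of a hypothetical feature matrix `x` of shape `(batch, 64)`. Because the sub-centers are sampled once and then held constant in this sketch, they add no trainable parameters.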

A Two-Stage Data-Free Adversarial Patch Generation Framework

no code implementations • 29 Sep 2021 • Jiawei Liu, Hang Gao, Yunfeng Hu, Xun Gong

The proxy dataset selection stage calculates the proposed average patch saliency (APS) of each available dataset to select a high-APS proxy dataset that can guarantee patches' fooling abilities.

Vocal Bursts Valence Prediction

Achievable Rates of Opportunistic Cognitive Radio Systems Using Reconfigurable Antennas with Imperfect Sensing and Channel Estimation

no code implementations • 8 Jul 2020 • Hassan Yazdani, Azadeh Vosoughi, Xun Gong

We establish a lower bound on the achievable rates of the SUtx-SUrx link in the presence of spectrum sensing and channel estimation errors, as well as errors due to incorrect detection of the beam corresponding to the PU's location and incorrect selection of the strongest beam for data transmission.

A Hybrid Method for Traffic Flow Forecasting Using Multimodal Deep Learning

no code implementations • 6 Mar 2018 • Shengdong Du, Tianrui Li, Xun Gong, Shi-Jinn Horng

Traffic flow forecasting has been regarded as a key problem of intelligent transport systems.

Multimodal Deep Learning

Three-Stream Convolutional Networks for Video-based Person Re-Identification

no code implementations • 22 Nov 2017 • Zeng Yu, Tianrui Li, Ning Yu, Xun Gong, Ke Chen, Yi Pan

This paper aims to develop a new architecture that can make full use of the feature maps of convolutional networks.

Video-Based Person Re-Identification
