1 code implementation • 29 Jan 2024 • Zhemin Zhang, Xun Gong
Specifically, we create a conditional Gaussian distribution for each class and then sample multiple sub-centers from that distribution to extend the linear classifier.
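As a rough illustration of the idea in this snippet, the sketch below draws several sub-centers per class from a class-conditional Gaussian and scores a feature against them. It is not the authors' implementation: the function names, the isotropic `sigma`, the use of cosine similarity, and the max-pooling over sub-centers are all assumptions, since the snippet only says that sub-centers extend the linear classifier.

```python
import numpy as np

def sample_sub_centers(class_means, sigma, num_sub_centers, rng):
    """Draw sub-centers for each class from an isotropic Gaussian (assumed form).

    class_means: array of shape (C, D), one mean per class.
    Returns an array of shape (C, K, D) with K = num_sub_centers.
    """
    num_classes, dim = class_means.shape
    noise = rng.standard_normal((num_classes, num_sub_centers, dim))
    return class_means[:, None, :] + sigma * noise

def sub_center_logits(features, sub_centers):
    """Score features against every sub-center, then keep the best per class.

    features: (N, D); sub_centers: (C, K, D). Returns logits of shape (N, C).
    Max-pooling over sub-centers is a hypothetical choice for this sketch.
    """
    f = features / np.linalg.norm(features, axis=1, keepdims=True)
    w = sub_centers / np.linalg.norm(sub_centers, axis=2, keepdims=True)
    sims = np.einsum("nd,ckd->nck", f, w)  # cosine similarity to each sub-center
    return sims.max(axis=2)
```

With class means set to the rows of an identity matrix and a small `sigma`, each mean is scored highest for its own class, which is a quick sanity check on the shapes and pooling.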
no code implementations • 10 Nov 2023 • Zhemin Zhang, Xun Gong
Inspired by Big Bird, one of the most successful Transformer-based models for NLP, we propose a novel sparse attention mechanism for Vision Transformers (ViT).
no code implementations • 29 Sep 2023 • Zhen Liu, Hang Gao, Hao Ma, Shuo Cai, Yunfeng Hu, Ting Qu, Hong Chen, Xun Gong
Autonomous vehicle (AV) evaluation has been the subject of increased interest in recent years both in industry and in academia.
no code implementations • 7 Aug 2023 • Xun Gong, Xiaozhe Wang, Bo Cao
Modern power grids are evolving rapidly with increasingly volatile renewable generation, distributed energy resources (DERs), and time-varying operating conditions.
no code implementations • 18 May 2023 • Hang Shao, Wei Wang, Bei Liu, Xun Gong, Haoyu Wang, Yanmin Qian
Due to the rapid development of computing hardware resources and the dramatic growth of data, pre-trained models in speech recognition, such as Whisper, have significantly improved the performance of speech recognition tasks.
no code implementations • 13 Apr 2023 • Zhemin Zhang, Xun Gong
Recently, Transformers have shown promising performance in various vision tasks.
no code implementations • 17 Nov 2022 • Xun Gong, Yu Wu, Jinyu Li, Shujie Liu, Rui Zhao, Xie Chen, Yanmin Qian
This motivates us to leverage the factorized neural transducer structure, containing a real language model, the vocabulary predictor.
1 code implementation • 10 Oct 2022 • Shijie Wu, Xun Gong
Specifically, a closed-set noise label self-correction module is put forward, making this framework work well on datasets containing a lot of label noise.
1 code implementation • 30 Sep 2022 • Ziqiang Zhang, Sanyuan Chen, Long Zhou, Yu Wu, Shuo Ren, Shujie Liu, Zhuoyuan Yao, Xun Gong, LiRong Dai, Jinyu Li, Furu Wei
In this paper, we propose a cross-modal Speech and Language Model (SpeechLM) to explicitly align speech and text pre-training with a pre-defined unified discrete representation.
no code implementations • 19 Sep 2022 • Zhemin Zhang, Xun Gong
Recently, Transformers have shown promising performance in various vision tasks.
no code implementations • 11 Jul 2022 • Xun Gong, Xiaozhe Wang, Geza Joos
Low inertia, nonlinearity and a high level of uncertainty (varying topologies and operating conditions) pose challenges to microgrid (MG) systemwide operation.
no code implementations • 15 Jun 2022 • Jinyi Wu, Xun Gong, Zhemin Zhang
To verify the effectiveness of SSIA, we performed a particular implementation (called an SSIA block) in convolutional neural network models and validated it on several image classification datasets.
no code implementations • 10 Jun 2022 • Zhemin Zhang, Xun Gong
Positional encoding is important for a vision transformer (ViT) to capture the spatial structure of the input image.
no code implementations • 30 Mar 2022 • Zhemin Zhang, Xun Gong, Jinyi Wu
In this way, ReplaceBlock can effectively simulate the feature map of the occluded image.
no code implementations • 24 Mar 2022 • Zhemin Zhang, Xun Gong
Specifically, F-SC first samples a class center Ui for each class from a uniform distribution, and then generates a normal distribution for each class whose mean is equal to Ui.
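The two sampling steps described here can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's code: the function name, the uniform range [-1, 1), the isotropic standard deviation `sigma`, and the number of samples per class are all hypothetical, because the snippet specifies only that centers come from a uniform distribution and that each class's normal distribution has mean Ui.

```python
import numpy as np

def sample_class_features(num_classes, dim, samples_per_class, sigma=0.1, seed=0):
    """Sample class centers U_i uniformly, then draw features around each center."""
    rng = np.random.default_rng(seed)
    # Step 1: one center U_i per class from a uniform distribution
    # (the range [-1, 1) is an assumption made for this sketch).
    centers = rng.uniform(-1.0, 1.0, size=(num_classes, dim))
    # Step 2: one normal distribution per class with mean U_i
    # (isotropic, with a hypothetical standard deviation sigma).
    noise = rng.normal(0.0, sigma, size=(num_classes, samples_per_class, dim))
    features = centers[:, None, :] + noise
    return centers, features
```

Averaging the sampled features of a class should recover its center Ui up to sampling noise, which gives a simple check that the means are wired correctly.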
no code implementations • 29 Sep 2021 • Jiawei Liu, Hang Gao, Yunfeng Hu, Xun Gong
The proxy dataset selection stage calculates the proposed average patch saliency (APS) of each available dataset to select a high-APS proxy dataset that can guarantee patches' fooling abilities.
no code implementations • 8 Jul 2020 • Hassan Yazdani, Azadeh Vosoughi, Xun Gong
We establish a lower bound on the achievable rates of SUtx-SUrx link, in the presence of spectrum sensing and channel estimation errors, and errors due to incorrect detection of the beam corresponding to PU's location and incorrect selection of the strongest beam for data transmission.
no code implementations • 6 Mar 2018 • Shengdong Du, Tianrui Li, Xun Gong, Shi-Jinn Horng
Traffic flow forecasting has been regarded as a key problem of intelligent transport systems.
no code implementations • 22 Nov 2017 • Zeng Yu, Tianrui Li, Ning Yu, Xun Gong, Ke Chen, Yi Pan
This paper aims to develop a new architecture that can make full use of the feature maps of convolutional networks.