Search Results for author: Xilai Li

Found 12 papers, 5 papers with code

SAMF: Small-Area-Aware Multi-focus Image Fusion for Object Detection

1 code implementation 16 Jan 2024 Xilai Li, Xiaosong Li, Haishu Tan, Jinyang Li

Existing multi-focus image fusion (MFIF) methods often fail to preserve the uncertain transition region and detect small focus areas within large defocused regions accurately.

object-detection Object Detection +2

Bridging the Gap between Multi-focus and Multi-modal: A Focused Integration Framework for Multi-modal Image Fusion

1 code implementation 3 Nov 2023 Xilai Li, Xiaosong Li, Tao Ye, Xiaoqi Cheng, Wuyang Liu, Haishu Tan

However, the fusion of multiple visible images with different focal regions and infrared images is an unprecedented challenge in real MMIF applications.

Depth Estimation object-detection +1

Masked Audio Text Encoders are Effective Multi-Modal Rescorers

no code implementations 11 May 2023 Jinglun Cai, Monica Sunkara, Xilai Li, Anshu Bhatia, Xiao Pan, Sravan Bodapati

Masked Language Models (MLMs) have proven to be effective for second-pass rescoring in Automatic Speech Recognition (ASR) systems.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Dynamic Chunk Convolution for Unified Streaming and Non-Streaming Conformer ASR

no code implementations 18 Apr 2023 Xilai Li, Goeric Huybrechts, Srikanth Ronanki, Jeff Farris, Sravan Bodapati

Overall, our proposed model reduces the degradation of the streaming mode over the non-streaming full-contextual model from 41.7% and 45.7% to 16.7% and 26.2% on the LibriSpeech test-clean and test-other datasets respectively, while improving by a relative 15.5% WER over the previous state-of-the-art unified model.

speech-recognition Speech Recognition

Attentive Normalization

2 code implementations ECCV 2020 Xilai Li, Wei Sun, Tianfu Wu

In state-of-the-art deep neural networks, both feature normalization and feature attention have become ubiquitous.

Image Classification Instance Segmentation +3

Learn to Grow: A Continual Structure Learning Framework for Overcoming Catastrophic Forgetting

no code implementations 31 Mar 2019 Xilai Li, Yingbo Zhou, Tianfu Wu, Richard Socher, Caiming Xiong

Addressing catastrophic forgetting is one of the key challenges in continual learning where machine learning systems are trained with sequential or streaming tasks.

Continual Learning Neural Architecture Search +1

AOGNets: Compositional Grammatical Architectures for Deep Learning

4 code implementations CVPR 2019 Xilai Li, Xi Song, Tianfu Wu

This paper presents deep compositional grammatical architectures which harness the best of two worlds: grammar models and DNNs.

Adversarial Defense Image Classification +4

Towards Interpretable R-CNN by Unfolding Latent Structures

1 code implementation 14 Nov 2017 Tianfu Wu, Wei Sun, Xilai Li, Xi Song, Bo Li

We focus on weakly-supervised extractive rationale generation, that is, learning to unfold latent discriminative part configurations of object instances automatically and simultaneously in detection without using any supervision for part configurations.

object-detection Object Detection
