Search Results for author: Yike Zhang

Found 11 papers, 1 papers with code

SAMSNeRF: Segment Anything Model (SAM) Guides Dynamic Surgical Scene Reconstruction by Neural Radiance Field (NeRF)

no code implementations22 Aug 2023 Ange Lou, Yamin Li, Xing Yao, Yike Zhang, Jack Noble

The accurate reconstruction of surgical scenes from surgical videos is critical for various applications, including intraoperative navigation and image-guided robotic surgery automation.

Depth Estimation Position

Self-supervised Registration and Segmentation of the Ossicles with A Single Ground Truth Label

no code implementations15 Feb 2023 Yike Zhang, Jack Noble

AI-assisted surgeries have drawn the attention of the medical image research community due to their real-world impact on improving surgery success rates.

Image Segmentation Segmentation +1

Two Stage Contextual Word Filtering for Context bias in Unified Streaming and Non-streaming Transducer

no code implementations17 Jan 2023 Zhanheng Yang, Sining Sun, Xiong Wang, Yike Zhang, Long Ma, Lei Xie

In this paper, we propose an efficient approach to obtain a high quality contextual list for a unified streaming/non-streaming based E2E model.

Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR

no code implementations3 Jul 2022 Kun Wei, Yike Zhang, Sining Sun, Lei Xie, Long Ma

Then, during the training of the conversational ASR system, the extractor will be frozen to extract the textual representation of preceding speech, while such representation is used as context fed to the ASR decoder through attention mechanism.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Improving CTC-based speech recognition via knowledge transferring from pre-trained language models

1 code implementation22 Feb 2022 Keqi Deng, Songjun Cao, Yike Zhang, Long Ma, Gaofeng Cheng, Ji Xu, Pengyuan Zhang

Recently, end-to-end automatic speech recognition models based on connectionist temporal classification (CTC) have achieved impressive results, especially when fine-tuned from wav2vec2. 0 models.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Improving Speech Recognition Accuracy of Local POI Using Geographical Models

no code implementations7 Jul 2021 Songjun Cao, Yike Zhang, Xiaobing Feng, Long Ma

Secondly, a group of geo-specific language models (Geo-LMs) are integrated into our speech recognition system to improve recognition accuracy of long tail and homophone POI.

speech-recognition Speech Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.