Search Results for author: Weilong Huang

Found 3 papers, 1 papers with code

Speaker Embedding-aware Neural Diarization for Flexible Number of Speakers with Textual Information

2 code implementations28 Nov 2021 Zhihao Du, Shiliang Zhang, Siqi Zheng, Weilong Huang, Ming Lei

In this paper, we reformulate this task as a single-label prediction problem by encoding the multi-speaker labels with power set.

Action Detection Activity Detection +2

BeamTransformer: Microphone Array-based Overlapping Speech Detection

no code implementations9 Sep 2021 Siqi Zheng, Shiliang Zhang, Weilong Huang, Qian Chen, Hongbin Suo, Ming Lei, Jinwei Feng, Zhijie Yan

We propose BeamTransformer, an efficient architecture to leverage beamformer's edge in spatial filtering and transformer's capability in context sequence modeling.

A Real-time Speaker Diarization System Based on Spatial Spectrum

no code implementations20 Jul 2021 Siqi Zheng, Weilong Huang, Xianliang Wang, Hongbin Suo, Jinwei Feng, Zhijie Yan

In this paper we describe a speaker diarization system that enables localization and identification of all speakers present in a conversation or meeting.

speaker-diarization Speaker Diarization +1

Cannot find the paper you are looking for? You can Submit a new open access paper.