no code implementations • 10 Nov 2020 • Mohamed Mhiri, Samuel Myer, Vikrant Singh Tomar
In recent years, developing a speech understanding system that classifies a waveform to structured data, such as intents and slots, without first transcribing the speech to text has emerged as an interesting research problem.
1 code implementation • 7 Apr 2019 • Loren Lugosch, Mirco Ravanelli, Patrick Ignoto, Vikrant Singh Tomar, Yoshua Bengio
Whereas conventional spoken language understanding (SLU) systems map speech to text, and then text to intent, end-to-end SLU systems map speech directly to intent through a single trainable model.
Ranked #15 on Spoken Language Understanding on Fluent Speech Commands (using extra training data)
1 code implementation • 26 Nov 2018 • Loren Lugosch, Samuel Myer, Vikrant Singh Tomar
Keyword spotting--or wakeword detection--is an essential feature for hands-free operation of modern voice-controlled devices.
no code implementations • 19 Jun 2016 • Vikrant Singh Tomar, Richard C. Rose
In this framework, the parameters of the network are optimized to preserve underlying manifold based relationships between speech feature vectors while minimizing a measure of loss between network outputs and targets.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1