no code implementations • 27 Sep 2023 • Avamarie Brueggeman, Takuya Higuchi, Masood Delfarah, Stephen Shum, Vineet Garg
Our investigation reveals that SE can improve KWS accuracy on noisy speech when the backend model is trained on clean speech; however, despite our extensive exploration, it is difficult to improve the KWS accuracy with SE when the backend is trained on noisy speech.
no code implementations • 27 Sep 2023 • Takuya Higuchi, Avamarie Brueggeman, Masood Delfarah, Stephen Shum
Voice triggering (VT) enables users to activate their devices by just speaking a trigger phrase.