Keyword Spotting

96 papers with code • 10 benchmarks • 8 datasets

In speech processing, keyword spotting deals with the identification of keywords in utterances.

( Image credit: Simon Grest )

Benchmarks

Add a Result

These leaderboards are used to track progress in Keyword Spotting

Dataset	Best Model	Compare
QUESST	ELiRF Fusion+Length(All Queries)	See all
Google Speech Commands	TripletLoss-res15	See all
hey Siri	HEiMDaL	See all
TensorFlow	TensorFlow's model version 2	See all
TAU Urban Acoustic Scenes 2019	CP-ResNet(ch64) w/ SSN(S=2, A=Sub)	See all
VoxForge	1D-ConvNet	See all
FKD	Res26	See all
Google Speech Commands V2 12	MicroNet-KWS-L	See all
Google Speech Commands V2 35	QuaternionNeuralNetwork	See all
Google Speech Commands (v2)	Quaternion Neural Networks	See all

Libraries

Use these libraries to find Keyword Spotting models and implementations

PaddlePaddle/PaddleSpeech

2 papers

10,168

holgerbovbjerg/data2vec-kws

2 papers

Datasets

Subtasks

Most implemented papers

Most implemented Social Latest No code

MLPerf Tiny Benchmark

mlcommons/tiny • • 14 Jun 2021

Advancements in ultra-low-power tiny machine learning (TinyML) systems promise to unlock an entirely new class of smart applications.

Paper
Code

SSAST: Self-Supervised Audio Spectrogram Transformer

YuanGongND/ssast • • 19 Oct 2021

However, pure Transformer models tend to require more training data compared to CNNs, and the success of the AST relies on supervised pretraining that requires a large amount of labeled data and a complex training pipeline, thus limiting the practical usage of AST.

Paper
Code

Progressive Continual Learning for Spoken Keyword Spotting

huangyz0918/kws-continual-learning • • 29 Jan 2022

Catastrophic forgetting is a thorny challenge when updating keyword spotting (KWS) models after deployment.

Paper
Code

PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit

PaddlePaddle/PaddleSpeech • • NAACL (ACL) 2022

PaddleSpeech is an open-source all-in-one speech toolkit.

Paper
Code

Improving Label-Deficient Keyword Spotting Through Self-Supervised Pretraining

holgerbovbjerg/data2vec-kws • • 4 Oct 2022

This paper explores the effectiveness of SSL on small models for KWS and establishes that SSL can enhance the performance of small KWS models when labelled data is scarce.

Paper
Code

What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision

malmaud/whats_cookin • 5 Mar 2015

We present a novel method for aligning a sequence of instructions to a video of someone carrying out a task.

Paper
Code

What’s Cookin’? Interpreting Cooking Videos using Text, Speech and Vision

malmaud/whats_cookin • HLT 2015

Paper
Code

GTM-UVigo Systems for the Query-by-Example Search on Speech Task at MediaEval 2015

gtm-uvigo/MediaEval_QUESST2015 • MediaEval 2015 Workshop 2015

In this paper, we present the systems developed by GTMUVigo team for the query by example search on speech task (QUESST) at MediaEval 2015.

Paper
Code

Zero-shot keyword spotting for visual speech recognition in-the-wild

lilianemomeni/KWS-Net • • ECCV 2018

Visual keyword spotting (KWS) is the problem of estimating whether a text query occurs in a given recording using only video information.

Paper
Code

Federated Learning for Keyword Spotting

snipsco/keyword-spotting-research-datasets • 9 Oct 2018

We propose a practical approach based on federated learning to solve out-of-domain issues with continuously running embedded speech-based models such as wake word detectors.

Paper
Code

Keyword Spotting

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result