Spoken Language Understanding
118 papers with code • 5 benchmarks • 14 datasets
Libraries
Use these libraries to find Spoken Language Understanding models and implementations.
Latest papers
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT
In this paper, we propose LauraGPT, a unified GPT model for audio recognition, understanding, and generation.
BLSP: Bootstrapping Language-Speech Pre-training via Behavior Alignment of Continuation Writing
One is a cascaded approach, in which the outputs (tokens or hidden states) of a separately trained speech recognition system are used as inputs to the LLM; this limits the potential to model alignment between speech and text.
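The cascaded approach mentioned above can be sketched as two independent stages chained by plain text. This is a hedged illustration, not the paper's implementation; `run_asr` and `run_llm` are hypothetical stand-ins for a trained recognizer and a language model.

```python
# Hedged sketch of a cascaded SLU pipeline: speech -> ASR transcript -> LLM.
# Both functions below are placeholders, not real APIs.

def run_asr(audio_waveform):
    # Placeholder: a real ASR system would decode the waveform to text here.
    return "book a table for two at seven"

def run_llm(prompt):
    # Placeholder: a real LLM would interpret the prompt here.
    return {"intent": "BookRestaurant"}

def cascaded_slu(audio_waveform):
    transcript = run_asr(audio_waveform)                      # stage 1: speech -> text
    return run_llm(f"Classify the intent: {transcript}")      # stage 2: text -> semantics
```

Because the LLM only ever sees the transcript, any speech information not captured in the text (prosody, recognition errors) is lost, which is the alignment limitation the paper points out.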
Joint Multiple Intent Detection and Slot Filling with Supervised Contrastive Learning and Self-Distillation
The results also demonstrate the contributions of both bidirectional design and the training method to the accuracy improvement.
ReCoMIF: Reading comprehension based multi-source information fusion network for Chinese spoken language understanding
It usually includes slot filling and intent detection (SFID) tasks, which aim at semantic parsing of utterances.
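The SFID formulation can be made concrete with a toy example (invented here for illustration, not taken from any cited paper): each utterance receives a single sentence-level intent label, while slot filling assigns a BIO tag to every token.

```python
# Toy SFID example: one intent label per utterance, one BIO slot tag per token.
utterance = "play jazz music by miles davis".split()
intent = "PlayMusic"  # sentence-level intent label (hypothetical label set)
slots = ["O", "B-genre", "O", "O", "B-artist", "I-artist"]

# Slot filling is token-level sequence tagging, so lengths must match.
assert len(slots) == len(utterance)

# Pair tokens with their slot tags for inspection.
tagged = list(zip(utterance, slots))
```

Joint models exploit the fact that the two labels are correlated (a `B-genre` slot makes `PlayMusic` more likely) rather than predicting them independently.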
SpeechGLUE: How Well Can Self-Supervised Speech Models Capture Linguistic Knowledge?
Self-supervised learning (SSL) for speech representation has been successfully applied in various downstream tasks, such as speech and speaker recognition.
ITALIC: An Italian Intent Classification Dataset
Recent large-scale Spoken Language Understanding datasets focus predominantly on English and do not account for language-specific phenomena such as particular phonemes or words in different lects.
Sequence-Level Knowledge Distillation for Class-Incremental End-to-End Spoken Language Understanding
The ability to learn new concepts sequentially is a major weakness for modern neural networks, which hinders their use in non-stationary environments.
Zero-Shot End-to-End Spoken Language Understanding via Cross-Modal Selective Self-Training
End-to-end (E2E) spoken language understanding (SLU) is constrained by the cost of collecting speech-semantics pairs, especially when label domains change.
Sentence Embedder Guided Utterance Encoder (SEGUE) for Spoken Language Understanding
The pre-trained speech encoder wav2vec 2.0 performs very well on various spoken language understanding (SLU) tasks.
A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks
Conformer, a convolution-augmented Transformer variant, has become the de facto encoder architecture for speech processing due to its superior performance in various tasks, including automatic speech recognition (ASR), speech translation (ST) and spoken language understanding (SLU).