Spoken Language Understanding

118 papers with code • 5 benchmarks • 14 datasets

This task has no description! Would you like to contribute one?

Libraries

Use these libraries to find Spoken Language Understanding models and implementations

LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT

alibaba-damo-academy/funcodec 7 Oct 2023

In this paper, we propose LauraGPT, a unified GPT model for audio recognition, understanding, and generation.

280
07 Oct 2023

BLSP: Bootstrapping Language-Speech Pre-training via Behavior Alignment of Continuation Writing

cwang621/blsp 2 Sep 2023

One is a cascaded approach where outputs (tokens or states) of a separately trained speech recognition system are used as inputs for LLMs, which limits their potential in modeling alignment between speech and text.

36
02 Sep 2023

Joint Multiple Intent Detection and Slot Filling with Supervised Contrastive Learning and Self-Distillation

anhtunguyen98/bislu 28 Aug 2023

The results also demonstrate the contributions of both bidirectional design and the training method to the accuracy improvement.

3
28 Aug 2023

ReCoMIF: Reading comprehension based multi-source information fusion network for Chinese spoken language understanding

1053399472/CAISandSMP journal 2023

It usually includes slot filling and intent detection (SFID) tasks aiming at semantic parsing of utterances.

5
01 Aug 2023

SpeechGLUE: How Well Can Self-Supervised Speech Models Capture Linguistic Knowledge?

ashi-ta/speechglue 14 Jun 2023

Self-supervised learning (SSL) for speech representation has been successfully applied in various downstream tasks, such as speech and speaker recognition.

13
14 Jun 2023

ITALIC: An Italian Intent Classification Dataset

rita-nlp/italic 14 Jun 2023

Recent large-scale Spoken Language Understanding datasets focus predominantly on English and do not account for language-specific phenomena such as particular phonemes or words in different lects.

10
14 Jun 2023

Sequence-Level Knowledge Distillation for Class-Incremental End-to-End Spoken Language Understanding

umbertocappellazzo/slurp-seqkd 23 May 2023

The ability to learn new concepts sequentially is a major weakness for modern neural networks, which hinders their use in non-stationary environments.

1
23 May 2023

Zero-Shot End-to-End Spoken Language Understanding via Cross-Modal Selective Self-Training

amazon-science/zero-shot-e2e-slu 22 May 2023

End-to-end (E2E) spoken language understanding (SLU) is constrained by the cost of collecting speech-semantics pairs, especially when label domains change.

6
22 May 2023

Sentence Embedder Guided Utterance Encoder (SEGUE) for Spoken Language Understanding

declare-lab/segue 20 May 2023

The pre-trained speech encoder wav2vec 2. 0 performs very well on various spoken language understanding (SLU) tasks.

6
20 May 2023

A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks

espnet/espnet 18 May 2023

Conformer, a convolution-augmented Transformer variant, has become the de facto encoder architecture for speech processing due to its superior performance in various tasks, including automatic speech recognition (ASR), speech translation (ST) and spoken language understanding (SLU).

7,903
18 May 2023