Speech
498 benchmarks • 78 tasks • 222 datasets • 2885 papers with code
Text Generation
Text Generation
274 benchmarks
1453 papers with code
Dialogue Generation
15 benchmarks
230 papers with code
Data-to-Text Generation
42 benchmarks
103 papers with code
Multi-Document Summarization
5 benchmarks
93 papers with code
Text Style Transfer
3 benchmarks
80 papers with code
See all 26 tasks
Speech Recognition
Speech Recognition
359 benchmarks
1078 papers with code
Automatic Speech Recognition (ASR)
7 benchmarks
478 papers with code
Visual Speech Recognition
10 benchmarks
40 papers with code
Robust Speech Recognition
22 papers with code
Distant Speech Recognition
2 benchmarks
10 papers with code
See all 11 tasks
Speech Emotion Recognition
Vocal Bursts Intensity Prediction
1 benchmark
762 papers with code
Vocal Bursts Valence Prediction
1 benchmark
260 papers with code
Vocal Bursts Type Prediction
1 benchmark
155 papers with code
Cultural Vocal Bursts Intensity Prediction
1 benchmark
93 papers with code
Emotion Recognition
Emotion Recognition
54 benchmarks
441 papers with code
Speech Emotion Recognition
18 benchmarks
95 papers with code
Emotion Recognition in Conversation
12 benchmarks
67 papers with code
Multimodal Emotion Recognition
3 benchmarks
48 papers with code
Emotion-Cause Pair Extraction
2 benchmarks
17 papers with code
See all 13 tasks
Dialogue
Dialogue Generation
15 benchmarks
230 papers with code
Dialogue State Tracking
7 benchmarks
123 papers with code
Task-Oriented Dialogue Systems
6 benchmarks
117 papers with code
Visual Dialog
8 benchmarks
53 papers with code
Dialogue Understanding
18 benchmarks
29 papers with code
See all 22 tasks
Conformal Prediction
141 papers with code
Text Simplification
11 benchmarks
115 papers with code
Music Source Separation
3 benchmarks
53 papers with code
Audio Source Separation
8 benchmarks
44 papers with code
Decision Making Under Uncertainty
42 papers with code
See all 9 tasks
Chatbot
Dialogue Generation
15 benchmarks
230 papers with code
Chatbot
15 benchmarks
162 papers with code
Speech Synthesis
Speech Synthesis
16 benchmarks
286 papers with code
Expressive Speech Synthesis
11 papers with code
Emotional Speech Synthesis
3 papers with code
Text-to-Speech Translation
2 papers with code
Speech Synthesis - Assamese
1 benchmark
1 paper with code
See all 16 tasks
Speech Enhancement
Speech Enhancement
17 benchmarks
214 papers with code
Speech Dereverberation
4 benchmarks
16 papers with code
Bandwidth Extension
1 benchmark
14 papers with code
Packet Loss Concealment
4 papers with code
Speech Intelligibility Evaluation
Speaker Verification
Speaker Verification
5 benchmarks
169 papers with code
Text-Independent Speaker Verification
17 papers with code
Text-Dependent Speaker Verification
2 papers with code
Voice Conversion
Voice Conversion
2 benchmarks
147 papers with code
Spoken Language Understanding
Spoken Language Understanding
17 benchmarks
114 papers with code
Spoken Language Identification
12 benchmarks
11 papers with code
Keyword Spotting
Keyword Spotting
13 benchmarks
92 papers with code
Small-Footprint Keyword Spotting
7 papers with code
Visual Keyword Spotting
3 benchmarks
4 papers with code
Speech Separation
Speech Separation
19 benchmarks
94 papers with code
Speech Extraction
1 benchmark
7 papers with code
Audio Generation
Audio Generation
7 benchmarks
60 papers with code
Voice Cloning
17 papers with code
Audio Super-Resolution
4 benchmarks
13 papers with code
Room Impulse Response (RIR)
9 papers with code
Text-To-Speech Synthesis
Text-To-Speech Synthesis
7 benchmarks
90 papers with code
Prosody Prediction
1 benchmark
2 papers with code
Zero-Shot Multi-Speaker TTS
2 papers with code
Cultural Vocal Bursts Intensity Prediction
Cultural Vocal Bursts Intensity Prediction
1 benchmark
93 papers with code
Speaker Recognition
Speaker Recognition
1 benchmark
89 papers with code
Speaker Diarization
Speaker Diarization
12 benchmarks
73 papers with code
Speaker Identification
Speaker Identification
4 benchmarks
61 papers with code
Audio-Visual Speech Recognition
Audio-Visual Speech Recognition
3 benchmarks
27 papers with code
Speech Denoising
Speech Denoising
2 benchmarks
27 papers with code
Speech-to-Speech Translation
Speech-to-Speech Translation
1 benchmark
26 papers with code
Spoken Dialogue Systems
Spoken Dialogue Systems
19 papers with code
Singing Voice Synthesis
Singing Voice Synthesis
18 papers with code
Speaker Separation
Speaker Separation
11 papers with code
Multi-Speaker Source Separation
6 papers with code
Acoustic Echo Cancellation
Acoustic Echo Cancellation
13 papers with code
Acoustic Modelling
Acoustic Modelling
10 papers with code
Pronunciation Assessment
Phone-Level Pronunciation Scoring
1 benchmark
5 papers with code
Utterance-Level Pronunciation Scoring
1 benchmark
2 papers with code
Word-Level Pronunciation Scoring
1 benchmark
2 papers with code
Pronunciation Assessment
Text-Independent Speaker Recognition
Text-Independent Speaker Recognition
6 papers with code
Unsupervised Speech Recognition
Unsupervised Speech Recognition
6 papers with code
Spoken Command Recognition
Spoken Command Recognition
1 benchmark
5 papers with code
Voice Similarity
Voice Similarity
3 papers with code
Manner Of Articulation Detection
Manner Of Articulation Detection
2 papers with code
Speaker Profiling
Speaker Profiling
2 papers with code
Acoustic Question Answering
Acoustic Question Answering
1 paper with code
Speech-to-Gesture Translation
Speech-to-Gesture Translation
1 paper with code
Voice Query Recognition
Voice Query Recognition
1 benchmark
1 paper with code
Speaking Style Synthesis
Speaking Style Synthesis