no code implementations • SIGDIAL (ACL) 2021 • Erik Ekstedt, Gabriel Skantze
The ability to take turns in a fluent way (i. e., without long response delays or frequent interruptions) is a fundamental aspect of any spoken dialog system.
no code implementations • 11 Mar 2024 • Koji Inoue, Bing'er Jiang, Erik Ekstedt, Tatsuya Kawahara, Gabriel Skantze
The results show that a monolingual VAP model trained on one language does not make good predictions when applied to other languages.
1 code implementation • 10 Jan 2024 • Koji Inoue, Bing'er Jiang, Erik Ekstedt, Tatsuya Kawahara, Gabriel Skantze
A demonstration of a real-time and continuous turn-taking prediction system is presented.
no code implementations • 29 May 2023 • Erik Ekstedt, Siyang Wang, Éva Székely, Joakim Gustafson, Gabriel Skantze
Turn-taking is a fundamental aspect of human communication where speakers convey their intention to either hold, or yield, their turn through prosodic cues.
no code implementations • 3 May 2023 • Bing'er Jiang, Erik Ekstedt, Gabriel Skantze
Treating the turn-prediction and response-ranking as a one-stage process, our findings suggest that our model can be used as an incremental response ranker, which can be applied in various settings.
no code implementations • 3 May 2023 • Bing'er Jiang, Erik Ekstedt, Gabriel Skantze
Filled pauses (or fillers), such as "uh" and "um", are frequent in spontaneous speech and can serve as a turn-holding cue for the listener, indicating that the current speaker is not done yet.
2 code implementations • SIGDIAL (ACL) 2022 • Erik Ekstedt, Gabriel Skantze
Turn-taking is a fundamental aspect of human communication and can be described as the ability to take turns, project upcoming turn shifts, and supply backchannels at appropriate locations throughout a conversation.
3 code implementations • 19 May 2022 • Erik Ekstedt, Gabriel Skantze
The modeling of turn-taking in dialog can be viewed as the modeling of the dynamics of voice activity of the interlocutors.
1 code implementation • Findings of the Association for Computational Linguistics 2020 • Erik Ekstedt, Gabriel Skantze
Syntactic and pragmatic completeness is known to be important for turn-taking prediction, but so far machine learning models of turn-taking have used such linguistic information in a limited way.