no code implementations • 10 Apr 2023 • Daniel Ortega, Chia-Yu Li, Ngoc Thang Vu
This paper presents our latest investigation into modeling backchannels in conversations.
no code implementations • 20 Oct 2022 • Chia-Yu Li, Ngoc Thang Vu
In this paper, we exploit the advantages of both inter-domain loss and CycleGAN to achieve a better shared representation of unpaired speech and text inputs and thus improve the speech-to-text mapping.
Automatic Speech Recognition (ASR)
no code implementations • 19 Dec 2021 • Chia-Yu Li, Ngoc Thang Vu
Code-Switching (CS) is a common linguistic phenomenon in multilingual communities that consists of switching between languages while speaking.
Automatic Speech Recognition (ASR)
no code implementations • 12 Dec 2021 • Chia-Yu Li, Ngoc Thang Vu
This paper presents our latest effort to improve code-switching language models, which suffer from data scarcity.
Automatic Speech Recognition (ASR)
no code implementations • 12 Dec 2021 • Chia-Yu Li, Ngoc Thang Vu
This paper presents our latest investigations on improving automatic speech recognition for noisy speech via speech enhancement.
Automatic Speech Recognition (ASR)
no code implementations • 29 Aug 2021 • Injy Hamed, Pavel Denisov, Chia-Yu Li, Mohamed Elmahdy, Slim Abdennadher, Ngoc Thang Vu
In this paper, we present our work on code-switched Egyptian Arabic-English automatic speech recognition (ASR).
Automatic Speech Recognition (ASR)
1 code implementation • ACL 2020 • Chia-Yu Li, Daniel Ortega, Dirk Väth, Florian Lux, Lindsey Vanderlyn, Maximilian Schmidt, Michael Neumann, Moritz Völkel, Pavel Denisov, Sabrina Jenne, Zorica Kacarevic, Ngoc Thang Vu
We present ADVISER, an open-source, multi-domain dialog system toolkit that enables the development of multi-modal (incorporating speech, text, and vision), socially engaged (e.g., emotion recognition, engagement level prediction, and backchanneling) conversational agents.
no code implementations • 28 Feb 2019 • Daniel Ortega, Chia-Yu Li, Gisela Vallejo, Pavel Denisov, Ngoc Thang Vu
This paper presents our latest investigations on dialog act (DA) classification on automatically generated transcriptions.
Automatic Speech Recognition (ASR)