no code implementations • 31 Jul 2023 • Triantafyllos Kefalas, Yannis Panagakis, Maja Pantic
This task implicitly assumes that the sound signal is either missing or so heavily corrupted by noise that it is not useful for processing.
no code implementations • 27 Jun 2023 • Triantafyllos Kefalas, Yannis Panagakis, Maja Pantic
Most established approaches to date involve a two-step process, whereby an intermediate representation, such as a spectrogram, is first extracted from the video and then passed to a vocoder to produce the raw audio.
no code implementations • 12 Dec 2019 • Triantafyllos Kefalas, Konstantinos Vougioukas, Yannis Panagakis, Stavros Petridis, Jean Kossaifi, Maja Pantic
Speech-driven facial animation involves using a speech signal to generate realistic videos of talking faces.