Search Results for author: Konstantinos Vougioukas

Found 11 papers, 4 papers with code

Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models

1 code implementation15 May 2023 Antoni Bigata Casademunt, Rodrigo Mira, Nikita Drobyshev, Konstantinos Vougioukas, Stavros Petridis, Maja Pantic

Speech-driven animation has gained significant traction in recent years, with current methods achieving near-photorealistic results.

Face Generation

SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision

no code implementations CVPR 2023 Xubo Liu, Egor Lakomkin, Konstantinos Vougioukas, Pingchuan Ma, Honglie Chen, Ruiming Xie, Morrie Doulaty, Niko Moritz, Jáchym Kolář, Stavros Petridis, Maja Pantic, Christian Fuegen

Furthermore, when combined with large-scale pseudo-labeled audio-visual data SynthVSR yields a new state-of-the-art VSR WER of 16. 9% using publicly available data only, surpassing the recent state-of-the-art approaches trained with 29 times more non-public machine-transcribed video data (90, 000 hours).

Lip Reading speech-recognition +1

Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation

no code implementations6 Jan 2023 Michał Stypułkowski, Konstantinos Vougioukas, Sen He, Maciej Zięba, Stavros Petridis, Maja Pantic

Talking face generation has historically struggled to produce head movements and natural facial expressions without guidance from additional reference videos.

Talking Face Generation Video Generation

End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks

no code implementations27 Apr 2021 Rodrigo Mira, Konstantinos Vougioukas, Pingchuan Ma, Stavros Petridis, Björn W. Schuller, Maja Pantic

In this work, we propose a new end-to-end video-to-speech model based on Generative Adversarial Networks (GANs) which translates spoken video to waveform end-to-end without using any intermediate representation or separate waveform synthesis algorithm.

Lip Reading Speech Synthesis

DINO: A Conditional Energy-Based GAN for Domain Translation

1 code implementation ICLR 2021 Konstantinos Vougioukas, Stavros Petridis, Maja Pantic

Domain translation is the process of transforming data from one domain to another while preserving the common semantics.

Translation

Lips Don't Lie: A Generalisable and Robust Approach to Face Forgery Detection

1 code implementation CVPR 2021 Alexandros Haliassos, Konstantinos Vougioukas, Stavros Petridis, Maja Pantic

Extensive experiments show that this simple approach significantly surpasses the state-of-the-art in terms of generalisation to unseen manipulations and robustness to perturbations, as well as shed light on the factors responsible for its performance.

DeepFake Detection Lipreading +2

Realistic Speech-Driven Facial Animation with GANs

no code implementations14 Jun 2019 Konstantinos Vougioukas, Stavros Petridis, Maja Pantic

We present an end-to-end system that generates videos of a talking head, using only a still image of a person and an audio clip containing speech, without relying on handcrafted intermediate features.

Audio-Visual Synchronization Lip Reading

End-to-End Speech-Driven Facial Animation with Temporal GANs

1 code implementation23 May 2018 Konstantinos Vougioukas, Stavros Petridis, Maja Pantic

To the best of our knowledge, this is the first method capable of generating subject independent realistic videos directly from raw audio.

Lip Reading

Cannot find the paper you are looking for? You can Submit a new open access paper.