Talking Head Generation
40 papers with code • 7 benchmarks • 3 datasets
Talking head generation is the task of synthesizing a talking face video from one or more images of a person, typically driven by audio or another video.
(Image credit: Few-Shot Adversarial Learning of Realistic Neural Talking Head Models)
Latest papers
DaGAN++: Depth-Aware Generative Adversarial Network for Talking Head Video Generation
In this work, we first present a novel self-supervised method for learning dense 3D facial geometry (i.e., depth) from face videos, without requiring camera parameters or 3D geometry annotations during training.
Face Animation with an Attribute-Guided Diffusion Model
Face animation has made significant progress in computer vision.
Emotionally Enhanced Talking Face Generation
We build a talking face generation framework conditioned on a categorical emotion to generate videos with appropriate expressions, making them more realistic and convincing.
DisCoHead: Audio-and-Video-Driven Talking Head Generation by Disentangled Control of Head Pose and Facial Expressions
We enhance the efficiency of DisCoHead by integrating a dense motion estimator with the encoder of the generator, which are originally separate modules.
DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation
In this way, the proposed DiffTalk is capable of producing high-quality talking head videos in synchronization with the source audio, and more importantly, it can be naturally generalized across different identities without any further fine-tuning.
StyleTalk: One-shot Talking Head Generation with Controllable Speaking Styles
In a nutshell, we aim to attain a speaking style from an arbitrary reference speaking video and then drive the one-shot portrait to speak with the reference speaking style and another piece of audio.
MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation
In this work, we propose an ID-preserving talking head generation framework, which advances previous methods in two aspects.
SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
We present SadTalker, which generates 3D motion coefficients (head pose, expression) of the 3DMM from audio and implicitly modulates a novel 3D-aware face render for talking head generation.
Autoregressive GAN for Semantic Unconditional Head Motion Generation
In this work, we address the task of unconditional head motion generation to animate still human faces in a low-dimensional semantic space from a single reference pose.
Compressing Video Calls using Synthetic Talking Heads
We use a state-of-the-art face reenactment network to detect key points in the non-pivot frames and transmit them to the receiver.
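The bandwidth advantage of sending key points instead of full frames can be sketched with a quick back-of-the-envelope calculation. The frame resolution, landmark count, and coordinate precision below are illustrative assumptions, not values taken from the paper:

```python
# Toy illustration of keypoint-based video-call compression: the sender
# transmits only facial key points for non-pivot frames; the receiver
# reconstructs each frame with a face reenactment network.
# All constants here are assumptions chosen for illustration.

FRAME_HEIGHT, FRAME_WIDTH, CHANNELS = 256, 256, 3  # assumed 8-bit RGB frame
NUM_KEYPOINTS = 68                                 # common facial-landmark count
BYTES_PER_COORD = 4                                # float32 (x, y) coordinates

# Size of one uncompressed frame vs. one keypoint payload.
frame_bytes = FRAME_HEIGHT * FRAME_WIDTH * CHANNELS
keypoint_bytes = NUM_KEYPOINTS * 2 * BYTES_PER_COORD

compression_ratio = frame_bytes / keypoint_bytes
print(f"raw frame: {frame_bytes} B, keypoints: {keypoint_bytes} B")
print(f"compression ratio ~= {compression_ratio:.0f}x")
```

Even before conventional codec compression of the raw stream is accounted for, the keypoint payload is orders of magnitude smaller than the pixels it replaces, which is what makes reenactment-based transmission attractive for low-bandwidth calls.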