Search Results for author: Zhe Niu

Found 2 papers, 1 papers with code

Stochastic Fine-grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition

1 code implementation • ECCV 2020 • Zhe Niu, Brian Mak

In this paper, we propose novel stochastic modeling of various components of a continuous sign language recognition (CSLR) system that is based on the transformer encoder and connectionist temporal classification (CTC).

Ranked #10 on Sign Language Recognition on RWTH-PHOENIX-Weather 2014 T

Sign Language Recognition

Paper
Code

On the Audio-visual Synchronization for Lip-to-Speech Synthesis

no code implementations • ICCV 2023 • Zhe Niu, Brian Mak

Most lip-to-speech (LTS) synthesis models are trained and evaluated under the assumption that the audio-video pairs in the dataset are perfectly synchronized.

Audio-Visual Synchronization Lip to Speech Synthesis +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.