Search Results for author: Zhe Niu

Found 2 papers, 1 paper with code

Stochastic Fine-grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition

1 code implementation • ECCV 2020 • Zhe Niu, Brian Mak

In this paper, we propose novel stochastic modeling of various components of a continuous sign language recognition (CSLR) system that is based on the transformer encoder and connectionist temporal classification (CTC).

Sign Language Recognition

On the Audio-visual Synchronization for Lip-to-Speech Synthesis

no code implementations • ICCV 2023 • Zhe Niu, Brian Mak

Most lip-to-speech (LTS) synthesis models are trained and evaluated under the assumption that the audio-video pairs in the dataset are perfectly synchronized.

Audio-Visual Synchronization, Lip to Speech Synthesis, +1
