1 code implementation • ECCV 2020 • Zhe Niu, Brian Mak
In this paper, we propose novel stochastic modeling of various components of a continuous sign language recognition (CSLR) system that is based on the transformer encoder and connectionist temporal classification (CTC).
Ranked #10 on Sign Language Recognition on RWTH-PHOENIX-Weather 2014 T
no code implementations • ICCV 2023 • Zhe Niu, Brian Mak
Most lip-to-speech (LTS) synthesis models are trained and evaluated under the assumption that the audio-video pairs in the dataset are perfectly synchronized.