no code implementations • 28 Jun 2019 • Yaman Kumar, Rohit Jain, Khwaja Mohd. Salik, Rajiv Ratn Shah, Yifang Yin, Roger Zimmermann
The model takes silent videos as input and produces speech as the output.
Lipreading