no code implementations • 19 Jul 2023 • Javad Peymanfard, Vahid Saeedi, Mohammad Reza Mohammadi, Hossein Zeinali, Nasser Mozayani
We evaluate our approach on various tasks, including word-level and sentence-level lip reading, and audiovisual speech recognition using the Arman-AV dataset, a large-scale Persian corpus.
no code implementations • 8 Apr 2023 • Javad Peymanfard, Ali Lashini, Samin Heydarian, Hossein Zeinali, Nasser Mozayani
Lip-reading has made impressive progress in recent years, driven by advances in deep learning.
no code implementations • 7 Apr 2023 • Mohammd Hasan Shamgholi, Vahid Saeedi, Javad Peymanfard, Leila Alhabib, Hossein Zeinali
Text-to-speech (TTS) is a complex process that can be accomplished through appropriate modeling with deep learning methods.
no code implementations • 21 Jan 2023 • Javad Peymanfard, Samin Heydarian, Ali Lashini, Hossein Zeinali, Mohammad Reza Mohammadi, Nasser Mozayani
In addition, we have proposed a technique to detect visemes (the visual equivalents of phonemes) in Persian.
Audio-Visual Speech Recognition • Automatic Speech Recognition • +5
1 code implementation • 24 Jul 2022 • Hossein Mirzaee, Javad Peymanfard, Hamid Habibzadeh Moshtaghin, Hossein Zeinali
With the recent proliferation of open textual data on social media platforms, Emotion Detection (ED) from text has received increasing attention in recent years.
no code implementations • 10 Apr 2021 • Javad Peymanfard, Mohammad Reza Mohammadi, Hossein Zeinali, Nasser Mozayani
Lip-reading is the operation of recognizing speech from lip movements.