no code implementations • 4 Feb 2023 • Feng Xue, Yu Li, Deyin Liu, Yincen Xie, Lin Wu, Richang Hong
However, generalizing these methods to unseen speakers incurs catastrophic performance degradation due to the limited number of speakers in training bank and the evident visual variations caused by the shape/color of lips for different speakers.