no code implementations • 23 Jan 2024 • Chongke Bi, Xiaoxing Liu, Zhilei Liu
However, most existing NeRF-based methods either burden NeRF with complex learning tasks while lacking methods for supervised multimodal feature fusion, or cannot precisely map audio to the facial region related to speech movements.