1 code implementation • 19 Dec 2023 • Weipeng Guan, Peiyu Chen, Huibin Zhao, Yu Wang, Peng Lu
To the best of our knowledge, this is the first non-learning work to realize event-based dense mapping.
1 code implementation • 25 Sep 2022 • Weipeng Guan, Peiyu Chen, Yuhan Xie, Peng Lu
Compared with the standard cameras, it can provide reliable visual perception during high-speed motions and in high dynamic range scenarios.
1 code implementation • ACL 2022 • Xichen Pan, Peiyu Chen, Yichen Gong, Helong Zhou, Xinbing Wang, Zhouhan Lin
In particular, audio and visual front-ends are trained on large-scale unimodal datasets, then we integrate components of both front-ends into a larger multimodal framework which learns to recognize parallel audio-visual data into characters through a combination of CTC and seq2seq decoding.
Ranked #2 on Automatic Speech Recognition (ASR) on LRS2
Audio-Visual Speech Recognition Automatic Speech Recognition (ASR) +7