30 Sep 2022 • Chendong Zhao, Jianzong Wang, Wen qi Wei, Xiaoyang Qu, Haoqian Wang, Jing Xiao
For multi-head attention in Transformer ASR, it is not easy to model monotonic alignments in different heads.
Automatic Speech Recognition (ASR)