no code implementations • CVPR 2022 • Zhaoyang Zeng, Yongsheng Luo, Zhenhua Liu, Fengyun Rao, Dian Li, Weidong Guo, Zhen Wen
In this paper, we propose the Tencent-MVSE dataset, which is the first benchmark dataset for the multi-modal video similarity evaluation task.
Automatic Speech Recognition (ASR)
1 code implementation • 10 Mar 2020 • Yong Guo, Yongsheng Luo, Zhenhao He, Jin Huang, Jian Chen
To this end, we design a hierarchical super-resolution (SR) search space and propose a hierarchical controller for architecture search.