Search Results for author: Yukang Liang

Found 2 papers, 0 papers with code

End-to-End Word-Level Pronunciation Assessment with MASK Pre-training

no code implementations • 5 Jun 2023 • Yukang Liang, Kaitao Song, Shaoguang Mao, Huiqiang Jiang, Luna Qiu, Yuqing Yang, Dongsheng Li, Linli Xu, Lili Qiu

Pronunciation assessment is a major challenge in computer-aided pronunciation training systems, especially at the word (phoneme) level.

Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA

no code implementations • 4 Apr 2023 • Yongxin Zhu, Zhen Liu, Yukang Liang, Xin Li, Hao Liu, Changcun Bao, Linli Xu

Unlike conventional STVQA models, which treat the linguistic semantics and visual semantics of scene text as two separate features, this paper proposes a "Locate Then Generate" (LTG) paradigm, which explicitly unifies these two semantics with the spatial bounding box as a bridge connecting them.

Tasks: Answer Generation, Language Modelling (+3 more)
