no code implementations • EURALI (LREC) 2022 • Chihiro Taguchi, Sei Iwata, Taro Watanabe
Experimenting on NMCTT and the Turkish-German CS treebank (SAGT), we demonstrate that the annotation scheme introduced in NMCTT can improve the performance of subword-level language identification.
no code implementations • NAACL (CALCS) 2021 • Chihiro Taguchi, Yusuke Sakai, Taro Watanabe
Given this situation, we proposed a transliteration method based on subword-level language identification.
1 code implementation • 23 Apr 2024 • Chihiro Taguchi, Jefferson Saransig, Dayana Velásquez, David Chiang
This dataset, the ASR model, and the code used to develop them will be publicly available.
Automatic Speech Recognition (ASR)
1 code implementation • 7 Aug 2023 • Chihiro Taguchi, Yusuke Sakai, Parisa Haghani, David Chiang
This paper presents a state-of-the-art model for transcribing speech in any language into the International Phonetic Alphabet (IPA).