Search Results for author: Jiahong Yuan

Found 15 papers, 2 papers with code

Using Mixed Incentives to Document Xi’an Guanzhong

no code implementations NIDCP (LREC) 2022 Juhong Zhan, Yue Jiang, Christopher Cieri, Mark Liberman, Jiahong Yuan, Yiya Chen, Odette Scharenborg

This paper describes our use of mixed incentives and the citizen science portal LanguageARC to prepare, collect and quality control a large corpus of object namings for the purpose of providing speech data to document the under-represented Guanzhong dialect of Chinese spoken in the Shaanxi province in the environs of Xi’an.

Data-Driven Adaptive Simultaneous Machine Translation

no code implementations27 Apr 2022 Guangxu Xun, Mingbo Ma, Yuchen Bian, Xingyu Cai, Jiaji Huang, Renjie Zheng, Junkun Chen, Jiahong Yuan, Kenneth Church, Liang Huang

In simultaneous translation (SimulMT), the most widely used strategy is the wait-k policy thanks to its simplicity and effectiveness in balancing translation quality and latency.

Machine Translation Sentence +1

The Role of Phonetic Units in Speech Emotion Recognition

no code implementations2 Aug 2021 Jiahong Yuan, Xingyu Cai, Renjie Zheng, Liang Huang, Kenneth Church

Models of phonemes, broad phonetic classes, and syllables all significantly outperform the utterance model, demonstrating that phonetic units are helpful and should be incorporated in speech emotion recognition.

Speech Emotion Recognition speech-recognition +1

Automatic recognition of suprasegmentals in speech

no code implementations2 Aug 2021 Jiahong Yuan, Neville Ryant, Xingyu Cai, Kenneth Church, Mark Liberman

This study reports our efforts to improve automatic recognition of suprasegmentals by fine-tuning wav2vec 2. 0 with CTC, a method that has been successful in automatic speech recognition.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

On Attention Redundancy: A Comprehensive Study

no code implementations NAACL 2021 Yuchen Bian, Jiaji Huang, Xingyu Cai, Jiahong Yuan, Kenneth Church

(What) We define and focus the study on redundancy matrices generated from pre-trained and fine-tuned BERT-base model for GLUE datasets.

Model Compression Sentence

Text2Video: Text-driven Talking-head Video Synthesis with Personalized Phoneme-Pose Dictionary

1 code implementation29 Apr 2021 Sibo Zhang, Jiahong Yuan, Miao Liao, Liangjun Zhang

With the advance of deep learning technology, automatic video generation from audio or text has become an emerging and promising research topic.

Generative Adversarial Network Talking Face Generation +1

Fluent and Low-latency Simultaneous Speech-to-Speech Translation with Self-adaptive Training

no code implementations Findings of the Association for Computational Linguistics 2020 Renjie Zheng, Mingbo Ma, Baigong Zheng, Kaibo Liu, Jiahong Yuan, Kenneth Church, Liang Huang

Simultaneous speech-to-speech translation is widely useful but extremely challenging, since it needs to generate target-language speech concurrently with the source-language speech, with only a few seconds delay.

Sentence Speech-to-Speech Translation +1

On the Role of Style in Parsing Speech with Neural Models

no code implementations8 Oct 2020 Trang Tran, Jiahong Yuan, Yang Liu, Mari Ostendorf

The differences in written text and conversational speech are substantial; previous parsers trained on treebanked text have given very poor results on spontaneous speech.

Sparseness-constrained Nonnegative Tensor Factorization for Detecting Topics at Different Time Scales

1 code implementation4 Oct 2020 Lara Kassab, Alona Kryshchenko, Hanbaek Lyu, Denali Molitor, Deanna Needell, Elizaveta Rebrova, Jiahong Yuan

Further, we propose quantitative ways to measure the topic length and demonstrate the ability of S-NCPD (as well as its online variant) to discover short and long-lasting temporal topics in a controlled manner in semi-synthetic and real-world data including news headlines.

Tensor Decomposition

Cannot find the paper you are looking for? You can Submit a new open access paper.