Search Results for author: Chien-yu Huang

Found 8 papers, 4 papers with code

Prompting and Adapter Tuning for Self-supervised Encoder-Decoder Speech Model

no code implementations • 4 Oct 2023 • Kai-Wei Chang, Ming-Hsin Chen, Yun-Ping Lin, Jing Neng Hsu, Paul Kuo-Ming Huang, Chien-yu Huang, Shang-Wen Li, Hung-Yi Lee

Notably, in the low-resource scenario, prompting consistently outperforms adapter tuning.

Cross-Lingual ASR slot-filling +1

Paper
Add Code

Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech

1 code implementation • 18 Sep 2023 • Chien-yu Huang, Ke-Han Lu, Shih-Heng Wang, Chi-Yuan Hsiao, Chun-Yi Kuan, Haibin Wu, Siddhant Arora, Kai-Wei Chang, Jiatong Shi, Yifan Peng, Roshan Sharma, Shinji Watanabe, Bhiksha Ramakrishnan, Shady Shehata, Hung-Yi Lee

To achieve comprehensive coverage of diverse speech tasks and harness instruction tuning, we invite the community to collaborate and contribute, facilitating the dynamic growth of the benchmark.

101

Paper
Code

Toward Degradation-Robust Voice Conversion

no code implementations • 14 Oct 2021 • Chien-yu Huang, Kai-Wei Chang, Hung-Yi Lee

However, in real-world scenarios, it is difficult to collect clean utterances of a speaker, and they are usually degraded by noises or reverberations.

Denoising Speech Enhancement +1

Paper
Add Code

Improving Cross-Lingual Reading Comprehension with Self-Training

no code implementations • 8 May 2021 • Wei-Cheng Huang, Chien-yu Huang, Hung-Yi Lee

Substantial improvements have been made in machine reading comprehension, where the machine answers questions based on a given context.

Machine Reading Comprehension

Paper
Add Code

Utilizing Self-supervised Representations for MOS Prediction

6 code implementations • 7 Apr 2021 • Wei-Cheng Tseng, Chien-yu Huang, Wei-Tsung Kao, Yist Y. Lin, Hung-Yi Lee

In this paper, we use self-supervised pre-trained models for MOS prediction.

Voice Conversion

2,099

Paper
Code

Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech

1 code implementation • 6 Mar 2021 • Chung-Ming Chien, Jheng-Hao Lin, Chien-yu Huang, Po-chun Hsu, Hung-Yi Lee

The few-shot multi-speaker multi-style voice cloning task is to synthesize utterances with voice and speaking style similar to a reference speaker given only a few reference samples.

Voice Cloning Voice Conversion

1,622

Paper
Code

How Far Are We from Robust Voice Conversion: A Survey

no code implementations • 24 Nov 2020 • Tzu-Hsien Huang, Jheng-Hao Lin, Chien-yu Huang, Hung-Yi Lee

Voice conversion technologies have been greatly improved in recent years with the help of deep learning, but their capabilities of producing natural sounding utterances in different conditions remain unclear.

Speaker Identification Voice Conversion

Paper
Add Code

Defending Your Voice: Adversarial Attack on Voice Conversion

1 code implementation • 18 May 2020 • Chien-yu Huang, Yist Y. Lin, Hung-Yi Lee, Lin-shan Lee

We introduce human imperceptible noise into the utterances of a speaker whose voice is to be defended.

Adversarial Attack Voice Conversion

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.