Search Results for author: Ting-yao Hu

Found 11 papers, 1 papers with code

Corpus Synthesis for Zero-shot ASR domain Adaptation using Large Language Models

no code implementations • 18 Sep 2023 • Hsuan Su, Ting-yao Hu, Hema Swetha Koppula, Raviteja Vemulapalli, Jen-Hao Rick Chang, Karren Yang, Gautam Varma Mantena, Oncel Tuzel

In this paper, we propose a new strategy for adapting ASR models to new target domains without any text or speech from those domains.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Paper
Add Code

Text is All You Need: Personalizing ASR Models using Controllable Speech Synthesis

no code implementations • 27 Mar 2023 • Karren Yang, Ting-yao Hu, Jen-Hao Rick Chang, Hema Swetha Koppula, Oncel Tuzel

Here, we ask two fundamental questions about this strategy: when is synthetic data effective for personalization, and why is it effective in those cases?

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

I see what you hear: a vision-inspired method to localize words

no code implementations • 24 Oct 2022 • Mohammad Samragh, Arnav Kundu, Ting-yao Hu, Minsik Cho, Aman Chadha, Ashish Shrivastava, Oncel Tuzel, Devang Naik

This paper explores the possibility of using visual object detection techniques for word localization in speech data.

Object object-detection +2

Paper
Add Code

Synt++: Utilizing Imperfect Synthetic Data to Improve Speech Recognition

no code implementations • 21 Oct 2021 • Ting-yao Hu, Mohammadreza Armandpour, Ashish Shrivastava, Jen-Hao Rick Chang, Hema Koppula, Oncel Tuzel

With recent advances in speech synthesis, synthetic data is becoming a viable alternative to real data for training speech recognition models.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Subspace Representation Learning for Few-shot Image Classification

no code implementations • 2 May 2021 • Ting-yao Hu, Zhi-Qi Cheng, Alexander G. Hauptmann

In this paper, we propose a subspace representation learning (SRL) framework to tackle few-shot image classification tasks.

Classification Few-Shot Image Classification +3

Paper
Add Code

Pose Guided Person Image Generation with Hidden p-Norm Regression

no code implementations • 19 Feb 2021 • Ting-yao Hu, Alexander G. Hauptmann

In this paper, we propose a novel approach to solve the pose guided person image generation task.

Image Generation regression

Paper
Add Code

SapAugment: Learning A Sample Adaptive Policy for Data Augmentation

no code implementations • 2 Nov 2020 • Ting-yao Hu, Ashish Shrivastava, Jen-Hao Rick Chang, Hema Koppula, Stefan Braun, Kyuyeon Hwang, Ozlem Kalinli, Oncel Tuzel

Our policy adapts the augmentation parameters based on the training loss of the data samples.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Project RISE: Recognizing Industrial Smoke Emissions

2 code implementations • 13 May 2020 • Yen-Chia Hsu, Ting-Hao 'Kenneth' Huang, Ting-yao Hu, Paul Dille, Sean Prendi, Ryan Hoffman, Anastasia Tsuhlares, Jessica Pachuta, Randy Sargent, Illah Nourbakhsh

Industrial smoke emissions pose a significant concern to human health.

Action Recognition Temporal Action Localization

111

Paper
Code

Unsupervised Style and Content Separation by Minimizing Mutual Information for Speech Synthesis

no code implementations • 9 Mar 2020 • Ting-yao Hu, Ashish Shrivastava, Oncel Tuzel, Chandra Dhir

We present a method to generate speech from input text and a style vector that is extracted from a reference speech signal in an unsupervised manner, i. e., no style annotation, such as speaker information, is required.

Decoder Speech Synthesis

Paper
Add Code

Multi-shot Person Re-identification through Set Distance with Visual Distributional Representation

no code implementations • 3 Aug 2018 • Ting-yao Hu, Xiaojun Chang, Alexander G. Hauptmann

In this work, we propose the idea of visual distributional representation, which interprets an image set as samples drawn from an unknown distribution in appearance feature space.

Person Re-Identification

Paper
Add Code

Complex spectrogram enhancement by convolutional neural network with multi-metrics learning

no code implementations • 27 Apr 2017 • Szu-Wei Fu, Ting-yao Hu, Yu Tsao, Xugang Lu

This paper aims to address two issues existing in the current speech enhancement methods: 1) the difficulty of phase estimations; 2) a single objective function cannot consider multiple metrics simultaneously.

Speech Enhancement

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.