no code implementations • 29 Mar 2022 • Miguel Del Rio, Peter Ha, Quinten McNamara, Corey Miller, Shipra Chandra
Beyond that, there is a lack of real-world, accented corpora to properly benchmark academic and commercial models.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
2 code implementations • 22 Apr 2021 • Miguel Del Rio, Natalie Delworth, Ryan Westerman, Michelle Huang, Nishchal Bhandari, Joseph Palakapilly, Quinten McNamara, Joshua Dong, Piotr Zelasko, Miguel Jette
Commonly used speech corpora inadequately challenge academic and commercial ASR systems.
no code implementations • 21 Apr 2021 • Arthur Hinsvark, Natalie Delworth, Miguel Del Rio, Quinten McNamara, Joshua Dong, Ryan Westerman, Michelle Huang, Joseph Palakapilly, Jennifer Drexler, Ilya Pirkin, Nishchal Bhandari, Miguel Jette
Automatic Speech Recognition (ASR) systems generalize poorly on accented speech.
no code implementations • 20 Feb 2017 • Quinten McNamara, Alejandro de la Vega, Tal Yarkoni
Pliers is an open-source Python package that supports standardized annotation of diverse data types (video, images, audio, and text), and is expressly with both ease-of-use and extensibility in mind.
no code implementations • 18 Nov 2016 • Ye Zhang, Md Mustafizur Rahman, Alex Braylan, Brandon Dang, Heng-Lu Chang, Henna Kim, Quinten McNamara, Aaron Angert, Edward Banner, Vivek Khetan, Tyler McDonnell, An Thanh Nguyen, Dan Xu, Byron C. Wallace, Matthew Lease
A recent "third wave" of Neural Network (NN) approaches now delivers state-of-the-art performance in many machine learning tasks, spanning speech recognition, computer vision, and natural language processing.