A collection of single-speaker speech datasets for ten languages. Each dataset is composed of short audio clips from LibriVox audiobooks and their aligned texts.
Source: CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages