no code implementations • 2 Feb 2024 • Chirag Chhablani, Nikhita Sharma, Jordan Hosier, Vijay K. Gurbani
Such light models can be trained perform well on number recognition specific tasks, competing with general models like Whisper or Google STT while using less than 80 minutes of training time and occupying at least an order of less memory resources.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2