1 code implementation • 15 Dec 2023 • Bartosz Wójcik, Alessio Devoto, Karol Pustelnik, Pasquale Minervini, Simone Scardapane
The computational cost of transformer models makes them inefficient in low-latency or low-power applications.
Quantization speech-recognition +1