no code implementations • 25 Apr 2024 • Jaeseong You, Minseop Park, Kyunggeun Lee, Seokjun An, Chirag Patel, Markus Nage
This paper investigates three different parameterizations of asymmetric uniform quantization for quantization-aware training: (1) scale and offset, (2) minimum and maximum, and (3) beta and gamma.
no code implementations • 26 Sep 2023 • Sweta Priyadarshi, Tianyu Jiang, Hsin-Pai Cheng, Sendil Krishna, Viswanath Ganapathy, Chirag Patel
Here, we have developed an elegant approach to eliminate building the accuracy predictor and extend DONNA to a computationally efficient setting.
no code implementations • 4 Sep 2023 • Nilesh Prasad Pandey, Marios Fournarakis, Chirag Patel, Markus Nagel
Post-training quantization (PTQ) is the go-to compression technique for large generative models, such as stable diffusion or large language models.
no code implementations • 11 May 2023 • Yuwei Ren, Matt Zivney, Yin Huang, Eddie Choy, Chirag Patel, Hao Xu
Speaker protection algorithm is to leverage the playback signal properties to prevent over excursion while maintaining maximum loudness, especially for the mobile phone with tiny loudspeakers.
no code implementations • 31 Mar 2023 • Mart van Baalen, Andrey Kuzmin, Suparna S Nair, Yuwei Ren, Eric Mahurin, Chirag Patel, Sundar Subramanian, Sanghyuk Lee, Markus Nagel, Joseph Soriaga, Tijmen Blankevoort
We theoretically show the difference between the INT and FP formats for neural networks and present a plethora of post-training quantization and quantization-aware-training results to show how this theory translates to practice.
no code implementations • 10 Feb 2023 • Nilesh Prasad Pandey, Markus Nagel, Mart van Baalen, Yin Huang, Chirag Patel, Tijmen Blankevoort
We experimentally validate our proposed method on several computer vision tasks, natural language processing tasks and many different networks, and show that we can find mixed precision networks that provide a better trade-off between accuracy and efficiency than their homogeneous bit-width equivalents.
no code implementations • 14 Nov 2022 • Yuwei Ren, Jiuyuan Lu, Andrian Beletchi, Yin Huang, Ilia Karmanov, Daniel Fontijne, Chirag Patel, Hao Xu
Range-Doppler information (RDI) is obtained by using pulse Doppler radar for gesture recognition.
no code implementations • 20 Jan 2022 • Sangeetha Siddegowda, Marios Fournarakis, Markus Nagel, Tijmen Blankevoort, Chirag Patel, Abhijit Khobare
chapter 4) and quantization-aware training (QAT, cf.
no code implementations • 28 Nov 2019 • Jangho Kim, Yash Bhalgat, Jinwon Lee, Chirag Patel, Nojun Kwak
First, Self-studying (SS) phase fine-tunes a quantized low-precision student network without KD to obtain a good initialization.