Search Results for author: Chirag Patel

Found 9 papers, 0 papers with code

How to Parameterize Asymmetric Quantization Ranges for Quantization-Aware Training

no code implementations • 25 Apr 2024 • Jaeseong You, Minseop Park, Kyunggeun Lee, Seokjun An, Chirag Patel, Markus Nagel

This paper investigates three different parameterizations of asymmetric uniform quantization for quantization-aware training: (1) scale and offset, (2) minimum and maximum, and (3) beta and gamma.

DONNAv2 -- Lightweight Neural Architecture Search for Vision tasks

no code implementations • 26 Sep 2023 • Sweta Priyadarshi, Tianyu Jiang, Hsin-Pai Cheng, Sendil Krishna, Viswanath Ganapathy, Chirag Patel

Here, we have developed an elegant approach to eliminate building the accuracy predictor and extend DONNA to a computationally efficient setting.

Image Denoising, Knowledge Distillation, +4 more

Softmax Bias Correction for Quantized Generative Models

no code implementations • 4 Sep 2023 • Nilesh Prasad Pandey, Marios Fournarakis, Chirag Patel, Markus Nagel

Post-training quantization (PTQ) is the go-to compression technique for large generative models, such as stable diffusion or large language models.

Language Modelling, Quantization

Speaker Diaphragm Excursion Prediction: deep attention and online adaptation

no code implementations • 11 May 2023 • Yuwei Ren, Matt Zivney, Yin Huang, Eddie Choy, Chirag Patel, Hao Xu

A speaker protection algorithm leverages the properties of the playback signal to prevent over-excursion while maintaining maximum loudness, especially on mobile phones with tiny loudspeakers.

Deep Attention, Quantization

FP8 versus INT8 for efficient deep learning inference

no code implementations • 31 Mar 2023 • Mart van Baalen, Andrey Kuzmin, Suparna S Nair, Yuwei Ren, Eric Mahurin, Chirag Patel, Sundar Subramanian, Sanghyuk Lee, Markus Nagel, Joseph Soriaga, Tijmen Blankevoort

We theoretically show the difference between the INT and FP formats for neural networks and present a plethora of post-training quantization and quantization-aware-training results to show how this theory translates to practice.

Quantization

A Practical Mixed Precision Algorithm for Post-Training Quantization

no code implementations • 10 Feb 2023 • Nilesh Prasad Pandey, Markus Nagel, Mart van Baalen, Yin Huang, Chirag Patel, Tijmen Blankevoort

We experimentally validate our proposed method on several computer vision tasks, natural language processing tasks and many different networks, and show that we can find mixed precision networks that provide a better trade-off between accuracy and efficiency than their homogeneous bit-width equivalents.

Quantization