Search Results for author: Jorn Peters

Found 2 papers, 1 paper with code

QBitOpt: Fast and Accurate Bitwidth Reallocation during Training

no code implementations • 10 Jul 2023 • Jorn Peters, Marios Fournarakis, Markus Nagel, Mart van Baalen, Tijmen Blankevoort

By combining fast-to-compute sensitivities with efficient solvers during QAT, QBitOpt can produce mixed-precision networks with high task performance guaranteed to satisfy strict resource constraints.

Quantization

FP8 Quantization: The Power of the Exponent

1 code implementation • 19 Aug 2022 • Andrey Kuzmin, Mart van Baalen, Yuwei Ren, Markus Nagel, Jorn Peters, Tijmen Blankevoort

We detail the choices that can be made for the FP8 format, including the important choice of the number of bits for the mantissa and exponent, and show analytically in which settings these choices give better performance.

Quantization
