Search Results for author: Alexander Hoffman

Found 4 papers, 0 papers with code

QGen: On the Ability to Generalize in Quantization Aware Training

no code implementations17 Apr 2024 MohammadHossein AskariHemmat, Ahmadreza Jeddi, Reyhane Askari Hemmat, Ivan Lazarevich, Alexander Hoffman, Sudhakar Sah, Ehsan Saboori, Yvon Savaria, Jean-Pierre David

In this work, we investigate the generalization properties of quantized neural networks, a characteristic that has received little attention despite its implications on model performance.

Quantization

DeepliteRT: Computer Vision at the Edge

no code implementations19 Sep 2023 Saad Ashfaq, Alexander Hoffman, Saptarshi Mitra, Sudhakar Sah, MohammadHossein AskariHemmat, Ehsan Saboori

The proliferation of edge devices has unlocked unprecedented opportunities for deep learning model deployment in computer vision applications.

Quantization

DeepGEMM: Accelerated Ultra Low-Precision Inference on CPU Architectures using Lookup Tables

no code implementations18 Apr 2023 Darshan C. Ganji, Saad Ashfaq, Ehsan Saboori, Sudhakar Sah, Saptarshi Mitra, MohammadHossein AskariHemmat, Alexander Hoffman, Ahmed Hassanien, Mathieu Léonardon

A lot of recent progress has been made in ultra low-bit quantization, promising significant improvements in latency, memory footprint and energy consumption on edge devices.

Quantization

Cannot find the paper you are looking for? You can Submit a new open access paper.