Search Results for author: Sudhakar Sah

Found 6 papers, 2 papers with code

QGen: On the Ability to Generalize in Quantization Aware Training

no code implementations17 Apr 2024 MohammadHossein AskariHemmat, Ahmadreza Jeddi, Reyhane Askari Hemmat, Ivan Lazarevich, Alexander Hoffman, Sudhakar Sah, Ehsan Saboori, Yvon Savaria, Jean-Pierre David

In this work, we investigate the generalization properties of quantized neural networks, a characteristic that has received little attention despite its implications on model performance.

Quantization

DeepliteRT: Computer Vision at the Edge

no code implementations19 Sep 2023 Saad Ashfaq, Alexander Hoffman, Saptarshi Mitra, Sudhakar Sah, MohammadHossein AskariHemmat, Ehsan Saboori

The proliferation of edge devices has unlocked unprecedented opportunities for deep learning model deployment in computer vision applications.

Quantization

Accelerating Deep Neural Networks via Semi-Structured Activation Sparsity

1 code implementation12 Sep 2023 Matteo Grimaldi, Darshan C. Ganji, Ivan Lazarevich, Sudhakar Sah

The demand for efficient processing of deep neural networks (DNNs) on embedded devices is a significant challenge limiting their deployment.

Image Classification object-detection +1

YOLOBench: Benchmarking Efficient Object Detectors on Embedded Systems

1 code implementation26 Jul 2023 Ivan Lazarevich, Matteo Grimaldi, Ravish Kumar, Saptarshi Mitra, Shahrukh Khan, Sudhakar Sah

We present YOLOBench, a benchmark comprised of 550+ YOLO-based object detection models on 4 different datasets and 4 different embedded hardware platforms (x86 CPU, ARM CPU, Nvidia GPU, NPU).

Benchmarking Neural Architecture Search +3

DeepGEMM: Accelerated Ultra Low-Precision Inference on CPU Architectures using Lookup Tables

no code implementations18 Apr 2023 Darshan C. Ganji, Saad Ashfaq, Ehsan Saboori, Sudhakar Sah, Saptarshi Mitra, MohammadHossein AskariHemmat, Alexander Hoffman, Ahmed Hassanien, Mathieu Léonardon

A lot of recent progress has been made in ultra low-bit quantization, promising significant improvements in latency, memory footprint and energy consumption on edge devices.

Quantization

Cannot find the paper you are looking for? You can Submit a new open access paper.