Search Results for author: Matteo Grimaldi

Found 4 papers, 3 papers with code

Accelerating Deep Neural Networks via Semi-Structured Activation Sparsity

1 code implementation12 Sep 2023 Matteo Grimaldi, Darshan C. Ganji, Ivan Lazarevich, Sudhakar Sah

The demand for efficient processing of deep neural networks (DNNs) on embedded devices is a significant challenge limiting their deployment.

Image Classification object-detection +1

YOLOBench: Benchmarking Efficient Object Detectors on Embedded Systems

1 code implementation26 Jul 2023 Ivan Lazarevich, Matteo Grimaldi, Ravish Kumar, Saptarshi Mitra, Shahrukh Khan, Sudhakar Sah

We present YOLOBench, a benchmark comprised of 550+ YOLO-based object detection models on 4 different datasets and 4 different embedded hardware platforms (x86 CPU, ARM CPU, Nvidia GPU, NPU).

Benchmarking Neural Architecture Search +3

Dynamic ConvNets on Tiny Devices via Nested Sparsity

no code implementations7 Mar 2022 Matteo Grimaldi, Luca Mocerino, Antonio Cipolletta, Andrea Calimera

This work introduces a new training and compression pipeline to build Nested Sparse ConvNets, a class of dynamic Convolutional Neural Networks (ConvNets) suited for inference tasks deployed on resource-constrained devices at the edge of the Internet-of-Things.

Image Classification object-detection +1

EAST: Encoding-Aware Sparse Training for Deep Memory Compression of ConvNets

1 code implementation20 Dec 2019 Matteo Grimaldi, Valentino Peluso, Andrea Calimera

The implementation of Deep Convolutional Neural Networks (ConvNets) on tiny end-nodes with limited non-volatile memory space calls for smart compression strategies capable of shrinking the footprint yet preserving predictive accuracy.

Quantization

Cannot find the paper you are looking for? You can Submit a new open access paper.