1 code implementation • 12 Sep 2023 • Matteo Grimaldi, Darshan C. Ganji, Ivan Lazarevich, Sudhakar Sah
The demand for efficient processing of deep neural networks (DNNs) on embedded devices is a significant challenge limiting their deployment.
1 code implementation • 26 Jul 2023 • Ivan Lazarevich, Matteo Grimaldi, Ravish Kumar, Saptarshi Mitra, Shahrukh Khan, Sudhakar Sah
We present YOLOBench, a benchmark comprised of 550+ YOLO-based object detection models on 4 different datasets and 4 different embedded hardware platforms (x86 CPU, ARM CPU, Nvidia GPU, NPU).
no code implementations • 7 Mar 2022 • Matteo Grimaldi, Luca Mocerino, Antonio Cipolletta, Andrea Calimera
This work introduces a new training and compression pipeline to build Nested Sparse ConvNets, a class of dynamic Convolutional Neural Networks (ConvNets) suited for inference tasks deployed on resource-constrained devices at the edge of the Internet-of-Things.
1 code implementation • 20 Dec 2019 • Matteo Grimaldi, Valentino Peluso, Andrea Calimera
The implementation of Deep Convolutional Neural Networks (ConvNets) on tiny end-nodes with limited non-volatile memory space calls for smart compression strategies capable of shrinking the footprint yet preserving predictive accuracy.