Search Results for author: Andreas Gerstlauer

Found 3 papers, 0 papers with code

A Survey of Distributed Learning in Cloud, Mobile, and Edge Settings

no code implementations23 May 2024 Madison Threadgill, Andreas Gerstlauer

In the era of deep learning (DL), convolutional neural networks (CNNs), and large language models (LLMs), machine learning (ML) models are becoming increasingly complex, demanding significant computational resources for both inference and training stages.

Computational Efficiency

MAFAT: Memory-Aware Fusing and Tiling of Neural Networks for Accelerated Edge Inference

no code implementations14 Jul 2021 Jackson Farley, Andreas Gerstlauer

Distributed partitioning approaches can, however, also be used to run in a reduced memory footprint on a single device by subdividing the network into smaller operations.

object-detection Object Detection +1

Virtual-Link: A Scalable Multi-Producer, Multi-Consumer Message Queue Architecture for Cross-Core Communication

no code implementations9 Dec 2020 Qinzhe Wu, Jonathan Beard, Ashen Ekanayake, Andreas Gerstlauer, Lizy K. John

Cross-core communication is increasingly a bottleneck as the number of processing elements increase per system-on-chip.

Hardware Architecture

Cannot find the paper you are looking for? You can Submit a new open access paper.