1 code implementation • 8 Jul 2023 • Vikas Natesh, Andrew Sabot, H. T. Kung, Mark Ting
We propose Rosko -- row skipping outer products -- for deriving sparse matrix multiplication (SpMM) kernels in reducing computation and memory access requirements of deep neural networks (DNNs).
no code implementations • 12 Apr 2023 • Andrew Sabot, Vikas Natesh, H. T. Kung, Wei-Te Ting
We present the MEMA framework for the easy and quick derivation of efficient inference runtimes that minimize external memory accesses for matrix multiplication on TinyML systems.