1 code implementation • 26 May 2022 • Andre Xian Ming Chang, Parth Khopkar, Bashar Romanous, Abhishek Chaurasia, Patrick Estep, Skyler Windh, Doug Vanesko, Sheik Dawood Beer Mohideen, Eugenio Culurciello
In this work we propose a Reinforcement Learning framework with a Global Graph Attention (GGA) module and output masking of invalid placements to find and optimize instruction schedules.
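As a rough illustration of what output masking of invalid placements can look like, the sketch below zeroes out the probability of disallowed actions before sampling. It is not taken from the paper's code: the function name `masked_softmax`, the example logits, and the validity mask are all hypothetical, and the paper's actual policy network and masking details may differ.

```python
import math


def masked_softmax(logits, valid_mask):
    """Softmax over logits with invalid placements forced to probability 0.

    Hypothetical helper for illustration only; not the paper's implementation.
    """
    if not any(valid_mask):
        raise ValueError("at least one placement must be valid")
    # Replace the logits of invalid placements with -inf so exp() yields 0.
    masked = [x if ok else float("-inf") for x, ok in zip(logits, valid_mask)]
    peak = max(masked)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in masked]
    total = sum(exps)
    return [e / total for e in exps]


# Assumed example: four candidate placements, two of which are invalid.
logits = [2.0, 0.5, 1.0, 3.0]
valid = [True, False, True, False]
probs = masked_softmax(logits, valid)
```

With masking applied before normalization, the policy can never sample an invalid placement, and the remaining probability mass is shared among the valid ones only.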
no code implementations • 8 Aug 2017 • Vinayak Gokhale, Aliasger Zaidy, Andre Xian Ming Chang, Eugenio Culurciello
Snowflake is able to achieve a computational efficiency of over 91% on modern CNN models.
Hardware Architecture
no code implementations • 1 Aug 2017 • Andre Xian Ming Chang, Aliasger Zaidy, Vinayak Gokhale, Eugenio Culurciello
Given a programmable hardware accelerator with a CNN-oriented custom instruction set, the compiler's task is to exploit the hardware's full potential while abiding by the hardware constraints and maintaining the generality to run different CNN models with varying workload properties.
1 code implementation • 17 Nov 2015 • Andre Xian Ming Chang, Berin Martini, Eugenio Culurciello
Recurrent Neural Networks (RNNs) have the ability to retain memory and learn data sequences.
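To make the idea of retained memory concrete, here is a minimal scalar sketch of a recurrent update, assuming a plain tanh recurrence. The names `rnn_step` and `run` and the weights `w_h` and `w_x` are illustrative choices, not anything specified in the paper.

```python
import math


def rnn_step(h, x, w_h=0.5, w_x=1.0):
    """One recurrent update: the new state mixes the old state and the input.

    The weights are arbitrary illustrative values, not trained parameters.
    """
    return math.tanh(w_h * h + w_x * x)


def run(seq):
    """Feed a sequence through the recurrence and return the final hidden state."""
    h = 0.0  # initial hidden state
    for x in seq:
        h = rnn_step(h, x)
    return h


# Two sequences that differ only in their first element.
a = run([1.0, 0.0, 0.0])
b = run([0.0, 0.0, 0.0])
```

The final states differ even though the last two inputs are identical, because the hidden state carries a decaying trace of the first input forward through time.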