1 code implementation • 26 Aug 2019 • Grzegorz Kwasniewski, Marko Kabić, Maciej Besta, Joost VandeVondele, Raffaele Solcà, Torsten Hoefler
The key idea behind COSMA is to derive an optimal (up to a factor of 0.03% for 10 MB of fast memory) sequential schedule and then parallelize it, preserving I/O optimality.
Computational Complexity • Distributed, Parallel, and Cluster Computing • Performance
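
To illustrate why the choice of sequential schedule matters for I/O, the sketch below counts the words moved between slow and fast memory for a simple tiled matrix multiplication. It is a simplified counting model and not COSMA's actual schedule: it assumes an a × b tile of the output stays in fast memory while one column of A and one row of B are streamed in at a time, so the tile must satisfy a·b + a + b ≤ S for a fast memory of S words. The function names `tiled_io_volume` and `best_square_tile` are hypothetical and introduced only for this example.

```python
import math


def best_square_tile(S):
    """Largest side a of a square output tile with a*a + 2*a <= S.

    Assumed model: the a x a tile of C plus one column of A (a words)
    and one row of B (a words) must fit in S words of fast memory.
    """
    if S < 3:
        raise ValueError("fast memory must hold at least a 1 x 1 tile")
    # a*a + 2*a <= S  is equivalent to  (a + 1)**2 <= S + 1
    return math.isqrt(S + 1) - 1


def tiled_io_volume(m, n, k, a, b):
    """Words moved for C (m x n) = A (m x k) @ B (k x n) with a x b output tiles.

    Per tile: k columns of A (a words each) and k rows of B (b words each)
    are loaded, then the a*b results are written once. Edge tiles are
    counted as full tiles, so this is an upper estimate when a or b
    does not divide m or n.
    """
    tiles = math.ceil(m / a) * math.ceil(n / b)
    return tiles * ((a + b) * k + a * b)


if __name__ == "__main__":
    S = 1023                      # fast memory size in words (illustrative)
    a = best_square_tile(S)
    m = n = k = 62
    print("tile side:", a)
    print("tiled I/O:", tiled_io_volume(m, n, k, a, a))
    print("untiled I/O (1 x 1 tiles):", tiled_io_volume(m, n, k, 1, 1))
```

In this model the volume is m·n·k·(1/a + 1/b) + m·n when the tiles divide the matrix evenly, so larger, near-square tiles reduce traffic; with a = b ≈ √S the count approaches 2mnk/√S + mn.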