no code implementations • 19 Apr 2021 • Albert Njoroge Kahira, Truong Thao Nguyen, Leonardo Bautista Gomez, Ryousei Takano, Rosa M Badia, Mohamed Wahib
Deep Neural Network (DNN) frameworks use distributed training to reach convergence faster and to alleviate memory capacity limitations when training large models and/or using high-dimensional inputs.
2 code implementations • 30 Nov 2019 • Muhammet Abdullah Soyturk, Konstantinos Parasyris, Behzad Salami, Osman Unsal, Gulay Yalcin, Leonardo Bautista Gomez
During our study, we corrupt the L1-Dcache of the simulated system and monitor the effect of the two types of fault maps on the resiliency of six benchmarks.
Performance • Hardware Architecture