Search Results for author: Erwin Laure

Found 2 papers, 0 papers with code

Characterizing Deep-Learning I/O Workloads in TensorFlow

no code implementations6 Oct 2018 Steven W. D. Chien, Stefano Markidis, Chaitanya Prasad Sishtla, Luis Santos, Pawel Herman, Sai Narasimhamurthy, Erwin Laure

To measure TensorFlow I/O performance, we first design a micro-benchmark to measure TensorFlow reads, and then use a TensorFlow mini-application based on AlexNet to measure the performance cost of I/O and checkpointing in TensorFlow.

Distributed, Parallel, and Cluster Computing

NVIDIA Tensor Core Programmability, Performance & Precision

no code implementations11 Mar 2018 Stefano Markidis, Steven Wei Der Chien, Erwin Laure, Ivy Bo Peng, Jeffrey S. Vetter

After experimenting with different approaches, we found that NVIDIA Tensor Cores can deliver up to 83 Tflops/s in mixed precision on a Tesla V100 GPU, seven and three times the performance in single and half precision respectively.

Distributed, Parallel, and Cluster Computing Performance

Cannot find the paper you are looking for? You can Submit a new open access paper.