Search Results for author: Matthew LeMay

Found 1 paper, 1 paper with code

Perseus: Characterizing Performance and Cost of Multi-Tenant Serving for CNN Models

1 code implementation • 5 Dec 2019 • Matthew LeMay, Shijian Li, Tian Guo

Leveraging Perseus, we evaluated the inference throughput and cost of serving various models and demonstrated that multi-tenant model serving reduced cost by up to 12%.
