Search Results for author: Matthew LeMay

Found 1 papers, 1 papers with code

Perseus: Characterizing Performance and Cost of Multi-Tenant Serving for CNN Models

1 code implementation5 Dec 2019 Matthew LeMay, Shijian Li, Tian Guo

Leveraging Perseus, we evaluated the inference throughput and cost for serving various models and demonstrated that multi-tenant model serving led to up to 12% cost reduction.

Cannot find the paper you are looking for? You can Submit a new open access paper.