Search Results for author: Dmitrii Ustiugov

Found 2 papers, 1 paper with code

ServerlessLLM: Locality-Enhanced Serverless Inference for Large Language Models

no code implementations • 25 Jan 2024 • Yao Fu, Leyang Xue, Yeqi Huang, Andrei-Octavian Brabete, Dmitrii Ustiugov, Yuvraj Patel, Luo Mai

This paper presents ServerlessLLM, a locality-enhanced serverless inference system for Large Language Models (LLMs).

Benchmarking, Analysis, and Optimization of Serverless Function Snapshots

1 code implementation • 16 Jan 2021 • Dmitrii Ustiugov, Plamen Petrov, Marios Kogias, Edouard Bugnion, Boris Grot

We find that the execution time of a function started from a snapshot is 95% higher, on average, than when the same function is memory-resident.

Distributed, Parallel, and Cluster Computing
