Search Results for author: Junyeol Lee

Found 1 papers, 0 papers with code

ExeGPT: Constraint-Aware Resource Scheduling for LLM Inference

no code implementations15 Mar 2024 Hyungjun Oh, Kihong Kim, JaeMin Kim, Sungkyun Kim, Junyeol Lee, Du-Seong Chang, Jiwon Seo

This paper presents ExeGPT, a distributed system designed for constraint-aware LLM inference.

Scheduling

Cannot find the paper you are looking for? You can Submit a new open access paper.