Search Results for author: Zhihui Yang

Found 1 paper, 1 paper with code

Moirai: Towards Optimal Placement for Distributed Inference on Heterogeneous Devices

1 code implementation • 7 Dec 2023 • Beibei Zhang, Hongwei Zhu, Feng Gao, Zhihui Yang, Sean Xiaoyang Wang

This paper presents Moirai, which better exploits runtime inter-operator fusion in a model to render a coarsened computation graph, reducing the search space while maintaining the inter-operator optimization provided by inference backends.
