Search Results for author: Shaoyuan Chen

Found 2 papers, 0 papers with code

Efficient and Economic Large Language Model Inference with Attention Offloading

no code implementations3 May 2024 Shaoyuan Chen, Yutong Lin, Mingxing Zhang, Yongwei Wu

To enhance the efficiency and cost-effectiveness of LLM serving, we introduce the concept of attention offloading.

Language Modelling Large Language Model

Joint Transceiver Design Based on Dictionary Learning Algorithm for SCMA

no code implementations30 Oct 2020 Shanshan Zhang, Wen Chen, Shaoyuan Chen

With the explosively increasing demands on the network capacity, throughput and number of connected wireless devices, massive connectivity is an urgent problem for the next generation wireless communications.

Dictionary Learning Scheduling

Cannot find the paper you are looking for? You can Submit a new open access paper.