Search Results for author: VS Subrahmanian

Found 1 papers, 1 papers with code

Higher Layers Need More LoRA Experts

1 code implementation • 13 Feb 2024 • Chongyang Gao, Kezhen Chen, Jinmeng Rao, Baochen Sun, Ruibo Liu, Daiyi Peng, Yawen Zhang, Xiaoyuan Guo, Jie Yang, VS Subrahmanian

In this paper, we introduce a novel parameter-efficient MoE method, MoE-LoRA with Layer-wise Expert Allocation (MoLA), for Transformer-based models, where each model layer has the flexibility to employ a varying number of LoRA experts.
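A minimal sketch of the layer-wise LoRA-expert idea described in the abstract, not the authors' implementation: each Transformer layer holds its own, possibly different, number of LoRA experts, and a token-level router mixes the top-k experts on top of a frozen base projection. Names such as `MoLALinear`, `LoRAExpert`, and `experts_per_layer` are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class LoRAExpert(nn.Module):
    """One low-rank adapter: delta(x) = (x @ A) @ B with rank r."""
    def __init__(self, d_in, d_out, r=8):
        super().__init__()
        self.A = nn.Parameter(torch.randn(d_in, r) * 0.01)
        self.B = nn.Parameter(torch.zeros(r, d_out))

    def forward(self, x):
        return x @ self.A @ self.B

class MoLALinear(nn.Module):
    """Frozen linear layer plus a router over a layer-specific set of LoRA experts."""
    def __init__(self, d_in, d_out, num_experts, top_k=2, r=8):
        super().__init__()
        self.base = nn.Linear(d_in, d_out)
        for p in self.base.parameters():       # base weights stay frozen
            p.requires_grad_(False)
        self.experts = nn.ModuleList(LoRAExpert(d_in, d_out, r) for _ in range(num_experts))
        self.router = nn.Linear(d_in, num_experts)
        self.top_k = min(top_k, num_experts)

    def forward(self, x):
        out = self.base(x)
        gate = F.softmax(self.router(x), dim=-1)        # (..., num_experts)
        weights, idx = gate.topk(self.top_k, dim=-1)    # route each token to its top-k experts
        for slot in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = (idx[..., slot] == e).unsqueeze(-1)          # tokens routed to expert e
                out = out + mask * weights[..., slot:slot+1] * expert(x)
        return out

# Example allocation where higher layers get more experts, e.g. 2 -> 4 -> 6 -> 8.
experts_per_layer = [2, 4, 6, 8]
layers = nn.ModuleList(MoLALinear(64, 64, n) for n in experts_per_layer)
x = torch.randn(1, 10, 64)
for layer in layers:
    x = layer(x)
print(x.shape)  # torch.Size([1, 10, 64])
```

The increasing `experts_per_layer` list is only one possible allocation; the paper's point is that the per-layer expert count is a free design choice rather than a constant.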
