Search Results for author: Huiyin Xue

Found 2 papers, 0 papers with code

Pit One Against Many: Leveraging Attention-head Embeddings for Parameter-efficient Multi-head Attention

no code implementations · 11 Oct 2023 · Huiyin Xue, Nikolaos Aletras

Scaling pre-trained language models has resulted in large performance gains in various natural language processing tasks but comes with a large cost in memory requirements.
