Search Results for author: Tamir David Hay

Found 1 paper, 0 papers with code

Dynamic Layer Tying for Parameter-Efficient Transformers

no code implementations • 23 Jan 2024 • Tamir David Hay, Lior Wolf

In the pursuit of reducing the number of trainable parameters in deep transformer networks, we employ Reinforcement Learning to dynamically select layers during training and tie them together.
