Search Results for author: Valerie Morris

Found 1 papers, 0 papers with code

Attention-Only Transformers and Implementing MLPs with Attention Heads

no code implementations15 Sep 2023 Robert Huben, Valerie Morris

The transformer architecture is widely used in machine learning models and consists of two alternating sublayers: attention heads and MLPs.

Cannot find the paper you are looking for? You can Submit a new open access paper.