Search Results for author: Peter Conway Humphreys

Found 2 papers, 1 papers with code

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

1 code implementation • 2 Apr 2024 • David Raposo, Sam Ritter, Blake Richards, Timothy Lillicrap, Peter Conway Humphreys, Adam Santoro

Our method enforces a total compute budget by capping the number of tokens ($k$) that can participate in the self-attention and MLP computations at a given layer.

1,087

Paper
Code

Evolving Neural Update Rules for Sequence Learning

no code implementations • 29 Sep 2021 • Karol Gregor, Peter Conway Humphreys

We consider the problem of searching, end to end, for effective weight and activation update rules governing online learning of a recurrent network on problems of character sequence memorisation and prediction.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.