Search Results for author: Peter Conway Humphreys

Found 2 papers, 1 papers with code

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

1 code implementation2 Apr 2024 David Raposo, Sam Ritter, Blake Richards, Timothy Lillicrap, Peter Conway Humphreys, Adam Santoro

Our method enforces a total compute budget by capping the number of tokens ($k$) that can participate in the self-attention and MLP computations at a given layer.

Evolving Neural Update Rules for Sequence Learning

no code implementations29 Sep 2021 Karol Gregor, Peter Conway Humphreys

We consider the problem of searching, end to end, for effective weight and activation update rules governing online learning of a recurrent network on problems of character sequence memorisation and prediction.

Cannot find the paper you are looking for? You can Submit a new open access paper.