Search Results for author: Aman Timalsina

Found 3 papers, 3 papers with code

Simple linear attention language models balance the recall-throughput tradeoff

1 code implementation · 28 Feb 2024 · Simran Arora, Sabri Eyuboglu, Michael Zhang, Aman Timalsina, Silas Alberti, Dylan Zinsley, James Zou, Atri Rudra, Christopher Ré

In this work, we explore whether we can improve language model efficiency (e.g., by reducing memory consumption) without compromising recall.

Language Modelling · Text Generation
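The efficiency claim in the abstract snippet above rests on a concrete mechanism: causal linear attention replaces the softmax attention matrix with a running outer-product state whose size is independent of sequence length, which is what creates the recall-throughput tradeoff the paper studies. Below is a minimal NumPy sketch of generic causal linear attention, not the paper's released implementation; the feature map `phi` and all names are illustrative assumptions.

```python
import numpy as np

def linear_attention(Q, K, V):
    """Q, K, V: float arrays of shape (seq_len, d). Returns causal outputs (seq_len, d)."""
    phi = lambda x: np.maximum(x, 0.0) + 1e-6  # illustrative non-negative feature map
    d = Q.shape[1]
    S = np.zeros((d, d))   # running sum of outer(phi(k), v): fixed-size state
    z = np.zeros(d)        # running sum of phi(k), used for normalization
    out = np.zeros_like(V)
    for t in range(Q.shape[0]):
        k, v, q = phi(K[t]), V[t], phi(Q[t])
        S += np.outer(k, v)            # state update: O(d^2), independent of t
        z += k
        out[t] = (q @ S) / (q @ z + 1e-6)
    return out
```

The point of the sketch is the memory profile: softmax attention must keep all past keys and values (memory grows with sequence length), while the state here is a single d-by-d matrix, which is exactly the kind of reduced memory consumption the snippet refers to, at some cost in recall.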

Zoology: Measuring and Improving Recall in Efficient Language Models

2 code implementations · 8 Dec 2023 · Simran Arora, Sabri Eyuboglu, Aman Timalsina, Isys Johnson, Michael Poli, James Zou, Atri Rudra, Christopher Ré

To close the gap between synthetic tasks and real language, we develop a new formalization of the task called multi-query associative recall (MQAR) that better reflects actual language.
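To make the MQAR task concrete, here is a hedged sketch of what one such synthetic example might look like: a sequence of key-value token pairs followed by queries of previously seen keys, where the target at each query is the value originally paired with that key. The layout, vocabulary split, and all names are illustrative assumptions, not the paper's exact generator.

```python
import numpy as np

def make_mqar_example(num_pairs=4, num_queries=2, vocab=64, seed=0):
    """Build one MQAR-style example: interleaved key-value pairs, then queries."""
    rng = np.random.default_rng(seed)
    keys = rng.choice(vocab, size=num_pairs, replace=False)      # distinct key tokens
    values = rng.integers(vocab, 2 * vocab, size=num_pairs)      # values from a disjoint range
    queried = rng.choice(num_pairs, size=num_queries, replace=False)
    # Sequence: k1 v1 k2 v2 ... followed by each queried key; the target at a
    # query position is the value that was paired with that key earlier.
    seq = np.ravel(np.column_stack([keys, values])).tolist()
    seq += keys[queried].tolist()
    targets = values[queried].tolist()
    return seq, targets

seq, targets = make_mqar_example()
print(seq, targets)  # a model solves the example by emitting each target at its query
```

The "multi-query" part is what distinguishes this from older single-query associative recall synthetics: several keys are queried per sequence, so a model cannot succeed by caching a single association.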

How to Train Your HiPPO: State Space Models with Generalized Orthogonal Basis Projections

1 code implementation · 24 Jun 2022 · Albert Gu, Isys Johnson, Aman Timalsina, Atri Rudra, Christopher Ré

Linear time-invariant state space models (SSMs) are a classical model class from engineering and statistics that has recently been shown to be very promising in machine learning through the Structured State Space sequence model (S4).

Long-range modeling
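For readers unfamiliar with the model class the snippet names, an S4-style model unrolls a discretized linear time-invariant system x_k = Ā x_{k-1} + B̄ u_k, y_k = C x_k over the input sequence. The NumPy sketch below shows that generic recurrence with a standard bilinear (Tustin) discretization; it is an assumption-laden illustration and deliberately omits the HiPPO-specific choice of A and B via orthogonal basis projections, which is the paper's actual subject.

```python
import numpy as np

def ssm_scan(A, B, C, u, dt=1.0):
    """A: (n, n), B: (n, 1), C: (1, n), u: (seq_len,) input. Returns y: (seq_len,)."""
    n = A.shape[0]
    I = np.eye(n)
    # Bilinear discretization of the continuous system x' = Ax + Bu, y = Cx.
    Abar = np.linalg.solve(I - dt / 2 * A, I + dt / 2 * A)
    Bbar = np.linalg.solve(I - dt / 2 * A, dt * B)
    x = np.zeros((n, 1))
    y = np.zeros(len(u))
    for k, u_k in enumerate(u):
        x = Abar @ x + Bbar * u_k   # linear recurrence over the hidden state
        y[k] = (C @ x).item()
    return y
```

The long-range-modeling tag above reflects why the choice of A matters: with HiPPO-style matrices, the hidden state maintains a projection of the input history onto an orthogonal basis, letting the recurrence retain information over very long sequences.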
