Search Results for author: Artyom Sorokin

Found 2 papers, 2 papers with code

In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs Miss

2 code implementations16 Feb 2024 Yuri Kuratov, Aydar Bulatov, Petr Anokhin, Dmitry Sorokin, Artyom Sorokin, Mikhail Burtsev

This paper addresses the challenge of processing long documents using generative transformer models.

Explain My Surprise: Learning Efficient Long-Term Memory by Predicting Uncertain Outcomes

1 code implementation27 Jul 2022 Artyom Sorokin, Nazar Buzun, Leonid Pugachev, Mikhail Burtsev

This requires to store prohibitively large intermediate data if a sequence consists of thousands or even millions elements, and as a result, makes learning of very long-term dependencies infeasible.

Cannot find the paper you are looking for? You can Submit a new open access paper.