no code implementations • 26 Nov 2021 • Zahra Fayyaz, Aya Altamimi, Sen Cheng, Laurenz Wiskott
We assume that attention selects some part of the index matrix while others are discarded, this then represents the gist of the episode and is stored as a memory trace.