28 Sep 2021 • Sunah Min, Jinyoung Moon
Consequently, the forget gate of the original LSTM can lose the accumulated information relevant to the current action because it determines which information to forget without considering the current action.
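To make the mechanism concrete, here is a minimal sketch of the forget gate in the standard textbook LSTM formulation, written in plain Python. It is not the authors' code or their proposed method. The function names, weights, and input values are all hypothetical and chosen only for illustration. The gate is a sigmoid of a learned affine function of the previous hidden state and the current input, and its output multiplies the previous cell state element-wise. No term explicitly measures how relevant each stored entry is to the ongoing action.

```python
import math


def sigmoid(z):
    """Logistic function, maps any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def forget_gate(W_h, W_x, b, h_prev, x_t):
    """Standard LSTM forget gate: f_t = sigmoid(W_h h_prev + W_x x_t + b).

    W_h and W_x are lists of rows (one row per hidden unit).
    """
    gate = []
    for row_h, row_x, bias in zip(W_h, W_x, b):
        z = bias
        z += sum(w * h for w, h in zip(row_h, h_prev))
        z += sum(w * x for w, x in zip(row_x, x_t))
        gate.append(sigmoid(z))
    return gate


def retained_cell_state(c_prev, f_t):
    """Element-wise product f_t * c_prev: the part of the memory that survives."""
    return [f * c for f, c in zip(f_t, c_prev)]


# Hypothetical weights and states, for illustration only.
W_h = [[0.5, -0.2], [0.1, 0.3]]
W_x = [[-4.0, 0.0], [0.0, 4.0]]
b = [0.0, 0.0]
h_prev = [0.2, 0.1]   # previous hidden state
x_t = [1.0, 1.0]      # current input features
c_prev = [0.9, 0.9]   # accumulated cell state

f_t = forget_gate(W_h, W_x, b, h_prev, x_t)
kept = retained_cell_state(c_prev, f_t)
print("forget gate:", f_t)
print("retained cell state:", kept)
```

With these hypothetical weights, the first unit's gate is close to 0 and the second is close to 1, so the first cell-state entry is almost entirely erased while the second is almost entirely kept. If the first entry happened to encode information about the ongoing action, it would be lost all the same, which is the failure mode the sentence above describes.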