no code implementations • 23 Oct 2023 • Daniel Biermann, Fabrizio Palumbo, Morten Goodwin, Ole-Christoffer Granmo
As far as we are aware, no model uses the sequence length reduction step as an additional opportunity to tune the model's performance.