26 Aug 2023 • Sharath Koorathota, Nikolas Papadopoulos, Jia Li Ma, Shruti Kumar, Xiaoxiao Sun, Arunesh Mittal, Patrick Adelman, Paul Sajda
We find that using JSF and FAX improves ViT performance, both in accuracy and in the number of training epochs required.