no code implementations • 22 Jan 2022 • Yunling Zheng, Carson Hu, Guang Lin, Meng Yue, Bao Wang, Jack Xin
Due to the sparsified queries, GLassoformer is more computationally efficient than the standard transformers.