1 code implementation • Findings (NAACL) 2022 • Chin-Lun Fu, Zih-Ching Chen, Yun-Ru Lee, Hung-Yi Lee
Transformer-based pre-trained models with millions of parameters require large storage.