1 code implementation • 30 Jun 2022 • Jingping Liu, Yuqiu Song, Kui Xue, Hongli Sun, Chao Wang, Lihan Chen, Haiyun Jiang, Jiaqing Liang, Tong Ruan
Specifically, we focus on layer tuning for feed-forward network in the Transformer, namely FL-tuning.
Model Optimization