1 code implementation • Findings (ACL) 2022 • Houman Mehrafarin, Sara Rajaee, Mohammad Taher Pilehvar
The analysis also reveals that larger training data mainly affects higher layers, and that the extent of this change is a factor of the number of iterations updating the model during fine-tuning rather than the diversity of the training samples.