no code implementations • 6 Feb 2024 • Ningyuan Tang, Minghao Fu, Ke Zhu, Jianxin Wu
Because learnable parameters from these methods are entangled with the pretrained model, gradients related to the frozen pretrained model's parameters have to be computed and stored during finetuning.