1 code implementation • ICLR 2021 • Jonathan Pilault, Amine Elhattami, Christopher Pal
Through this construction (a hypernetwork adapter), we achieve more efficient parameter sharing and mitigate forgetting by keeping half of the weights of a pretrained model fixed.
Ranked #1 on Natural Language Inference on SciTail