no code implementations • 16 Mar 2023 • Boren Hu, Yun Zhu, Jiacheng Li, Siliang Tang
In this paper, we propose a novel dynamic early exiting combined with layer skipping for BERT inference named SmartBERT, which adds a skipping gate and an exiting operator into each layer of BERT.