1 code implementation • 27 Mar 2024 • Chengbo Liu, Yong Zhu
The core strategies involve: 1) Fine-tune the model by incorporating semantic adaptive tokens that possess flexible decoding capabilities without changing its structure, allowing them to generate high-quality draft tokens.