no code implementations • 8 Jun 2023 • Zhe Bian, Zhe Wang, Wenqiang Han, Kangping Wang
To tackle these issues, we propose a novel token pruning method that retains information from non-crucial tokens by merging them with more crucial tokens, thereby mitigating the impact of pruning on model performance.