no code implementations • 2 Apr 2024 • Ruqi Liao, Chuqing Zhao, Jin Li, Weiqi Feng
In response to the rising interest in large multimodal models, we introduce Cross-Attention Token Pruning (CATP), a precision-focused token pruning method.
Computational Efficiency