1 code implementation • 17 Jul 2023 • Alvin Wan, Hanxiang Hao, Kaushik Patnaik, Yueyang Xu, Omer Hadad, David Güera, Zhile Ren, Qi Shan
However, for multi-branch segments of a model, channel removal can introduce inference-time memory copies.
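To illustrate where such a copy can come from, here is a minimal sketch. It assumes NumPy arrays as a stand-in for framework tensors, and the channel sets `keep_a` and `keep_b` are hypothetical. It is not the authors' implementation. Two branches read the same activation but keep different input channels: a contiguous channel range can be sliced as a view, while a scattered set has to be gathered into a new buffer.

```python
import numpy as np

# Hypothetical activation with 6 channels, shape (channels, height, width).
x = np.arange(6 * 2 * 2, dtype=np.float32).reshape(6, 2, 2)

# Hypothetical pruning result: two branches consume the same tensor x,
# but each keeps a different subset of its input channels.
keep_a = np.array([0, 1, 2, 3])  # contiguous prefix of channels
keep_b = np.array([0, 2, 5])     # scattered channels

# A contiguous channel range can be taken with a slice, which is a view on x.
in_a = x[: len(keep_a)]

# Scattered channels need integer-array indexing, which allocates a copy.
in_b = x[keep_b]

a_is_view = np.shares_memory(in_a, x)
b_is_view = np.shares_memory(in_b, x)

print("branch A shares memory with x:", a_is_view)
print("branch B shares memory with x:", b_is_view)
```

In this sketch only branch B pays for a gather at inference time. If both branches kept the same channels, the producer could emit just those channels and no per-branch copy would be needed.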