no code implementations • 9 Mar 2020 • Wonchul Son, Youngbin Kim, Wonseok Song, Youngsu Moon, Wonjun Hwang
We note three points about training the student model that arise from applying the on-the-fly filter.
Knowledge Distillation • Model Compression