1 code implementation • 15 Apr 2024 • Divyang Doshi, Jung-eun Kim
In our work, we propose an efficient method for generating these soft labels, thereby eliminating the need for a large teacher model.
Knowledge Distillation