no code implementations • CVPR 2021 • Francisco Rivera Valverde, Juana Valeria Hurtado, Abhinav Valada
In this work, we present the novel self-supervised MM-DistillNet framework consisting of multiple teachers that leverage diverse modalities including RGB, depth and thermal images, to simultaneously exploit complementary cues and distill knowledge into a single audio student network.