WNUT (ACL) 2021 • Shivendra Bhardwaj, Abbas Ghaddar, Ahmad Rashid, Khalil Bibi, Chengyang Li, Ali Ghodsi, Philippe Langlais, Mehdi Rezagholizadeh
Knowledge Distillation (KD) is extensively used to compress and deploy large pre-trained language models on edge devices for real-world applications.
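Neither entry above ships code. As a general illustration of the KD objective the first abstract refers to (the standard Hinton-style distillation loss, not these papers' specific method), a minimal NumPy sketch:

```python
import numpy as np

def softmax(logits, T=1.0):
    # Temperature-scaled softmax; a higher T softens the distribution.
    z = np.asarray(logits, dtype=float) / T
    z -= z.max()  # numerical stability
    e = np.exp(z)
    return e / e.sum()

def distillation_loss(student_logits, teacher_logits, T=2.0):
    # KL(teacher || student) on temperature-softened outputs,
    # scaled by T^2 so gradients stay comparable across temperatures.
    p = softmax(teacher_logits, T)  # soft teacher targets
    q = softmax(student_logits, T)  # student predictions
    return (T ** 2) * float(np.sum(p * (np.log(p) - np.log(q))))
```

In practice this term is mixed with the ordinary cross-entropy on ground-truth labels; when the student matches the teacher exactly, the loss is zero.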
COLING 2020 • Shivendra Bhardwaj, David Alfonso Hermelo, Philippe Langlais, Gabriel Bernier-Colborne, Cyril Goutte, Michel Simard
Deep neural models have tremendously improved machine translation.