no code implementations • 8 Sep 2023 • Zuojin Tang, Bo Sun, Tongwei Ma, Daosheng Li, Zhenhui Xu
The teacher network supervises the classification and regression of the student network using a model pre-trained on ImageNet.
1 code implementation • 4 Jun 2021 • Zhenhui Xu, Meng Zhao, Liqun Liu, Lei Xiao, Xiaopeng Zhang, Bifeng Zhang
This paper introduces a novel multi-task model called Mixture of Virtual-Kernel Experts (MVKE) to jointly learn user preferences on various actions and topics.
1 code implementation • 10 Jun 2020 • Zhenhui Xu, Linyuan Gong, Guolin Ke, Di He, Shuxin Zheng, Li-Wei Wang, Jiang Bian, Tie-Yan Liu
Pre-trained contextual representations (e.g., BERT) have become the foundation to achieve state-of-the-art results on many NLP tasks.
2 code implementations • 16 Jul 2019 • Zhenhui Xu, Guolin Ke, Jia Zhang, Jiang Bian, Tie-Yan Liu
Inspired by the nature of expressiveness in neural networks, we propose to use multi-segment activation in the compact student model, which can significantly improve its expressiveness at very little cost.
no code implementations • ICLR 2019 • Guolin Ke, Jia Zhang, Zhenhui Xu, Jiang Bian, Tie-Yan Liu
Since there are no shared patterns among these diverse tabular data, it is hard to design specific structures to fit them all.