1 code implementation • 22 Dec 2023 • Rongsheng Wang, Haoming Chen, Ruizhe Zhou, Yaofei Duan, Kunyan Cai, Han Ma, Jiaxi Cui, Jian Li, Patrick Cheong-Iao Pang, Yapeng Wang, Tao Tan
This work is pioneering in the execution of instruction fine-tuning on a sparse expert-mixed model, marking a significant breakthrough in enhancing the capabilities of this model architecture.
1 code implementation • 20 Jul 2023 • Rongsheng Wang, Yaofei Duan, ChanTong Lam, Jiexi Chen, Jiangsheng Xu, Haoming Chen, Xiaohong Liu, Patrick Cheong-Iao Pang, Tao Tan
General large language models (LLMs) such as ChatGPT have shown remarkable success.
no code implementations • 26 Mar 2021 • Qi-Qiao He, Patrick Cheong-Iao Pang, Yain-Whar Si
Majority of existing works on transfer learning are based on single-source transfer learning due to the availability of open-access large-scale datasets.