no code implementations • 8 Mar 2024 • Zuguang Li, Wen Wu, Shaohua Wu, Wei Wang
Then, a two-layer optimization method is proposed to solve the MIP problem.
2 code implementations • 27 Nov 2023 • Shaohua Wu, Xudong Zhao, Shenling Wang, Jiangang Luo, Lingjun Li, Xi Chen, Bing Zhao, Wei Wang, Tong Yu, Rongguo Zhang, Jiahua Zhang, Chao Wang
In this work, we develop and release Yuan 2. 0, a series of large language models with parameters ranging from 2. 1 billion to 102. 6 billion.
1 code implementation • 10 Oct 2021 • Shaohua Wu, Xudong Zhao, Tong Yu, Rongguo Zhang, Chong Shen, Hongli Liu, Feng Li, Hong Zhu, Jiangang Luo, Liang Xu, Xuanwei Zhang
With this method, Yuan 1. 0, the current largest singleton language model with 245B parameters, achieves excellent performance on thousands GPUs during training, and the state-of-the-art results on NLP tasks.
no code implementations • 26 Sep 2021 • Yifei Qiu, Shaohua Wu, Ying Wang
The monotonicity of the value function in MDP is characterized and then used to show the threshold structure properties of the optimal scheduling policy.