1 code implementation • 13 Dec 2023 • Yanling Tian, Di Chen, Yunan Liu, Jian Yang, Shanshan Zhang
To the best of our knowledge, this is the first work that investigates how to support full-task pre-training using sub-task data.
no code implementations • 23 Sep 2022 • Yanling Tian, Di Chen, Yunan Liu, Shanshan Zhang, Jian Yang
A straightforward solution is to manually assign different weights to different tasks, compensating for the diverse convergence rates.