Search Results for author: Chenyu Jiang

Found 3 papers, 1 papers with code

DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines

2 code implementations17 Nov 2023 Chenyu Jiang, Zhen Jia, Shuai Zheng, Yida Wang, Chuan Wu

This paper proposes a dynamic micro-batching approach to tackle sequence length variation and enable efficient multi-task model training.

Language Modelling Large Language Model +3

dPRO: A Generic Profiling and Optimization System for Expediting Distributed DNN Training

no code implementations5 May 2022 Hanpeng Hu, Chenyu Jiang, Yuchen Zhong, Yanghua Peng, Chuan Wu, Yibo Zhu, Haibin Lin, Chuanxiong Guo

Distributed training using multiple devices (e. g., GPUs) has been widely adopted for learning DNN models over large datasets.

Cannot find the paper you are looking for? You can Submit a new open access paper.