Search Results for author: Yizhe Xiong

Found 4 papers, 1 papers with code

Temporal Scaling Law for Large Language Models

no code implementations27 Apr 2024 Yizhe Xiong, Xiansheng Chen, Xin Ye, Hui Chen, Zijia Lin, Haoran Lian, Jianwei Niu, Guiguang Ding

We first investigate the imbalance of loss on each token positions and develop a reciprocal-law across model scales and training stages.

PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation

no code implementations14 Mar 2024 Yizhe Xiong, Hui Chen, Tianxiang Hao, Zijia Lin, Jungong Han, Yuesong Zhang, Guoxin Wang, Yongjun Bao, Guiguang Ding

Consequently, a simple combination of them cannot guarantee accomplishing both training efficiency and inference efficiency with minimal costs.

Model Compression

Confidence-based Visual Dispersal for Few-shot Unsupervised Domain Adaptation

1 code implementation ICCV 2023 Yizhe Xiong, Hui Chen, Zijia Lin, Sicheng Zhao, Guiguang Ding

To address this issue, recent works consider the Few-shot Unsupervised Domain Adaptation (FUDA) where only a few source samples are labeled, and conduct knowledge transfer via self-supervised learning methods.

Self-Supervised Learning Transfer Learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.