Search Results for author: Zhitian Xie

Found 2 papers, 1 paper with code

MoDE: A Mixture-of-Experts Model with Mutual Distillation among the Experts

no code implementations • 31 Jan 2024 • Zhitian Xie, Yinger Zhang, Chenyi Zhuang, Qitao Shi, Zhining Liu, Jinjie Gu, Guannan Zhang

However, the gate's routing mechanism also gives rise to narrow vision: each individual expert in the MoE fails to use more samples in learning its allocated sub-task, which in turn limits how far the MoE can improve its generalization ability.

Lookahead: An Inference Acceleration Framework for Large Language Model with Lossless Generation Accuracy

1 code implementation • 20 Dec 2023 • Yao Zhao, Zhitian Xie, Chenyi Zhuang, Jinjie Gu

Hence, this paper presents a generic framework for accelerating the inference process, resulting in a substantial increase in speed and cost reduction for our RAG system, with lossless generation accuracy.

Language Modelling • Large Language Model • +3
