Search Results for author: Zhitian Xie

Found 2 papers, 1 papers with code

MoDE: A Mixture-of-Experts Model with Mutual Distillation among the Experts

no code implementations31 Jan 2024 Zhitian Xie, Yinger Zhang, Chenyi Zhuang, Qitao Shi, Zhining Liu, Jinjie Gu, Guannan Zhang

However, the gate's routing mechanism also gives rise to narrow vision: the individual MoE's expert fails to use more samples in learning the allocated sub-task, which in turn limits the MoE to further improve its generalization ability.

Lookahead: An Inference Acceleration Framework for Large Language Model with Lossless Generation Accuracy

1 code implementation20 Dec 2023 Yao Zhao, Zhitian Xie, Chen Liang, Chenyi Zhuang, Jinjie Gu

Instead of generating a single token at a time, we propose a Trie-based retrieval and verification mechanism to be able to accept several tokens at a forward step.

Language Modelling Large Language Model +3

Cannot find the paper you are looking for? You can Submit a new open access paper.