Search Results for author: HaoYi Wu

Found 3 papers, 3 papers with code

Layer-Condensed KV Cache for Efficient Inference of Large Language Models

1 code implementation17 May 2024 HaoYi Wu, Kewei Tu

In this paper, we propose a novel method that only computes and caches the KVs of a small number of layers, thus significantly saving memory consumption and improving inference throughput.

Language Modelling

Probabilistic Transformer: A Probabilistic Dependency Model for Contextual Word Representation

1 code implementation26 Nov 2023 HaoYi Wu, Kewei Tu

Specifically, we design a conditional random field that models discrete latent representations of all words in a sentence as well as dependency arcs between them; and we use mean field variational inference for approximate inference.

Sentence Variational Inference

Conic10K: A Challenging Math Problem Understanding and Reasoning Dataset

1 code implementation9 Nov 2023 HaoYi Wu, Wenyang Hui, Yezeng Chen, Weiqi Wu, Kewei Tu, Yi Zhou

Since the dataset only involves a narrow range of knowledge, it is easy to separately analyse the knowledge a model possesses and the reasoning ability it has.

Math Natural Language Understanding

Cannot find the paper you are looking for? You can Submit a new open access paper.