Search Results for author: Zecheng Wang

Found 4 papers, 2 papers with code

Checkpoint Merging via Bayesian Optimization in LLM Pretraining

no code implementations • 28 Mar 2024 • Deyuan Liu, Zecheng Wang, Bingning Wang, WeiPeng Chen, Chunshan Li, Zhiying Tu, Dianhui Chu, Bo Li, Dianbo Sui

The rapid proliferation of large language models (LLMs) such as GPT-4 and Gemini underscores the intense demand for resources during their training processes, posing significant challenges due to substantial computational and environmental costs.

Bayesian Optimization

Pre-training with Synthetic Data Helps Offline Reinforcement Learning

no code implementations • 1 Oct 2023 • Zecheng Wang, Che Wang, Zixuan Dong, Keith Ross

Recently, it has been shown that for offline deep reinforcement learning (DRL), pre-training Decision Transformer with a large language corpus can improve downstream performance (Reid et al., 2022).

D4RL, Q-Learning +1

Robust Unstructured Knowledge Access in Conversational Dialogue with ASR Errors

1 code implementation • 8 Nov 2022 • Yik-Cheung Tam, Jiacheng Xu, Jiakai Zou, Zecheng Wang, Tinglong Liao, Shuhan Yuan

Knowledge cluster classification is boosted from 0.7924 to 0.9333 in Recall@1.

Automatic Speech Recognition (ASR) +3

Suffix Retrieval-Augmented Language Modeling

1 code implementation • 6 Nov 2022 • Zecheng Wang, Yik-Cheung Tam

SUREALM employs an embedding retriever to search for training sentences in a data store that share similar word history during sequence generation.

Causal Language Modeling, Language Modelling +2

