Search Results for author: Cheng Cao

Found 3 papers, 1 papers with code

Efficient Toxic Content Detection by Bootstrapping and Distilling Large Language Models

no code implementations13 Dec 2023 Jiang Zhang, Qiong Wu, Yiming Xu, Cheng Cao, Zheng Du, Konstantinos Psounis

Furthermore, student LMs fine-tuned with rationales extracted via DToT outperform baselines on all datasets with up to 16. 9\% accuracy improvement, while being more than 60x smaller than conventional LLMs.

In-Context Learning

Human Transcription Quality Improvement

1 code implementation24 Sep 2023 Jian Gao, Hanbo Sun, Cheng Cao, Zheng Du

We collect and release LibriCrowd - a large-scale crowdsourced dataset of audio transcriptions on 100 hours of English speech.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Cannot find the paper you are looking for? You can Submit a new open access paper.