Search Results for author: Cheng Cao

Found 3 papers, 1 papers with code

Efficient Toxic Content Detection by Bootstrapping and Distilling Large Language Models

no code implementations • 13 Dec 2023 • Jiang Zhang, Qiong Wu, Yiming Xu, Cheng Cao, Zheng Du, Konstantinos Psounis

Furthermore, student LMs fine-tuned with rationales extracted via DToT outperform baselines on all datasets with up to 16. 9\% accuracy improvement, while being more than 60x smaller than conventional LLMs.

In-Context Learning

Paper
Add Code

Human Transcription Quality Improvement

1 code implementation • 24 Sep 2023 • Jian Gao, Hanbo Sun, Cheng Cao, Zheng Du

We collect and release LibriCrowd - a large-scale crowdsourced dataset of audio transcriptions on 100 hours of English speech.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Code

HTEC: Human Transcription Error Correction

no code implementations • 18 Sep 2023 • Hanbo Sun, Jian Gao, Xiaomin Wu, Anjie Fang, Cheng Cao, Zheng Du

Therefore, we propose HTEC for Human Transcription Error Correction.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.