Search Results for author: Zhenpeng Su

Found 5 papers, 4 papers with code

Scaffold-BPE: Enhancing Byte Pair Encoding with Simple and Effective Scaffold Token Removal

no code implementations • 27 Apr 2024 • Haoran Lian, Yizhe Xiong, Jianwei Niu, Shasha Mo, Zhenpeng Su, Zijia Lin, Peng Liu, Hui Chen, Guiguang Ding

Due to their infrequent appearance in the text corpus, Scaffold Tokens pose a learning imbalance issue for language models.

Language Modelling Machine Translation

Paper
Add Code

MiLe Loss: a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models

1 code implementation • 30 Oct 2023 • Zhenpeng Su, Xing Wu, Xue Bai, Zijia Lin, Hui Chen, Guiguang Ding, Wei Zhou, Songlin Hu

Experiments reveal that models incorporating the proposed MiLe Loss can gain consistent performance improvement on downstream benchmarks.

Ranked #89 on Multi-task Language Understanding on MMLU

Language Modelling Multi-task Language Understanding

Paper
Code

HC3 Plus: A Semantic-Invariant Human ChatGPT Comparison Corpus

1 code implementation • 6 Sep 2023 • Zhenpeng Su, Xing Wu, Wei Zhou, Guangyuan Ma, Songlin Hu

ChatGPT has gained significant interest due to its impressive performance, but people are increasingly concerned about its potential risks, particularly around the detection of AI-generated content (AIGC), which is often difficult for untrained humans to identify.

Question Answering