no code implementations • 27 Apr 2024 • Haoran Lian, Yizhe Xiong, Jianwei Niu, Shasha Mo, Zhenpeng Su, Zijia Lin, Peng Liu, Hui Chen, Guiguang Ding
Due to their infrequent appearance in the text corpus, Scaffold Tokens pose a learning imbalance issue for language models.
1 code implementation • 30 Oct 2023 • Zhenpeng Su, Xing Wu, Xue Bai, Zijia Lin, Hui Chen, Guiguang Ding, Wei Zhou, Songlin Hu
Experiments reveal that models incorporating the proposed MiLe Loss can gain consistent performance improvement on downstream benchmarks.
Ranked #89 on Multi-task Language Understanding on MMLU
1 code implementation • 6 Sep 2023 • Zhenpeng Su, Xing Wu, Wei Zhou, Guangyuan Ma, Songlin Hu
ChatGPT has gained significant interest due to its impressive performance, but people are increasingly concerned about its potential risks, particularly around the detection of AI-generated content (AIGC), which is often difficult for untrained humans to identify.
1 code implementation • 14 Jun 2023 • Yuntao Li, Zhenpeng Su, Yutian Li, Hanchu Zhang, Sirui Wang, Wei Wu, Yan Zhang
Translating natural language queries into SQL queries in a seq2seq manner has attracted much attention recently.
Ranked #8 on Text-To-SQL on Spider
1 code implementation • 7 Jun 2023 • Zhenpeng Su, Xing Wu, Wei Zhou, Guangyuan Ma, Songlin Hu
Dialogue response selection aims to select an appropriate response from several candidates based on a given user and system utterance history.
Ranked #1 on Conversational Response Selection on E-commerce