Search Results for author: Yifan Yanggong

Found 3 papers, 1 papers with code

Mastering the Game of Guandan with Deep Reinforcement Learning and Behavior Regulating

no code implementations21 Feb 2024 Yifan Yanggong, Hao Pan, Lei Wang

Games are a simplified model of reality and often serve as a favored platform for Artificial Intelligence (AI) research.

Decision Making

Maybe Only 0.5% Data is Needed: A Preliminary Exploration of Low Training Data Instruction Tuning

no code implementations16 May 2023 Hao Chen, Yiming Zhang, Qi Zhang, Hantao Yang, Xiaomeng Hu, Xuetao Ma, Yifan Yanggong, Junbo Zhao

Instruction tuning for large language models (LLMs) has gained attention from researchers due to its ability to unlock the potential of LLMs in following instructions.

Assessing Hidden Risks of LLMs: An Empirical Study on Robustness, Consistency, and Credibility

1 code implementation15 May 2023 Wentao Ye, Mingfeng Ou, Tianyi Li, Yipeng chen, Xuetao Ma, Yifan Yanggong, Sai Wu, Jie Fu, Gang Chen, Haobo Wang, Junbo Zhao

With most of the related literature in the era of LLM uncharted, we propose an automated workflow that copes with an upscaled number of queries/responses.

Memorization

Cannot find the paper you are looking for? You can Submit a new open access paper.