no code implementations • 19 May 2023 • Shiyao Ding, Takayuki Ito
In this paper, we propose Self-Agreement, a novel framework for fine-tuning LLMs to autonomously find agreement using data generated by LLM itself.
no code implementations • 2 Aug 2022 • Jakub Grudzien Kuba, Xidong Feng, Shiyao Ding, Hao Dong, Jun Wang, Yaodong Yang
The necessity for cooperation among intelligent machines has popularised cooperative multi-agent reinforcement learning (MARL) in the artificial intelligence (AI) research community.
no code implementations • 24 Dec 2020 • Patrick Ocheja, Yang Cao, Shiyao Ding, Masatoshi Yoshikawa
How to contain the spread of the COVID-19 virus is a major concern for most countries.
Computers and Society Cryptography and Security 68P27 H.3.4