no code implementations • 3 Apr 2024 • Ye Yuan, Kexin Tang, Jianhao Shen, Ming Zhang, Chenguang Wang
This enables the direct comparison of the social understanding of large language models to humans, more specifically, elementary students.
no code implementations • 24 Jan 2024 • Guoxin Chen, Kexin Tang, Chao Yang, Fuying Ye, Yu Qiao, Yiming Qian
Moreover, existing reinforcement learning (RL) based methods overlook the structured relationships, underutilizing the potential of RL in structured reasoning.
no code implementations • NeurIPS 2019 • Hoi-To Wai, Mingyi Hong, Zhuoran Yang, Zhaoran Wang, Kexin Tang
Policy evaluation with smooth and nonlinear function approximation has shown great potential for reinforcement learning.