1 code implementation • 2 Nov 2023 • Zhenjie Yang, Xiaosong Jia, Hongyang Li, Junchi Yan
Recently, large language models (LLMs) have demonstrated abilities including understanding context, logical reasoning, and generating answers.
1 code implementation • NeurIPS 2023 • Yazhe Niu, Yuan Pu, Zhenjie Yang, Xueyan Li, Tong Zhou, Jiyuan Ren, Shuai Hu, Hongsheng Li, Yu Liu
Building agents based on tree-search planning capabilities with learned models has achieved remarkable success in classic decision-making problems, such as Go and Atari.
no code implementations • 1 Jun 2023 • Lu Li, Jiafei Lyu, Guozheng Ma, Zilin Wang, Zhenjie Yang, Xiu Li, Zhiheng Li
Though normalization techniques have demonstrated huge success in supervised and unsupervised learning, their applications in visual RL are still scarce.