no code implementations • 22 Mar 2024 • Haoyu Wang, Xiaoyu Tan, Xihe Qiu, Chao Qu
Effective coordination is crucial for motion control with reinforcement learning, especially as the complexity of agents and their motions increases.
no code implementations • 9 Dec 2023 • Zhenting Qi, Xiaoyu Tan, Shaojie Shi, Chao Qu, Yinghui Xu, Yuan Qi
Instruction fine-tuning has conventionally been employed to adapt Large Language Models (LLMs) to a variety of tasks.
1 code implementation • ICCV 2023 • Xihe Qiu, Shaojie Shi, Xiaoyu Tan, Chao Qu, Zhijun Fang, Hailing Wang, Yongbin Gao, Peixia Wu, Huawei Li
Video nystagmography (VNG) is the diagnostic gold standard of benign paroxysmal positional vertigo (BPPV), which requires medical professionals to examine the direction, frequency, intensity, duration, and variation in the strength of nystagmus on a VNG video.
1 code implementation • 31 May 2022 • Siqiao Xue, Chao Qu, Xiaoming Shi, Cong Liao, Shiyi Zhu, Xiaoyu Tan, Lintao Ma, Shiyu Wang, Shijun Wang, Yun Hu, Lei Lei, Yangfei Zheng, Jianguo Li, James Zhang
Predictive autoscaling (autoscaling with workload forecasting) is an important mechanism that supports autonomous adjustment of computing resources in accordance with fluctuating workload demands in the Cloud.
1 code implementation • 29 Jan 2022 • Chao Qu, Xiaoyu Tan, Siqiao Xue, Xiaoming Shi, James Zhang, Hongyuan Mei
We consider a sequential decision making problem where the agent faces the environment characterized by the stochastic discrete events and seeks an optimal intervention policy such that its long-term reward is maximized.
no code implementations • 16 Jun 2020 • Xiaoyu Tan, Chao Qu, Junwu Xiong, James Zhang
Model-based reinforcement learning (MBRL) has shown its advantages in sample-efficiency over model-free reinforcement learning (MFRL).
Model-based Reinforcement Learning reinforcement-learning +1
no code implementations • 25 Sep 2019 • Xiaoyu Tan, Chao Qu, Junwu Xiong, James Zhang
In this paper, we propose a simple and elegant model-based reinforcement learning algorithm called soft stochastic value gradient method (S2VG).
Model-based Reinforcement Learning reinforcement-learning +1