no code implementations • 6 Mar 2023 • Kun Li, Zhixi Shen, Gangshan Jing, Yongduan Song
Angle-constrained formation control has attracted much attention from control community due to the advantage that inter-edge angles are invariant under uniform translations, rotations, and scalings of the whole formation.
no code implementations • 26 Feb 2022 • Gangshan Jing, He Bai, Jemin George, Aranya Chakrabortty, Piyush K. Sharma
Achieving distributed reinforcement learning (RL) for large-scale cooperative multi-agent systems (MASs) is challenging because: (i) each agent has access to only limited information; (ii) issues on convergence or computational complexity emerge due to the curse of dimensionality.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 10 Jan 2022 • Gangshan Jing, He Bai, Jemin George, Aranya Chakrabortty, Piyush. K. Sharma
In this work, we study MARLs with directed coordination graphs, and propose a distributed RL algorithm where the local policy evaluations are based on local value functions.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 26 Jul 2021 • Gangshan Jing, He Bai, Jemin George, Aranya Chakrabortty, Piyush K. Sharma
Recently introduced distributed zeroth-order optimization (ZOO) algorithms have shown their utility in distributed reinforcement learning (RL).
no code implementations • 16 Oct 2020 • Gangshan Jing, He Bai, Jemin George, Aranya Chakrabortty
Conditions for decomposability, an algorithm for constructing the transformation matrix, a parallel RL algorithm, and robustness analysis when the design is applied to non-homogeneous MAS are presented.
Hierarchical Reinforcement Learning Reinforcement Learning (RL)
no code implementations • 14 Aug 2020 • Gangshan Jing, He Bai, Jemin George, Aranya Chakrabortty
The first component optimizes the performance of each independent cluster by solving the smaller-size LQR design problem in a model-free way using an RL algorithm.
no code implementations • 3 Dec 2019 • Gangshan Jing, Changhuang Wan, Ran Dai
Graphical conditions for equivalence of the formulated rank-constrained SDP and a linear SDP, decomposition of the SDP, as well as the effectiveness of the distributed protocol, are proposed, respectively.