no code implementations • 22 Feb 2024 • Xinglin Zhou, Yifu Yuan, Shaofu Yang, Jianye Hao
To address the issue, We propose a general hierarchical reinforcement learning framework incorporating human feedback and dynamic distance constraints (MENTOR).
no code implementations • 6 Dec 2022 • Luyao Guo, Jinde Cao, Xinli Shi, Shaofu Yang
In this paper, we propose a novel primal-dual proximal splitting algorithm (PD-PSA), named BALPA, for the composite optimization problem with equality constraints, where the loss function consists of a smooth term and a nonsmooth term composed with a linear mapping.
no code implementations • 5 Sep 2022 • Luyao Guo, Xinli Shi, Shaofu Yang, Jinde Cao
In this paper, we propose a novel Dual Inexact Splitting Algorithm (DISA) for distributed convex composite optimization problems, where the local loss function consists of a smooth term and a possibly nonsmooth term composed with a linear mapping.