no code implementations • 14 Mar 2024 • Tianqi Zheng, Nicolas Loizou, Pengcheng You, Enrique Mallada
Gradient Descent Ascent (GDA) methods for min-max optimization problems typically produce oscillatory behavior that can lead to instability, e. g., in bilinear settings.
no code implementations • 19 Oct 2023 • Tianqi Zheng, John W. Simpson-Porco, Enrique Mallada
The standard approach to combine these methodologies comprises an offline/open-loop stage, planning, that designs a feasible and safe trajectory to follow, and an online/closed-loop stage, tracking, that corrects for unmodeled dynamics and disturbances.
no code implementations • 3 Dec 2022 • Tianqi Zheng, Pengcheng You, Enrique Mallada
In constrained reinforcement learning (C-RL), an agent seeks to learn from the environment a policy that maximizes the expected cumulative reward while satisfying minimum requirements in secondary cumulative reward constraints.