Search Results for author: Xiaoming Duan

Found 9 papers, 0 papers with code

Pursuit Winning Strategies for Reach-Avoid Games with Polygonal Obstacles

no code implementations10 Mar 2024 Rui Yan, Shuai Mi, Xiaoming Duan, Jintao Chen, Xiangyang Ji

The pursuers cooperate to protect a convex region from the evaders who try to reach the region.

Inverse Reinforcement Learning with Unknown Reward Model based on Structural Risk Minimization

no code implementations27 Dec 2023 Chendi Qu, Jianping He, Xiaoming Duan, Jiming Chen

A simplistic model is less likely to contain the real reward function, while a model with high complexity leads to substantial computation cost and risks overfitting.

Model Selection

Observation-based Optimal Control Law Learning with LQR Reconstruction

no code implementations27 Dec 2023 Chendi Qu, Jianping He, Xiaoming Duan

Designing controllers to generate various trajectories has been studied for years, while recently, recovering an optimal controller from trajectories receives increasing attention.

Multiplayer Homicidal Chauffeur Reach-Avoid Games: A Pursuit Enclosure Function Approach

no code implementations4 Nov 2023 Rui Yan, Xiaoming Duan, Rui Zou, Xin He, Zongying Shi, Francesco Bullo

We propose a cooperative strategy for the pursuers based on subgames for multiple pursuers against one evader and optimal task allocation.

ERP

A Stochastic Surveillance Stackelberg Game: Co-Optimizing Defense Placement and Patrol Strategy

no code implementations28 Aug 2023 Yohan John, Gilberto Diaz-Garcia, Xiaoming Duan, Jason R. Marden, Francesco Bullo

Stochastic patrol routing is known to be advantageous in adversarial settings; however, the optimal choice of stochastic routing strategy is dependent on a model of the adversary.

Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning

no code implementations20 Jan 2023 Haoxuan Pan, Deheng Ye, Xiaoming Duan, Qiang Fu, Wei Yang, Jianping He, Mingfei Sun

We show that, despite such state distribution shift, the policy gradient estimation bias can be reduced in the following three ways: 1) a small learning rate; 2) an adaptive-learning-rate-based optimizer; and 3) KL regularization.

Continuous Control reinforcement-learning +1

Robust Pandemic Control Synthesis with Formal Specifications: A Case Study on COVID-19 Pandemic

no code implementations26 Mar 2021 Zhe Xu, Xiaoming Duan

We provide simulation results in two different scenarios for robust control of the COVID-19 pandemic: one for vaccination control, and another for shield immunity control, with the model parameters estimated from data in Lombardy, Italy.

Policy Evaluation and Seeking for Multi-Agent Reinforcement Learning via Best Response

no code implementations17 Jun 2020 Rui Yan, Xiaoming Duan, Zongying Shi, Yisheng Zhong, Jason R. Marden, Francesco Bullo

With this knowledge we propose a class of perturbed SBRD with the following property: only policies with maximum metric are observed with nonzero probability for a broad class of stochastic games with finite memory.

Multi-agent Reinforcement Learning reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.