Search Results for author: Hongpeng Cao

Found 7 papers, 1 papers with code

Physical Deep Reinforcement Learning: Safety and Unknown Unknowns

no code implementations26 May 2023 Hongpeng Cao, Yanbing Mao, Lui Sha, Marco Caccamo

In this paper, we propose the Phy-DRL: a physics-model-regulated deep reinforcement learning framework for safety-critical autonomous systems.

reinforcement-learning

Physical Deep Reinforcement Learning Towards Safety Guarantee

no code implementations29 Mar 2023 Hongpeng Cao, Yanbing Mao, Lui Sha, Marco Caccamo

Deep reinforcement learning (DRL) has achieved tremendous success in many complex decision-making tasks of autonomous systems with high-dimensional state and/or action spaces.

Decision Making reinforcement-learning

Sandboxing (AI-based) Unverified Controllers in Stochastic Games: An Abstraction-based Approach with Safe-visor Architecture

no code implementations28 Mar 2022 Bingzhuo Zhong, Hongpeng Cao, Majid Zamani, Marco Caccamo

In this paper, we propose a construction scheme for a Safe-visor architecture for sandboxing unverified controllers, e. g., artificial intelligence-based (a. k. a.

Cloud-Edge Training Architecture for Sim-to-Real Deep Reinforcement Learning

no code implementations4 Mar 2022 Hongpeng Cao, Mirco Theile, Federico G. Wyrwal, Marco Caccamo

To overcome the reality gap, our architecture exploits sim-to-real transfer strategies to continue the training of simulation-pretrained agents on a physical system.

Domain Adaptation reinforcement-learning +1

Safe-visor Architecture for Sandboxing (AI-based) Unverified Controllers in Stochastic Cyber-Physical Systems

no code implementations10 Feb 2021 Bingzhuo Zhong, Abolfazl Lavaei, Hongpeng Cao, Majid Zamani, Marco Caccamo

To cope with this difficulty, we propose in this work a Safe-visor architecture for sandboxing unverified controllers in CPSs operating in noisy environments (a. k. a.

Cannot find the paper you are looking for? You can Submit a new open access paper.