Search Results for author: Yongcan Cao

Found 13 papers, 1 papers with code

3D Guidance Law for Maximal Coverage and Target Enclosing with Inherent Safety

no code implementations25 Apr 2024 Praveen Kumar Ranjan, Abhinav Sinha, Yongcan Cao

In this paper, we address the problem of enclosing an arbitrarily moving target in three dimensions by a single pursuer, which is an unmanned aerial vehicle (UAV), for maximum coverage while also ensuring the pursuer's safety by preventing collisions with the target.

Self-organizing Multiagent Target Enclosing under Limited Information and Safety Guarantees

no code implementations6 Apr 2024 Praveen Kumar Ranjan, Abhinav Sinha, Yongcan Cao

The proposed control eliminates the need for a fixed or pre-established agent arrangement around the target and requires only relative information between an agent and the target.

Collision Avoidance

Rating-based Reinforcement Learning

no code implementations30 Jul 2023 Devin White, Mingkang Wu, Ellen Novoseller, Vernon J. Lawhern, Nicholas Waytowich, Yongcan Cao

This paper develops a novel rating-based reinforcement learning approach that uses human ratings to obtain human guidance in reinforcement learning.

reinforcement-learning

Fairness in Preference-based Reinforcement Learning

no code implementations16 Jun 2023 Umer Siddique, Abhinav Sinha, Yongcan Cao

Toward this objective, we design a new fairness-induced preference-based reinforcement learning or FPbRL.

Fairness reinforcement-learning

Generalized Maximum Entropy Reinforcement Learning via Reward Shaping

no code implementations29 Sep 2021 Feng Tao, Yongcan Cao

We also show the addition of the agent’s policy entropy at the next state yields new soft Q function and state value function that are concise and modular.

reinforcement-learning Reinforcement Learning (RL)

Human-guided Robot Behavior Learning: A GAN-assisted Preference-based Reinforcement Learning Approach

1 code implementation15 Oct 2020 Huixin Zhan, Feng Tao, Yongcan Cao

To reduce and minimize the need for human queries, we propose a new GAN-assisted human preference-based reinforcement learning approach that uses a generative adversarial network (GAN) to actively learn human preferences and then replace the role of human in assigning preferences.

Generative Adversarial Network reinforcement-learning +1

Graph Based Multi-layer K-means++ (G-MLKM) for Sensory Pattern Analysis in Constrained Spaces

no code implementations21 Sep 2020 Feng Tao, Rengan Suresh, Johnathan Votion, Yongcan Cao

Based on the dual graph and graph theory, we then generalize MLKM to G-MLKM by first understanding local data-target association and then extracting cross-local data-target association mathematically analyze the data association at intersections of that space.

Clustering

Learn to Exceed: Stereo Inverse Reinforcement Learning with Concurrent Policy Optimization

no code implementations21 Sep 2020 Feng Tao, Yongcan Cao

In this paper, we study the problem of obtaining a control policy that can mimic and then outperform expert demonstrations in Markov decision processes where the reward function is unknown to the learning agent.

reinforcement-learning Reinforcement Learning (RL)

Deep Model Compression Via Two-Stage Deep Reinforcement Learning

no code implementations4 Dec 2019 Huixin Zhan, Wei-Ming Lin, Yongcan Cao

Besides accuracy, the model size of convolutional neural networks (CNN) models is another important factor considering limited hardware resources in practical applications.

Autonomous Driving Model Compression +6

Relationship Explainable Multi-objective Optimization Via Vector Value Function Based Reinforcement Learning

no code implementations2 Oct 2019 Huixin Zhan, Yongcan Cao

Solving multi-objective optimization problems is important in various applications where users are interested in obtaining optimal policies subject to multiple, yet often conflicting objectives.

reinforcement-learning Reinforcement Learning (RL)

A Multi-Layer K-means Approach for Multi-Sensor Data Pattern Recognition in Multi-Target Localization

no code implementations30 May 2017 Samuel Silva, Rengan Suresh, Feng Tao, Johnathan Votion, Yongcan Cao

Data-target association is an important step in multi-target localization for the intelligent operation of un- manned systems in numerous applications such as search and rescue, traffic management and surveillance.

Clustering Management

Cannot find the paper you are looking for? You can Submit a new open access paper.