no code implementations • 25 Apr 2024 • Praveen Kumar Ranjan, Abhinav Sinha, Yongcan Cao
In this paper, we address the problem of enclosing an arbitrarily moving target in three dimensions by a single pursuer, which is an unmanned aerial vehicle (UAV), for maximum coverage while also ensuring the pursuer's safety by preventing collisions with the target.
no code implementations • 6 Apr 2024 • Praveen Kumar Ranjan, Abhinav Sinha, Yongcan Cao
The proposed control eliminates the need for a fixed or pre-established agent arrangement around the target and requires only relative information between an agent and the target.
no code implementations • 30 Jul 2023 • Devin White, Mingkang Wu, Ellen Novoseller, Vernon J. Lawhern, Nicholas Waytowich, Yongcan Cao
This paper develops a novel rating-based reinforcement learning approach that uses human ratings to obtain human guidance in reinforcement learning.
no code implementations • 16 Jun 2023 • Umer Siddique, Abhinav Sinha, Yongcan Cao
Toward this objective, we design a new fairness-induced preference-based reinforcement learning or FPbRL.
no code implementations • 29 Sep 2021 • Feng Tao, Yongcan Cao
We also show the addition of the agent’s policy entropy at the next state yields new soft Q function and state value function that are concise and modular.
1 code implementation • 15 Oct 2020 • Huixin Zhan, Feng Tao, Yongcan Cao
To reduce and minimize the need for human queries, we propose a new GAN-assisted human preference-based reinforcement learning approach that uses a generative adversarial network (GAN) to actively learn human preferences and then replace the role of human in assigning preferences.
no code implementations • 21 Sep 2020 • Feng Tao, Rengan Suresh, Johnathan Votion, Yongcan Cao
Based on the dual graph and graph theory, we then generalize MLKM to G-MLKM by first understanding local data-target association and then extracting cross-local data-target association mathematically analyze the data association at intersections of that space.
no code implementations • 21 Sep 2020 • Feng Tao, Yongcan Cao
In this paper, we study the problem of obtaining a control policy that can mimic and then outperform expert demonstrations in Markov decision processes where the reward function is unknown to the learning agent.
no code implementations • 4 Dec 2019 • Huixin Zhan, Wei-Ming Lin, Yongcan Cao
Besides accuracy, the model size of convolutional neural networks (CNN) models is another important factor considering limited hardware resources in practical applications.
no code implementations • 2 Oct 2019 • Huixin Zhan, Yongcan Cao
Solving multi-objective optimization problems is important in various applications where users are interested in obtaining optimal policies subject to multiple, yet often conflicting objectives.
no code implementations • 26 Sep 2019 • Huixin Zhan, Yongcan Cao
We demonstrate the effectiveness of the proposed approach via a MuJoCo based robotics case study.
Multi-Objective Reinforcement Learning reinforcement-learning
no code implementations • 30 May 2017 • Samuel Silva, Rengan Suresh, Feng Tao, Johnathan Votion, Yongcan Cao
Data-target association is an important step in multi-target localization for the intelligent operation of un- manned systems in numerous applications such as search and rescue, traffic management and surveillance.
no code implementations • 28 Feb 2017 • Kasthurirengan Suresh, Samuel Silva, Johnathan Votion, Yongcan Cao
Data-target pairing is an important step towards multi-target localization for the intelligent operation of unmanned systems.