Search Results for author: Homayoon Farrahi

Found 2 papers, 2 papers with code

Reducing the Cost of Cycle-Time Tuning for Real-World Policy Optimization

1 code implementation9 May 2023 Homayoon Farrahi, A. Rupam Mahmood

In this work, we investigate the widely-used baseline hyper-parameter values of two policy gradient algorithms -- PPO and SAC -- across different cycle times.

Model-free Policy Learning with Reward Gradients

1 code implementation9 Mar 2021 Qingfeng Lan, Samuele Tosatto, Homayoon Farrahi, A. Rupam Mahmood

As a key component in reinforcement learning, the reward function is usually devised carefully to guide the agent.

Continuous Control Policy Gradient Methods

Cannot find the paper you are looking for? You can Submit a new open access paper.