Search Results for author: Scott M. Jordan

Found 7 papers, 2 papers with code

A New View on Planning in Online Reinforcement Learning

no code implementations3 Jun 2024 Kevin Roice, Parham Mohammad Panahi, Scott M. Jordan, Adam White, Martha White

In this paper, we avoid this limitation by constraining background planning to a set of (abstract) subgoals and learning only local, subgoal-conditioned models.

Model-based Reinforcement Learning reinforcement-learning

From Past to Future: Rethinking Eligibility Traces

no code implementations20 Dec 2023 Dhawal Gupta, Scott M. Jordan, Shreyas Chaudhari, Bo Liu, Philip S. Thomas, Bruno Castro da Silva

In this paper, we introduce a fresh perspective on the challenges of credit assignment and policy evaluation.

Coagent Networks: Generalized and Scaled

no code implementations16 May 2023 James E. Kostas, Scott M. Jordan, Yash Chandak, Georgios Theocharous, Dhawal Gupta, Martha White, Bruno Castro da Silva, Philip S. Thomas

However, the coagent framework is not just an alternative to BDL; the two approaches can be blended: BDL can be combined with coagent learning rules to create architectures with the advantages of both approaches.

Reinforcement Learning (RL)

Robust Markov Decision Processes without Model Estimation

no code implementations2 Feb 2023 Wenhao Yang, Han Wang, Tadashi Kozuno, Scott M. Jordan, Zhihua Zhang

Moreover, we prove the alternative form still plays a similar role as the original form.

Classical Policy Gradient: Preserving Bellman's Principle of Optimality

no code implementations6 Jun 2019 Philip S. Thomas, Scott M. Jordan, Yash Chandak, Chris Nota, James Kostas

We propose a new objective function for finite-horizon episodic Markov decision processes that better captures Bellman's principle of optimality, and provide an expression for the gradient of the objective.

Cannot find the paper you are looking for? You can Submit a new open access paper.