Policy Gradient Methods
90 papers with code • 0 benchmarks • 2 datasets
Benchmarks
These leaderboards are used to track progress in Policy Gradient Methods
Libraries
Use these libraries to find Policy Gradient Methods models and implementations.
Latest papers with no code
Stabilizing Policy Gradients for Stochastic Differential Equations via Consistency with Perturbation Process
When policy gradients are applied to SDEs, however, the gradient is estimated from a finite set of trajectories and can be ill-defined, leaving the policy's behavior uncontrolled in data-scarce regions.
Towards Provable Log Density Policy Gradient
In this work, we argue that this residual term is significant and correcting for it could potentially improve sample-complexity of reinforcement learning methods.
Reusing Historical Trajectories in Natural Policy Gradient via Importance Sampling: Convergence and Convergence Rate
The efficient utilization of historical trajectories obtained from previous policies is essential for expediting policy optimization.
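The snippet above concerns reusing trajectories gathered under earlier policies. A standard way to do this is to reweight old samples by the likelihood ratio between the current and the behavior policy. The following is a minimal sketch of an importance-weighted score-function gradient on a toy two-armed bandit; the policies, rewards, and sample sizes are illustrative, not taken from the paper:

```python
import numpy as np

rng = np.random.default_rng(1)

def softmax(x):
    z = np.exp(x - x.max())
    return z / z.sum()

# Historical data collected under an old behavior policy pi_old (illustrative).
theta_old = np.array([0.0, 0.0])
p_old = softmax(theta_old)
actions = rng.choice(2, size=1000, p=p_old)
rewards = np.where(actions == 0, 1.0, 0.2)   # arm 0 is better

# Current target policy pi_new; reuse the old trajectories via importance weights.
theta_new = np.array([0.5, -0.5])
p_new = softmax(theta_new)
w = p_new[actions] / p_old[actions]          # likelihood ratios pi_new / pi_old

# Importance-weighted score-function estimate of grad J(theta_new).
grad = np.zeros(2)
for a, r, wi in zip(actions, rewards, w):
    g = -p_new.copy()
    g[a] += 1.0                              # grad log pi_new(a) for a softmax policy
    grad += wi * r * g
grad /= len(actions)
print(grad)                                  # should point toward the better arm
```

The likelihood ratio makes the estimate unbiased for the new policy's objective, at the cost of extra variance when the two policies diverge, which is the trade-off the convergence analysis in such work addresses.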
When Do Off-Policy and On-Policy Policy Gradient Methods Align?
A well-established off-policy objective is the excursion objective.
Identifying Policy Gradient Subspaces
Policy gradient methods hold great potential for solving complex continuous control tasks.
Global Convergence of Natural Policy Gradient with Hessian-aided Momentum Variance Reduction
Natural policy gradient (NPG) and its variants are widely used policy search methods in reinforcement learning.
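NPG preconditions the vanilla gradient with the (pseudo-)inverse of the Fisher information matrix, which removes the dependence on how the policy is parameterized. A small sketch on a two-action softmax policy, with illustrative rewards (not from the paper):

```python
import numpy as np

def softmax(x):
    z = np.exp(x - x.max())
    return z / z.sum()

theta = np.array([0.3, -0.3])
rewards = np.array([1.0, 0.2])           # illustrative per-action rewards
p = softmax(theta)

# Vanilla policy gradient of expected reward: sum_a p(a) r(a) grad log pi(a).
grads = np.eye(2) - p                    # row a is grad log pi(a) for a softmax
vanilla = sum(p[a] * rewards[a] * grads[a] for a in range(2))

# Fisher information matrix F = E[grad log pi grad log pi^T].
F = sum(p[a] * np.outer(grads[a], grads[a]) for a in range(2))

# Natural gradient: F^+ g (pseudo-inverse, since the softmax Fisher is singular
# along the constant-shift direction of the logits).
natural = np.linalg.pinv(F) @ vanilla
print(natural)
```

For this two-action softmax the natural gradient works out to ((r0 - r1)/2, -(r0 - r1)/2) regardless of the current logits, illustrating the parameterization invariance that motivates NPG, whereas the vanilla gradient shrinks as the policy nears determinism.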
Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence Beyond the Minty Property
Policy gradient methods enjoy strong practical performance in numerous tasks in reinforcement learning.
Privacy Preserving Multi-Agent Reinforcement Learning in Supply Chains
To tackle this challenge, we propose a game-theoretic, privacy-preserving mechanism, utilizing a secure multi-party computation (MPC) framework in MARL settings.
RL Dreams: Policy Gradient Optimization for Score Distillation based 3D Generation
Further, the recent Denoising Diffusion Policy Optimization (DDPO) work shows that the diffusion process is compatible with policy gradient methods and can improve 2D diffusion models using an aesthetic scoring function.
Score-Aware Policy-Gradient Methods and Performance Guarantees using Local Lyapunov Conditions: Applications to Product-Form Stochastic Networks and Queueing Systems
As a second contribution, we show that, under appropriate assumptions, a SAGE-based policy-gradient method converges to an optimal policy with high probability, provided it starts sufficiently close to one, even with a nonconvex objective function and multiple maximizers.