Behavioural cloning

11 papers with code • 0 benchmarks • 2 datasets

This task has no description! Would you like to contribute one?

Libraries

Use these libraries to find Behavioural cloning models and implementations

Latest papers with no code

Model-based trajectory stitching for improved behavioural cloning and its applications

no code yet • 8 Dec 2022

Furthermore, using the D4RL benchmarking suite, we demonstrate that state-of-the-art results are obtained by combining TS with two existing offline learning methodologies reliant on BC, model-based offline planning (MBOP) and policy constraint (TD3+BC).

Model-based Trajectory Stitching for Improved Offline Reinforcement Learning

no code yet • 21 Nov 2022

We propose a model-based data augmentation strategy, Trajectory Stitching (TS), to improve the quality of sub-optimal historical trajectories.

Improving TD3-BC: Relaxed Policy Constraint for Offline Learning and Stable Online Fine-Tuning

no code yet • 21 Nov 2022

The ability to discover optimal behaviour from fixed data sets has the potential to transfer the successes of reinforcement learning (RL) to domains where data collection is acutely problematic.

Information-Theoretic Policy Learning from Partial Observations with Fully Informed Decision Makers

no code yet • 4 Apr 2022

In this work we formulate and treat an extension of the Imitation from Observations problem.

Improving Behavioural Cloning with Human-Driven Dynamic Dataset Augmentation

no code yet • 19 Jan 2022

Behavioural cloning has been extensively used to train agents and is recognized as a fast and solid approach to teach general behaviours based on expert trajectories.

Learning Cooperation and Online Planning Through Simulation and Graph Convolutional Network

no code yet • 16 Oct 2021

Against this background, we introduce a simulation based online planning algorithm, that we call SiCLOP, for multi-agent cooperative environments.

Learning to Classify and Imitate Trading Agents in Continuous Double Auction Markets

no code yet • 4 Oct 2021

Continuous double auctions such as the limit order book employed by exchanges are widely used in practice to match buyers and sellers of a variety of financial instruments.

On Lottery Tickets and Minimal Task Representations in Deep Reinforcement Learning

no code yet • ICLR 2022

But how is the performance of winning lottery tickets affected by the distributional shift inherent to reinforcement learning problems?

Semi-supervised reward learning for offline reinforcement learning

no code yet • 12 Dec 2020

In offline reinforcement learning (RL) agents are trained using a logged dataset.

Offline Reinforcement Learning Hands-On

no code yet • 29 Nov 2020

Offline Reinforcement Learning (RL) aims to turn large datasets into powerful decision-making engines without any online interactions with the environment.