Search Results for author: Calarina Muslimani

Found 2 papers, 0 papers with code

Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning

no code implementations • 30 Apr 2024 • Calarina Muslimani, Matthew E. Taylor

To improve the feedback efficiency of HitL RL methods (i. e., require less feedback), this paper introduces Sub-optimal Data Pre-training, SDP, an approach that leverages reward-free, sub-optimal data to improve scalar- and preference-based HitL RL algorithms.

Paper
Add Code

Reinforcement Teaching

no code implementations • 25 Apr 2022 • Alex Lewandowski, Calarina Muslimani, Dale Schuurmans, Matthew E. Taylor, Jun Luo

To effectively learn such a teaching policy, we introduce a parametric-behavior embedder that learns a representation of the student's learnable parameters from its input/output behavior.

Meta-Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.