no code implementations • 4 Dec 2023 • Sidney Tio, Jimmy Ho, Pradeep Varakantham
We adapt Parameterized Environment Response Model (PERM), a method for training both Reinforcement Learning (RL) Agents and human learners in parameterized environments by directly modeling difficulty and ability.