Search Results for author: Maryam Hashemzadeh

Found 4 papers, 0 papers with code

Sub-goal Distillation: A Method to Improve Small Language Agents

no code implementations4 May 2024 Maryam Hashemzadeh, Elias Stengel-Eskin, Sarath Chandar, Marc-Alexandre Cote

While Large Language Models (LLMs) have demonstrated significant promise as agents in interactive tasks, their substantial computational requirements and restricted number of calls constrain their practical utility, especially in long-horizon interactive tasks such as decision-making or in scenarios involving continuous ongoing tasks.

Offline-Online Reinforcement Learning: Extending Batch and Online RL

no code implementations29 Sep 2021 Maryam Hashemzadeh, Wesley Chung, Martha White

To enable better performance, we investigate the offline-online setting: The agent has access to a batch of data to train on but is also allowed to learn during the evaluation phase in an online manner.

reinforcement-learning Reinforcement Learning (RL)

Exploiting generalization in the subspaces for faster model-based learning

no code implementations22 Oct 2017 Maryam Hashemzadeh, Reshad Hosseini, Majid Nili Ahmadabadi

Generalization and faster learning in a subspace are due to many-to-one mapping of experiences from the full-space to each state in the subspace.

Decision Making Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.