Effectively leveraging large, previously collected datasets in reinforcement learning (RL) is a key challenge for large-scale real-world applications. Offline RL algorithms promise to learn effective policies from previously collected, static datasets without further interaction...
NeurIPS 2020