no code implementations • ICLR 2020 • Christof Angermueller, David Dohan, David Belanger, Ramya Deshpande, Kevin Murphy, Lucy Colwell
In response, we propose using reinforcement learning (RL) based on proximal-policy optimization (PPO) for biological sequence design.
Model-based Reinforcement Learning reinforcement-learning +1