A deep reinforcement learning (DRL) agent observes its states through observations, which may contain natural measurement errors or adversarial noises. Since the observations deviate from the true states, they can mislead the agent into making suboptimal actions... (read more)
PDF Abstract NeurIPS 2020 PDF NeurIPS 2020 Abstract