Data aggregation techniques can significantly improve vision-based policy learning within a training environment, e.g., learning to drive in a specific simulation condition. However, as on-policy data is sequentially sampled and added in an iterative manner, the policy can specialize and overfit to the training conditions... (read more)
PDF AbstractMETHOD | TYPE | |
---|---|---|
![]() |
Video Game Models |