no code implementations • 15 Aug 2023 • William Ahlberg, Alessandro Sestini, Konrad Tollmar, Linus Gisslén
MultiGAIL is based on generative adversarial imitation learning and uses multiple discriminators as reward models, inferring the environment reward by comparing the agent and distinct expert policies.