no code implementations • 9 Dec 2019 • Ujwal Padam Tewari, Vishal Bidawatka, Varsha Raveendran, Vinay Sudhakaran, Shreedhar Kodate Shreeshail, Jayanth Prakash Kulkarni
We use Asynchronous Advantage Actor Critic (A3C) for implementing an AI agent in the controllers that optimize flow of traffic across a single intersection and then extend it to multiple intersections by considering a multi-agent setting.
Multi-agent Reinforcement Learning reinforcement-learning +1