Search Results for author: Mehul Damani

Found 5 papers, 1 papers with code

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

no code implementations • 27 Jul 2023 • Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, Jérémy Scheurer, Javier Rando, Rachel Freedman, Tomasz Korbak, David Lindner, Pedro Freire, Tony Wang, Samuel Marks, Charbel-Raphaël Segerie, Micah Carroll, Andi Peng, Phillip Christoffersen, Mehul Damani, Stewart Slocum, Usman Anwar, Anand Siththaranjan, Max Nadeau, Eric J. Michaud, Jacob Pfau, Dmitrii Krasheninnikov, Xin Chen, Lauro Langosco, Peter Hase, Erdem Biyik, Anca Dragan, David Krueger, Dorsa Sadigh, Dylan Hadfield-Menell

Reinforcement learning from human feedback (RLHF) is a technique for training AI systems to align with human goals.

reinforcement-learning

Paper
Add Code

SocialLight: Distributed Cooperation Learning towards Network-Wide Traffic Signal Control

no code implementations • 20 Apr 2023 • Harsh Goel, Yifeng Zhang, Mehul Damani, Guillaume Sartoretti

To address these problems, we propose a new MARL method for traffic signal control, SocialLight, which learns cooperative traffic control policies by distributedly estimating the individual marginal contribution of agents on their local neighborhood.

counterfactual Counterfactual Reasoning +1

Paper
Add Code

Formal Contracts Mitigate Social Dilemmas in Multi-Agent RL

1 code implementation • 22 Aug 2022 • Andreas A. Haupt, Phillip J. K. Christoffersen, Mehul Damani, Dylan Hadfield-Menell

In this work, we draw upon the idea of formal contracting from economics to overcome diverging incentives between agents in MARL.

Management Multi-agent Reinforcement Learning +2

Paper
Code

Distributed Reinforcement Learning for Robot Teams: A Review

no code implementations • 7 Apr 2022 • Yutong Wang, Mehul Damani, Pamela Wang, Yuhong Cao, Guillaume Sartoretti

This review aims to provide an analysis of the state-of-the-art in distributed MARL for multi-robot cooperation.

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Add Code

Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World

no code implementations • 30 Mar 2021 • Florian Laurent, Manuel Schneider, Christian Scheller, Jeremy Watson, Jiaoyang Li, Zhe Chen, Yi Zheng, Shao-Hung Chan, Konstantin Makhnev, Oleg Svidchenko, Vladimir Egorov, Dmitry Ivanov, Aleksei Shpilman, Evgenija Spirovska, Oliver Tanevski, Aleksandar Nikov, Ramon Grunder, David Galevski, Jakov Mitrovski, Guillaume Sartoretti, Zhiyao Luo, Mehul Damani, Nilabha Bhattacharya, Shivam Agarwal, Adrian Egli, Erik Nygren, Sharada Mohanty

However, the coordination of hundreds of agents in a real-life setting like a railway network remains challenging and the Flatland environment used for the competition models these real-world properties in a simplified manner.

Multi-agent Reinforcement Learning Scheduling

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.