Alert button
Picture for Mehul Damani

Mehul Damani

Alert button

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Add code
Bookmark button
Alert button
Jul 27, 2023
Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, Jérémy Scheurer, Javier Rando, Rachel Freedman, Tomasz Korbak, David Lindner, Pedro Freire, Tony Wang, Samuel Marks, Charbel-Raphaël Segerie, Micah Carroll, Andi Peng, Phillip Christoffersen, Mehul Damani, Stewart Slocum, Usman Anwar, Anand Siththaranjan, Max Nadeau, Eric J. Michaud, Jacob Pfau, Dmitrii Krasheninnikov, Xin Chen, Lauro Langosco, Peter Hase, Erdem Bıyık, Anca Dragan, David Krueger, Dorsa Sadigh, Dylan Hadfield-Menell

Figure 1 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 2 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 3 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 4 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Viaarxiv icon

Distributed Reinforcement Learning for Robot Teams: A Review

Add code
Bookmark button
Alert button
Apr 07, 2022
Yutong Wang, Mehul Damani, Pamela Wang, Yuhong Cao, Guillaume Sartoretti

Figure 1 for Distributed Reinforcement Learning for Robot Teams: A Review
Figure 2 for Distributed Reinforcement Learning for Robot Teams: A Review
Figure 3 for Distributed Reinforcement Learning for Robot Teams: A Review
Figure 4 for Distributed Reinforcement Learning for Robot Teams: A Review
Viaarxiv icon

Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World

Add code
Bookmark button
Alert button
Mar 30, 2021
Florian Laurent, Manuel Schneider, Christian Scheller, Jeremy Watson, Jiaoyang Li, Zhe Chen, Yi Zheng, Shao-Hung Chan, Konstantin Makhnev, Oleg Svidchenko, Vladimir Egorov, Dmitry Ivanov, Aleksei Shpilman, Evgenija Spirovska, Oliver Tanevski, Aleksandar Nikov, Ramon Grunder, David Galevski, Jakov Mitrovski, Guillaume Sartoretti, Zhiyao Luo, Mehul Damani, Nilabha Bhattacharya, Shivam Agarwal, Adrian Egli, Erik Nygren, Sharada Mohanty

Figure 1 for Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World
Figure 2 for Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World
Figure 3 for Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World
Figure 4 for Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World
Viaarxiv icon

PRIMAL2: Pathfinding via Reinforcement and Imitation Multi-Agent Learning -- Lifelong

Add code
Bookmark button
Alert button
Oct 16, 2020
Mehul Damani, Zhiyao Luo, Emerson Wenzel, Guillaume Sartoretti

Figure 1 for PRIMAL2: Pathfinding via Reinforcement and Imitation Multi-Agent Learning -- Lifelong
Figure 2 for PRIMAL2: Pathfinding via Reinforcement and Imitation Multi-Agent Learning -- Lifelong
Figure 3 for PRIMAL2: Pathfinding via Reinforcement and Imitation Multi-Agent Learning -- Lifelong
Figure 4 for PRIMAL2: Pathfinding via Reinforcement and Imitation Multi-Agent Learning -- Lifelong
Viaarxiv icon