Udari Madhushani

O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models

Oct 22, 2023
Yuchen Xiao, Yanchao Sun, Mengda Xu, Udari Madhushani, Jared Vann, Deepeka Garg, Sumitra Ganesh

Via arXiv

Heterogeneous Social Value Orientation Leads to Meaningful Diversity in Sequential Social Dilemmas

May 01, 2023
Udari Madhushani, Kevin R. McKee, John P. Agapiou, Joel Z. Leibo, Richard Everett, Thomas Anthony, Edward Hughes, Karl Tuyls, Edgar A. Duéñez-Guzmán

Via arXiv

Melting Pot 2.0

Dec 13, 2022
John P. Agapiou, Alexander Sasha Vezhnevets, Edgar A. Duéñez-Guzmán, Jayd Matyas, Yiran Mao, Peter Sunehag, Raphael Köster, Udari Madhushani, Kavya Kopparapu, Ramona Comanescu, DJ Strouse, Michael B. Johanson, Sukhdeep Singh, Julia Haas, Igor Mordatch, Dean Mobbs, Joel Z. Leibo

Via arXiv

A Regret Minimization Approach to Multi-Agent Control

Feb 01, 2022
Udaya Ghai, Udari Madhushani, Naomi Leonard, Elad Hazan

Via arXiv

One More Step Towards Reality: Cooperative Bandits with Imperfect Communication

Nov 24, 2021
Udari Madhushani, Abhimanyu Dubey, Naomi Ehrich Leonard, Alex Pentland

Via arXiv

Provably Efficient Multi-Agent Reinforcement Learning with Fully Decentralized Communication

Oct 14, 2021
Justin Lidard, Udari Madhushani, Naomi Ehrich Leonard

Via arXiv

When to Call Your Neighbor? Strategic Communication in Cooperative Stochastic Bandits

Oct 08, 2021
Udari Madhushani, Naomi Leonard

Via arXiv

Hamiltonian Q-Learning: Leveraging Importance-sampling for Data Efficient RL

Dec 06, 2020
Udari Madhushani, Biswadip Dey, Naomi Ehrich Leonard, Amit Chakraborty

Via arXiv

Distributed Bandits: Probabilistic Communication on $d$-regular Graphs

Nov 16, 2020
Udari Madhushani, Naomi Ehrich Leonard

Via arXiv