Alert button
Picture for Peter Stone

Peter Stone

Alert button

Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning

Add code
Bookmark button
Alert button
May 06, 2024
Caleb Chuck, Carl Qi, Michael J. Munje, Shuozhe Li, Max Rudolph, Chang Shi, Siddhant Agarwal, Harshit Sikchi, Abhinav Peri, Sarthak Dayal, Evan Kuo, Kavan Mehta, Anthony Wang, Peter Stone, Amy Zhang, Scott Niekum

Viaarxiv icon

N-Agent Ad Hoc Teamwork

Add code
Bookmark button
Alert button
Apr 16, 2024
Caroline Wang, Arrasy Rahman, Ishan Durugkar, Elad Liebman, Peter Stone

Figure 1 for N-Agent Ad Hoc Teamwork
Figure 2 for N-Agent Ad Hoc Teamwork
Figure 3 for N-Agent Ad Hoc Teamwork
Figure 4 for N-Agent Ad Hoc Teamwork
Viaarxiv icon

Dyna-LfLH: Learning Agile Navigation in Dynamic Environments from Learned Hallucination

Add code
Bookmark button
Alert button
Mar 25, 2024
Saad Abdul Ghani, Zizhao Wang, Peter Stone, Xuesu Xiao

Figure 1 for Dyna-LfLH: Learning Agile Navigation in Dynamic Environments from Learned Hallucination
Figure 2 for Dyna-LfLH: Learning Agile Navigation in Dynamic Environments from Learned Hallucination
Figure 3 for Dyna-LfLH: Learning Agile Navigation in Dynamic Environments from Learned Hallucination
Figure 4 for Dyna-LfLH: Learning Agile Navigation in Dynamic Environments from Learned Hallucination
Viaarxiv icon

Multistep Inverse Is Not All You Need

Add code
Bookmark button
Alert button
Mar 18, 2024
Alexander Levine, Peter Stone, Amy Zhang

Figure 1 for Multistep Inverse Is Not All You Need
Figure 2 for Multistep Inverse Is Not All You Need
Figure 3 for Multistep Inverse Is Not All You Need
Figure 4 for Multistep Inverse Is Not All You Need
Viaarxiv icon

TeleMoMa: A Modular and Versatile Teleoperation System for Mobile Manipulation

Add code
Bookmark button
Alert button
Mar 12, 2024
Shivin Dass, Wensi Ai, Yuqian Jiang, Samik Singh, Jiaheng Hu, Ruohan Zhang, Peter Stone, Ben Abbatematteo, Roberto Martin-Martin

Figure 1 for TeleMoMa: A Modular and Versatile Teleoperation System for Mobile Manipulation
Figure 2 for TeleMoMa: A Modular and Versatile Teleoperation System for Mobile Manipulation
Figure 3 for TeleMoMa: A Modular and Versatile Teleoperation System for Mobile Manipulation
Figure 4 for TeleMoMa: A Modular and Versatile Teleoperation System for Mobile Manipulation
Viaarxiv icon

Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks

Add code
Bookmark button
Alert button
Mar 06, 2024
Ziping Xu, Zifan Xu, Runxuan Jiang, Peter Stone, Ambuj Tewari

Figure 1 for Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks
Figure 2 for Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks
Figure 3 for Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks
Figure 4 for Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks
Viaarxiv icon

Dexterous Legged Locomotion in Confined 3D Spaces with Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 06, 2024
Zifan Xu, Amir Hossain Raj, Xuesu Xiao, Peter Stone

Figure 1 for Dexterous Legged Locomotion in Confined 3D Spaces with Reinforcement Learning
Figure 2 for Dexterous Legged Locomotion in Confined 3D Spaces with Reinforcement Learning
Figure 3 for Dexterous Legged Locomotion in Confined 3D Spaces with Reinforcement Learning
Figure 4 for Dexterous Legged Locomotion in Confined 3D Spaces with Reinforcement Learning
Viaarxiv icon

Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning

Add code
Bookmark button
Alert button
Jan 23, 2024
Zizhao Wang, Caroline Wang, Xuesu Xiao, Yuke Zhu, Peter Stone

Viaarxiv icon

t-DGR: A Trajectory-Based Deep Generative Replay Method for Continual Learning in Decision Making

Add code
Bookmark button
Alert button
Jan 04, 2024
William Yue, Bo Liu, Peter Stone

Viaarxiv icon

Latent Skill Discovery for Chain-of-Thought Reasoning

Add code
Bookmark button
Alert button
Dec 07, 2023
Zifan Xu, Haozhu Wang, Dmitriy Bespalov, Peter Stone, Yanjun Qi

Figure 1 for Latent Skill Discovery for Chain-of-Thought Reasoning
Figure 2 for Latent Skill Discovery for Chain-of-Thought Reasoning
Figure 3 for Latent Skill Discovery for Chain-of-Thought Reasoning
Figure 4 for Latent Skill Discovery for Chain-of-Thought Reasoning
Viaarxiv icon