Alert button
Picture for Sandy H. Huang

Sandy H. Huang

Alert button

Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots

Add code
Bookmark button
Alert button
Dec 18, 2023
Thomas Lampe, Abbas Abdolmaleki, Sarah Bechtle, Sandy H. Huang, Jost Tobias Springenberg, Michael Bloesch, Oliver Groth, Roland Hafner, Tim Hertweck, Michael Neunert, Markus Wulfmeier, Jingwei Zhang, Francesco Nori, Nicolas Heess, Martin Riedmiller

Viaarxiv icon

Coherent Soft Imitation Learning

Add code
Bookmark button
Alert button
May 29, 2023
Joe Watson, Sandy H. Huang, Nicolas Heess

Figure 1 for Coherent Soft Imitation Learning
Figure 2 for Coherent Soft Imitation Learning
Figure 3 for Coherent Soft Imitation Learning
Figure 4 for Coherent Soft Imitation Learning
Viaarxiv icon

Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 26, 2023
Tuomas Haarnoja, Ben Moran, Guy Lever, Sandy H. Huang, Dhruva Tirumala, Markus Wulfmeier, Jan Humplik, Saran Tunyasuvunakool, Noah Y. Siegel, Roland Hafner, Michael Bloesch, Kristian Hartikainen, Arunkumar Byravan, Leonard Hasenclever, Yuval Tassa, Fereshteh Sadeghi, Nathan Batchelor, Federico Casarini, Stefano Saliceti, Charles Game, Neil Sreendra, Kushal Patel, Marlon Gwira, Andrea Huber, Nicole Hurley, Francesco Nori, Raia Hadsell, Nicolas Heess

Figure 1 for Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning
Figure 2 for Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning
Figure 3 for Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning
Figure 4 for Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning
Viaarxiv icon

On Multi-objective Policy Optimization as a Tool for Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 15, 2021
Abbas Abdolmaleki, Sandy H. Huang, Giulia Vezzani, Bobak Shahriari, Jost Tobias Springenberg, Shruti Mishra, Dhruva TB, Arunkumar Byravan, Konstantinos Bousmalis, Andras Gyorgy, Csaba Szepesvari, Raia Hadsell, Nicolas Heess, Martin Riedmiller

Figure 1 for On Multi-objective Policy Optimization as a Tool for Reinforcement Learning
Figure 2 for On Multi-objective Policy Optimization as a Tool for Reinforcement Learning
Figure 3 for On Multi-objective Policy Optimization as a Tool for Reinforcement Learning
Figure 4 for On Multi-objective Policy Optimization as a Tool for Reinforcement Learning
Viaarxiv icon

A Distributional View on Multi-Objective Policy Optimization

Add code
Bookmark button
Alert button
May 15, 2020
Abbas Abdolmaleki, Sandy H. Huang, Leonard Hasenclever, Michael Neunert, H. Francis Song, Martina Zambelli, Murilo F. Martins, Nicolas Heess, Raia Hadsell, Martin Riedmiller

Figure 1 for A Distributional View on Multi-Objective Policy Optimization
Figure 2 for A Distributional View on Multi-Objective Policy Optimization
Figure 3 for A Distributional View on Multi-Objective Policy Optimization
Figure 4 for A Distributional View on Multi-Objective Policy Optimization
Viaarxiv icon

Nonverbal Robot Feedback for Human Teachers

Add code
Bookmark button
Alert button
Nov 06, 2019
Sandy H. Huang, Isabella Huang, Ravi Pandya, Anca D. Dragan

Figure 1 for Nonverbal Robot Feedback for Human Teachers
Figure 2 for Nonverbal Robot Feedback for Human Teachers
Figure 3 for Nonverbal Robot Feedback for Human Teachers
Figure 4 for Nonverbal Robot Feedback for Human Teachers
Viaarxiv icon

Learning Gentle Object Manipulation with Curiosity-Driven Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 20, 2019
Sandy H. Huang, Martina Zambelli, Jackie Kay, Murilo F. Martins, Yuval Tassa, Patrick M. Pilarski, Raia Hadsell

Figure 1 for Learning Gentle Object Manipulation with Curiosity-Driven Deep Reinforcement Learning
Figure 2 for Learning Gentle Object Manipulation with Curiosity-Driven Deep Reinforcement Learning
Figure 3 for Learning Gentle Object Manipulation with Curiosity-Driven Deep Reinforcement Learning
Figure 4 for Learning Gentle Object Manipulation with Curiosity-Driven Deep Reinforcement Learning
Viaarxiv icon

Human-AI Learning Performance in Multi-Armed Bandits

Add code
Bookmark button
Alert button
Dec 21, 2018
Ravi Pandya, Sandy H. Huang, Dylan Hadfield-Menell, Anca D. Dragan

Figure 1 for Human-AI Learning Performance in Multi-Armed Bandits
Figure 2 for Human-AI Learning Performance in Multi-Armed Bandits
Figure 3 for Human-AI Learning Performance in Multi-Armed Bandits
Figure 4 for Human-AI Learning Performance in Multi-Armed Bandits
Viaarxiv icon

Enabling Robots to Communicate their Objectives

Add code
Bookmark button
Alert button
Oct 18, 2018
Sandy H. Huang, David Held, Pieter Abbeel, Anca D. Dragan

Figure 1 for Enabling Robots to Communicate their Objectives
Figure 2 for Enabling Robots to Communicate their Objectives
Figure 3 for Enabling Robots to Communicate their Objectives
Figure 4 for Enabling Robots to Communicate their Objectives
Viaarxiv icon