Picture for Ashwin Balakrishna

Ashwin Balakrishna

Monte Carlo Augmented Actor-Critic for Sparse Reward Deep Reinforcement Learning from Suboptimal Demonstrations

Add code
Oct 14, 2022
Figure 1 for Monte Carlo Augmented Actor-Critic for Sparse Reward Deep Reinforcement Learning from Suboptimal Demonstrations
Figure 2 for Monte Carlo Augmented Actor-Critic for Sparse Reward Deep Reinforcement Learning from Suboptimal Demonstrations
Figure 3 for Monte Carlo Augmented Actor-Critic for Sparse Reward Deep Reinforcement Learning from Suboptimal Demonstrations
Figure 4 for Monte Carlo Augmented Actor-Critic for Sparse Reward Deep Reinforcement Learning from Suboptimal Demonstrations
Viaarxiv icon

Learning Switching Criteria for Sim2Real Transfer of Robotic Fabric Manipulation Policies

Add code
Jul 02, 2022
Figure 1 for Learning Switching Criteria for Sim2Real Transfer of Robotic Fabric Manipulation Policies
Figure 2 for Learning Switching Criteria for Sim2Real Transfer of Robotic Fabric Manipulation Policies
Figure 3 for Learning Switching Criteria for Sim2Real Transfer of Robotic Fabric Manipulation Policies
Figure 4 for Learning Switching Criteria for Sim2Real Transfer of Robotic Fabric Manipulation Policies
Viaarxiv icon

Dynamics-Aware Comparison of Learned Reward Functions

Add code
Jan 25, 2022
Figure 1 for Dynamics-Aware Comparison of Learned Reward Functions
Figure 2 for Dynamics-Aware Comparison of Learned Reward Functions
Figure 3 for Dynamics-Aware Comparison of Learned Reward Functions
Figure 4 for Dynamics-Aware Comparison of Learned Reward Functions
Viaarxiv icon

MESA: Offline Meta-RL for Safe Adaptation and Fault Tolerance

Add code
Dec 07, 2021
Figure 1 for MESA: Offline Meta-RL for Safe Adaptation and Fault Tolerance
Figure 2 for MESA: Offline Meta-RL for Safe Adaptation and Fault Tolerance
Figure 3 for MESA: Offline Meta-RL for Safe Adaptation and Fault Tolerance
Figure 4 for MESA: Offline Meta-RL for Safe Adaptation and Fault Tolerance
Viaarxiv icon

LEGS: Learning Efficient Grasp Sets for Exploratory Grasping

Add code
Nov 29, 2021
Figure 1 for LEGS: Learning Efficient Grasp Sets for Exploratory Grasping
Figure 2 for LEGS: Learning Efficient Grasp Sets for Exploratory Grasping
Figure 3 for LEGS: Learning Efficient Grasp Sets for Exploratory Grasping
Figure 4 for LEGS: Learning Efficient Grasp Sets for Exploratory Grasping
Viaarxiv icon

ThriftyDAgger: Budget-Aware Novelty and Risk Gating for Interactive Imitation Learning

Add code
Sep 17, 2021
Figure 1 for ThriftyDAgger: Budget-Aware Novelty and Risk Gating for Interactive Imitation Learning
Figure 2 for ThriftyDAgger: Budget-Aware Novelty and Risk Gating for Interactive Imitation Learning
Figure 3 for ThriftyDAgger: Budget-Aware Novelty and Risk Gating for Interactive Imitation Learning
Figure 4 for ThriftyDAgger: Budget-Aware Novelty and Risk Gating for Interactive Imitation Learning
Viaarxiv icon

Kit-Net: Self-Supervised Learning to Kit Novel 3D Objects into Novel 3D Cavities

Add code
Jul 13, 2021
Figure 1 for Kit-Net: Self-Supervised Learning to Kit Novel 3D Objects into Novel 3D Cavities
Figure 2 for Kit-Net: Self-Supervised Learning to Kit Novel 3D Objects into Novel 3D Cavities
Figure 3 for Kit-Net: Self-Supervised Learning to Kit Novel 3D Objects into Novel 3D Cavities
Figure 4 for Kit-Net: Self-Supervised Learning to Kit Novel 3D Objects into Novel 3D Cavities
Viaarxiv icon

LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of Iterative Tasks

Add code
Jul 10, 2021
Figure 1 for LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of Iterative Tasks
Figure 2 for LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of Iterative Tasks
Figure 3 for LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of Iterative Tasks
Figure 4 for LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of Iterative Tasks
Viaarxiv icon

Untangling Dense Non-Planar Knots by Learning Manipulation Features and Recovery Policies

Add code
Jun 29, 2021
Figure 1 for Untangling Dense Non-Planar Knots by Learning Manipulation Features and Recovery Policies
Figure 2 for Untangling Dense Non-Planar Knots by Learning Manipulation Features and Recovery Policies
Figure 3 for Untangling Dense Non-Planar Knots by Learning Manipulation Features and Recovery Policies
Figure 4 for Untangling Dense Non-Planar Knots by Learning Manipulation Features and Recovery Policies
Viaarxiv icon

Policy Gradient Bayesian Robust Optimization for Imitation Learning

Add code
Jun 21, 2021
Figure 1 for Policy Gradient Bayesian Robust Optimization for Imitation Learning
Figure 2 for Policy Gradient Bayesian Robust Optimization for Imitation Learning
Figure 3 for Policy Gradient Bayesian Robust Optimization for Imitation Learning
Figure 4 for Policy Gradient Bayesian Robust Optimization for Imitation Learning
Viaarxiv icon