Picture for Ofir Nachum

Ofir Nachum

Tony

Multi-Agent Manipulation via Locomotion using Hierarchical Sim2Real

Add code
Aug 13, 2019
Figure 1 for Multi-Agent Manipulation via Locomotion using Hierarchical Sim2Real
Figure 2 for Multi-Agent Manipulation via Locomotion using Hierarchical Sim2Real
Figure 3 for Multi-Agent Manipulation via Locomotion using Hierarchical Sim2Real
Figure 4 for Multi-Agent Manipulation via Locomotion using Hierarchical Sim2Real
Viaarxiv icon

DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections

Add code
Jun 10, 2019
Figure 1 for DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections
Figure 2 for DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections
Figure 3 for DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections
Figure 4 for DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections
Viaarxiv icon

DeepMDP: Learning Continuous Latent Space Models for Representation Learning

Add code
Jun 06, 2019
Figure 1 for DeepMDP: Learning Continuous Latent Space Models for Representation Learning
Figure 2 for DeepMDP: Learning Continuous Latent Space Models for Representation Learning
Figure 3 for DeepMDP: Learning Continuous Latent Space Models for Representation Learning
Figure 4 for DeepMDP: Learning Continuous Latent Space Models for Representation Learning
Viaarxiv icon

Lyapunov-based Safe Policy Optimization for Continuous Control

Add code
Jan 28, 2019
Figure 1 for Lyapunov-based Safe Policy Optimization for Continuous Control
Figure 2 for Lyapunov-based Safe Policy Optimization for Continuous Control
Figure 3 for Lyapunov-based Safe Policy Optimization for Continuous Control
Figure 4 for Lyapunov-based Safe Policy Optimization for Continuous Control
Viaarxiv icon

Identifying and Correcting Label Bias in Machine Learning

Add code
Jan 15, 2019
Figure 1 for Identifying and Correcting Label Bias in Machine Learning
Figure 2 for Identifying and Correcting Label Bias in Machine Learning
Figure 3 for Identifying and Correcting Label Bias in Machine Learning
Figure 4 for Identifying and Correcting Label Bias in Machine Learning
Viaarxiv icon

The Laplacian in RL: Learning Representations with Efficient Approximations

Add code
Oct 10, 2018
Figure 1 for The Laplacian in RL: Learning Representations with Efficient Approximations
Figure 2 for The Laplacian in RL: Learning Representations with Efficient Approximations
Figure 3 for The Laplacian in RL: Learning Representations with Efficient Approximations
Figure 4 for The Laplacian in RL: Learning Representations with Efficient Approximations
Viaarxiv icon

Data-Efficient Hierarchical Reinforcement Learning

Add code
Oct 05, 2018
Figure 1 for Data-Efficient Hierarchical Reinforcement Learning
Figure 2 for Data-Efficient Hierarchical Reinforcement Learning
Figure 3 for Data-Efficient Hierarchical Reinforcement Learning
Figure 4 for Data-Efficient Hierarchical Reinforcement Learning
Viaarxiv icon

Near-Optimal Representation Learning for Hierarchical Reinforcement Learning

Add code
Oct 02, 2018
Figure 1 for Near-Optimal Representation Learning for Hierarchical Reinforcement Learning
Figure 2 for Near-Optimal Representation Learning for Hierarchical Reinforcement Learning
Figure 3 for Near-Optimal Representation Learning for Hierarchical Reinforcement Learning
Figure 4 for Near-Optimal Representation Learning for Hierarchical Reinforcement Learning
Viaarxiv icon

Smoothed Action Value Functions for Learning Gaussian Policies

Add code
Jul 25, 2018
Figure 1 for Smoothed Action Value Functions for Learning Gaussian Policies
Viaarxiv icon

A Lyapunov-based Approach to Safe Reinforcement Learning

Add code
May 20, 2018
Figure 1 for A Lyapunov-based Approach to Safe Reinforcement Learning
Figure 2 for A Lyapunov-based Approach to Safe Reinforcement Learning
Viaarxiv icon