Ofir Nachum

Group-based Fair Learning Leads to Counter-intuitive Predictions

Oct 04, 2019
Ofir Nachum, Heinrich Jiang

Why Does Hierarchy (Sometimes) Work So Well in Reinforcement Learning?

Sep 23, 2019
Ofir Nachum, Haoran Tang, Xingyu Lu, Shixiang Gu, Honglak Lee, Sergey Levine

Multi-Agent Manipulation via Locomotion using Hierarchical Sim2Real

Aug 13, 2019
Ofir Nachum, Michael Ahn, Hugo Ponte, Shixiang Gu, Vikash Kumar

DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections

Jun 10, 2019
Ofir Nachum, Yinlam Chow, Bo Dai, Lihong Li

DeepMDP: Learning Continuous Latent Space Models for Representation Learning

Jun 06, 2019
Carles Gelada, Saurabh Kumar, Jacob Buckman, Ofir Nachum, Marc G. Bellemare

Lyapunov-based Safe Policy Optimization for Continuous Control

Jan 28, 2019
Yinlam Chow, Ofir Nachum, Aleksandra Faust, Mohammad Ghavamzadeh, Edgar Duenez-Guzman

Identifying and Correcting Label Bias in Machine Learning

Jan 15, 2019
Heinrich Jiang, Ofir Nachum

The Laplacian in RL: Learning Representations with Efficient Approximations

Oct 10, 2018
Yifan Wu, George Tucker, Ofir Nachum

Data-Efficient Hierarchical Reinforcement Learning

Oct 05, 2018
Ofir Nachum, Shixiang Gu, Honglak Lee, Sergey Levine

Near-Optimal Representation Learning for Hierarchical Reinforcement Learning

Oct 02, 2018
Ofir Nachum, Shixiang Gu, Honglak Lee, Sergey Levine
