Alert button
Picture for Himanshu Sahni

Himanshu Sahni

Alert button

Vision-Language Models as a Source of Rewards

Add code
Bookmark button
Alert button
Dec 14, 2023
Kate Baumli, Satinder Baveja, Feryal Behbahani, Harris Chan, Gheorghe Comanici, Sebastian Flennerhag, Maxime Gazeau, Kristian Holsheimer, Dan Horgan, Michael Laskin, Clare Lyle, Hussain Masoom, Kay McKinney, Volodymyr Mnih, Alexander Neitz, Fabio Pardo, Jack Parker-Holder, John Quan, Tim Rocktäschel, Himanshu Sahni, Tom Schaul, Yannick Schroecker, Stephen Spencer, Richie Steigerwald, Luyu Wang, Lei Zhang

Viaarxiv icon

In-context Reinforcement Learning with Algorithm Distillation

Add code
Bookmark button
Alert button
Oct 25, 2022
Michael Laskin, Luyu Wang, Junhyuk Oh, Emilio Parisotto, Stephen Spencer, Richie Steigerwald, DJ Strouse, Steven Hansen, Angelos Filos, Ethan Brooks, Maxime Gazeau, Himanshu Sahni, Satinder Singh, Volodymyr Mnih

Figure 1 for In-context Reinforcement Learning with Algorithm Distillation
Figure 2 for In-context Reinforcement Learning with Algorithm Distillation
Figure 3 for In-context Reinforcement Learning with Algorithm Distillation
Figure 4 for In-context Reinforcement Learning with Algorithm Distillation
Viaarxiv icon

Hard Attention Control By Mutual Information Maximization

Add code
Bookmark button
Alert button
Mar 10, 2021
Himanshu Sahni, Charles Isbell

Figure 1 for Hard Attention Control By Mutual Information Maximization
Figure 2 for Hard Attention Control By Mutual Information Maximization
Figure 3 for Hard Attention Control By Mutual Information Maximization
Figure 4 for Hard Attention Control By Mutual Information Maximization
Viaarxiv icon

Estimating Q(s,s') with Deep Deterministic Dynamics Gradients

Add code
Bookmark button
Alert button
Feb 21, 2020
Ashley D. Edwards, Himanshu Sahni, Rosanne Liu, Jane Hung, Ankit Jain, Rui Wang, Adrien Ecoffet, Thomas Miconi, Charles Isbell, Jason Yosinski

Figure 1 for Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
Figure 2 for Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
Figure 3 for Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
Figure 4 for Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
Viaarxiv icon

Visual Hindsight Experience Replay

Add code
Bookmark button
Alert button
Jan 31, 2019
Himanshu Sahni, Toby Buckley, Pieter Abbeel, Ilya Kuzovkin

Figure 1 for Visual Hindsight Experience Replay
Figure 2 for Visual Hindsight Experience Replay
Figure 3 for Visual Hindsight Experience Replay
Figure 4 for Visual Hindsight Experience Replay
Viaarxiv icon

Imitating Latent Policies from Observation

Add code
Bookmark button
Alert button
May 24, 2018
Ashley D. Edwards, Himanshu Sahni, Yannick Schroecker, Charles L. Isbell

Figure 1 for Imitating Latent Policies from Observation
Figure 2 for Imitating Latent Policies from Observation
Figure 3 for Imitating Latent Policies from Observation
Figure 4 for Imitating Latent Policies from Observation
Viaarxiv icon

Learning to Compose Skills

Add code
Bookmark button
Alert button
Nov 30, 2017
Himanshu Sahni, Saurabh Kumar, Farhan Tejani, Charles Isbell

Figure 1 for Learning to Compose Skills
Figure 2 for Learning to Compose Skills
Figure 3 for Learning to Compose Skills
Figure 4 for Learning to Compose Skills
Viaarxiv icon

State Space Decomposition and Subgoal Creation for Transfer in Deep Reinforcement Learning

Add code
Bookmark button
Alert button
May 24, 2017
Himanshu Sahni, Saurabh Kumar, Farhan Tejani, Yannick Schroecker, Charles Isbell

Figure 1 for State Space Decomposition and Subgoal Creation for Transfer in Deep Reinforcement Learning
Figure 2 for State Space Decomposition and Subgoal Creation for Transfer in Deep Reinforcement Learning
Viaarxiv icon