Alert button
Picture for Tanmay Gangwani

Tanmay Gangwani

Alert button

Multi-Objective Optimization via Wasserstein-Fisher-Rao Gradient Flow

Add code
Bookmark button
Alert button
Nov 22, 2023
Yinuo Ren, Tesi Xiao, Tanmay Gangwani, Anshuka Rangi, Holakou Rahmanian, Lexing Ying, Subhajit Sanyal

Viaarxiv icon

Selective Uncertainty Propagation in Offline RL

Add code
Bookmark button
Alert button
Feb 01, 2023
Sanath Kumar Krishnamurthy, Tanmay Gangwani, Sumeet Katariya, Branislav Kveton, Anshuka Rangi

Figure 1 for Selective Uncertainty Propagation in Offline RL
Figure 2 for Selective Uncertainty Propagation in Offline RL
Figure 3 for Selective Uncertainty Propagation in Offline RL
Figure 4 for Selective Uncertainty Propagation in Offline RL
Viaarxiv icon

Imitation Learning from Observations under Transition Model Disparity

Add code
Bookmark button
Alert button
Apr 25, 2022
Tanmay Gangwani, Yuan Zhou, Jian Peng

Figure 1 for Imitation Learning from Observations under Transition Model Disparity
Figure 2 for Imitation Learning from Observations under Transition Model Disparity
Figure 3 for Imitation Learning from Observations under Transition Model Disparity
Figure 4 for Imitation Learning from Observations under Transition Model Disparity
Viaarxiv icon

Hindsight Foresight Relabeling for Meta-Reinforcement Learning

Add code
Bookmark button
Alert button
Sep 18, 2021
Michael Wan, Jian Peng, Tanmay Gangwani

Figure 1 for Hindsight Foresight Relabeling for Meta-Reinforcement Learning
Figure 2 for Hindsight Foresight Relabeling for Meta-Reinforcement Learning
Figure 3 for Hindsight Foresight Relabeling for Meta-Reinforcement Learning
Figure 4 for Hindsight Foresight Relabeling for Meta-Reinforcement Learning
Viaarxiv icon

Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity

Add code
Bookmark button
Alert button
Nov 05, 2020
Tanmay Gangwani, Jian Peng, Yuan Zhou

Figure 1 for Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity
Figure 2 for Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity
Figure 3 for Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity
Figure 4 for Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity
Viaarxiv icon

Learning Guidance Rewards with Trajectory-space Smoothing

Add code
Bookmark button
Alert button
Oct 23, 2020
Tanmay Gangwani, Yuan Zhou, Jian Peng

Figure 1 for Learning Guidance Rewards with Trajectory-space Smoothing
Figure 2 for Learning Guidance Rewards with Trajectory-space Smoothing
Figure 3 for Learning Guidance Rewards with Trajectory-space Smoothing
Figure 4 for Learning Guidance Rewards with Trajectory-space Smoothing
Viaarxiv icon

Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch

Add code
Bookmark button
Alert button
Jun 12, 2020
Michael Wan, Tanmay Gangwani, Jian Peng

Figure 1 for Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch
Figure 2 for Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch
Figure 3 for Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch
Figure 4 for Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch
Viaarxiv icon

State-only Imitation with Transition Dynamics Mismatch

Add code
Bookmark button
Alert button
Feb 27, 2020
Tanmay Gangwani, Jian Peng

Figure 1 for State-only Imitation with Transition Dynamics Mismatch
Figure 2 for State-only Imitation with Transition Dynamics Mismatch
Figure 3 for State-only Imitation with Transition Dynamics Mismatch
Figure 4 for State-only Imitation with Transition Dynamics Mismatch
Viaarxiv icon

Learning Belief Representations for Imitation Learning in POMDPs

Add code
Bookmark button
Alert button
Jun 22, 2019
Tanmay Gangwani, Joel Lehman, Qiang Liu, Jian Peng

Figure 1 for Learning Belief Representations for Imitation Learning in POMDPs
Figure 2 for Learning Belief Representations for Imitation Learning in POMDPs
Figure 3 for Learning Belief Representations for Imitation Learning in POMDPs
Figure 4 for Learning Belief Representations for Imitation Learning in POMDPs
Viaarxiv icon

Learning Self-Imitating Diverse Policies

Add code
Bookmark button
Alert button
May 25, 2018
Tanmay Gangwani, Qiang Liu, Jian Peng

Figure 1 for Learning Self-Imitating Diverse Policies
Figure 2 for Learning Self-Imitating Diverse Policies
Figure 3 for Learning Self-Imitating Diverse Policies
Figure 4 for Learning Self-Imitating Diverse Policies
Viaarxiv icon