Paria Rashidinejad

Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning

Jan 30, 2023
Hanlin Zhu, Paria Rashidinejad, Jiantao Jiao

Via arXiv

Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian

Nov 01, 2022
Paria Rashidinejad, Hanlin Zhu, Kunhe Yang, Stuart Russell, Jiantao Jiao

Via arXiv

MADE: Exploration via Maximizing Deviation from Explored Regions

Jun 18, 2021
Tianjun Zhang, Paria Rashidinejad, Jiantao Jiao, Yuandong Tian, Joseph Gonzalez, Stuart Russell

Via arXiv

Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism

Mar 22, 2021
Paria Rashidinejad, Banghua Zhu, Cong Ma, Jiantao Jiao, Stuart Russell

Via arXiv

SLIP: Learning to Predict in Unknown Dynamical Systems with Long-Term Memory

Oct 12, 2020
Paria Rashidinejad, Jiantao Jiao, Stuart Russell

Via arXiv