Rasool Fakoor

TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models

Oct 09, 2023
Zuxin Liu, Jesse Zhang, Kavosh Asadi, Yao Liu, Ding Zhao, Shoham Sabach, Rasool Fakoor

Budgeting Counterfactual for Offline RL

Jul 12, 2023
Yao Liu, Pratik Chaudhari, Rasool Fakoor

Resetting the Optimizer in Deep RL: An Empirical Study

Jun 30, 2023
Kavosh Asadi, Rasool Fakoor, Shoham Sabach

TD Convergence: An Optimization Perspective

Jun 30, 2023
Kavosh Asadi, Shoham Sabach, Yao Liu, Omer Gottesman, Rasool Fakoor

Data drift correction via time-varying importance weight estimator

Oct 04, 2022
Rasool Fakoor, Jonas Mueller, Zachary C. Lipton, Pratik Chaudhari, Alexander J. Smola

Task-Agnostic Continual Reinforcement Learning: In Praise of a Simple Baseline

May 28, 2022
Massimo Caccia, Jonas Mueller, Taesup Kim, Laurent Charlin, Rasool Fakoor

Deep Q-Network with Proximal Iteration

Dec 10, 2021
Kavosh Asadi, Rasool Fakoor, Omer Gottesman, Michael L. Littman, Alexander J. Smola

Deep Quantile Aggregation

Mar 16, 2021
Taesup Kim, Rasool Fakoor, Jonas Mueller, Alexander J. Smola, Ryan J. Tibshirani

Continuous Doubly Constrained Batch Reinforcement Learning

Feb 23, 2021
Rasool Fakoor, Jonas Mueller, Pratik Chaudhari, Alexander J. Smola
