Kavosh Asadi

Learning the Target Network in Function Space

Jun 03, 2024
Via arXiv

TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models

Oct 09, 2023
Via arXiv

Resetting the Optimizer in Deep RL: An Empirical Study

Jun 30, 2023
Via arXiv

TD Convergence: An Optimization Perspective

Jun 30, 2023
Via arXiv

Characterizing the Action-Generalization Gap in Deep Q-Learning

May 11, 2022
Via arXiv

Deep Q-Network with Proximal Iteration

Dec 10, 2021
Via arXiv

Coarse-Grained Smoothness for RL in Metric Spaces

Oct 23, 2021
Via arXiv

Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback

Sep 15, 2021
Via arXiv

Learning State Abstractions for Transfer in Continuous Control

Feb 08, 2020
Via arXiv

Deep RBF Value Functions for Continuous Control

Feb 05, 2020
Via arXiv