Picture for Ahmed Touati

Ahmed Touati

Simple Ingredients for Offline Reinforcement Learning

Add code
Mar 19, 2024
Figure 1 for Simple Ingredients for Offline Reinforcement Learning
Figure 2 for Simple Ingredients for Offline Reinforcement Learning
Figure 3 for Simple Ingredients for Offline Reinforcement Learning
Figure 4 for Simple Ingredients for Offline Reinforcement Learning
Viaarxiv icon

Score Models for Offline Goal-Conditioned Reinforcement Learning

Add code
Nov 03, 2023
Viaarxiv icon

A State Representation for Diminishing Rewards

Add code
Sep 07, 2023
Figure 1 for A State Representation for Diminishing Rewards
Figure 2 for A State Representation for Diminishing Rewards
Figure 3 for A State Representation for Diminishing Rewards
Figure 4 for A State Representation for Diminishing Rewards
Viaarxiv icon

Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees

Add code
Oct 24, 2022
Figure 1 for Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees
Figure 2 for Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees
Figure 3 for Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees
Figure 4 for Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees
Viaarxiv icon

Does Zero-Shot Reinforcement Learning Exist?

Add code
Sep 29, 2022
Figure 1 for Does Zero-Shot Reinforcement Learning Exist?
Figure 2 for Does Zero-Shot Reinforcement Learning Exist?
Figure 3 for Does Zero-Shot Reinforcement Learning Exist?
Figure 4 for Does Zero-Shot Reinforcement Learning Exist?
Viaarxiv icon

Learning One Representation to Optimize All Rewards

Add code
Mar 14, 2021
Figure 1 for Learning One Representation to Optimize All Rewards
Figure 2 for Learning One Representation to Optimize All Rewards
Figure 3 for Learning One Representation to Optimize All Rewards
Figure 4 for Learning One Representation to Optimize All Rewards
Viaarxiv icon

Efficient Learning in Non-Stationary Linear Markov Decision Processes

Add code
Oct 24, 2020
Figure 1 for Efficient Learning in Non-Stationary Linear Markov Decision Processes
Viaarxiv icon

Maximum Reward Formulation In Reinforcement Learning

Add code
Oct 08, 2020
Figure 1 for Maximum Reward Formulation In Reinforcement Learning
Figure 2 for Maximum Reward Formulation In Reinforcement Learning
Figure 3 for Maximum Reward Formulation In Reinforcement Learning
Figure 4 for Maximum Reward Formulation In Reinforcement Learning
Viaarxiv icon

Sharp Analysis of Smoothed Bellman Error Embedding

Add code
Jul 07, 2020
Viaarxiv icon

TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?

Add code
Jul 06, 2020
Figure 1 for TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Figure 2 for TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Figure 3 for TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Figure 4 for TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Viaarxiv icon