Alert button
Picture for Ahmed Touati

Ahmed Touati

Alert button

Simple Ingredients for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 19, 2024
Edoardo Cetin, Andrea Tirinzoni, Matteo Pirotta, Alessandro Lazaric, Yann Ollivier, Ahmed Touati

Figure 1 for Simple Ingredients for Offline Reinforcement Learning
Figure 2 for Simple Ingredients for Offline Reinforcement Learning
Figure 3 for Simple Ingredients for Offline Reinforcement Learning
Figure 4 for Simple Ingredients for Offline Reinforcement Learning
Viaarxiv icon

Score Models for Offline Goal-Conditioned Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 03, 2023
Harshit Sikchi, Rohan Chitnis, Ahmed Touati, Alborz Geramifard, Amy Zhang, Scott Niekum

Viaarxiv icon

A State Representation for Diminishing Rewards

Add code
Bookmark button
Alert button
Sep 07, 2023
Ted Moskovitz, Samo Hromadka, Ahmed Touati, Diana Borsa, Maneesh Sahani

Figure 1 for A State Representation for Diminishing Rewards
Figure 2 for A State Representation for Diminishing Rewards
Figure 3 for A State Representation for Diminishing Rewards
Figure 4 for A State Representation for Diminishing Rewards
Viaarxiv icon

Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees

Add code
Bookmark button
Alert button
Oct 24, 2022
Andrea Tirinzoni, Matteo Papini, Ahmed Touati, Alessandro Lazaric, Matteo Pirotta

Figure 1 for Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees
Figure 2 for Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees
Figure 3 for Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees
Figure 4 for Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees
Viaarxiv icon

Does Zero-Shot Reinforcement Learning Exist?

Add code
Bookmark button
Alert button
Sep 29, 2022
Ahmed Touati, Jérémy Rapin, Yann Ollivier

Figure 1 for Does Zero-Shot Reinforcement Learning Exist?
Figure 2 for Does Zero-Shot Reinforcement Learning Exist?
Figure 3 for Does Zero-Shot Reinforcement Learning Exist?
Figure 4 for Does Zero-Shot Reinforcement Learning Exist?
Viaarxiv icon

Learning One Representation to Optimize All Rewards

Add code
Bookmark button
Alert button
Mar 14, 2021
Ahmed Touati, Yann Ollivier

Figure 1 for Learning One Representation to Optimize All Rewards
Figure 2 for Learning One Representation to Optimize All Rewards
Figure 3 for Learning One Representation to Optimize All Rewards
Figure 4 for Learning One Representation to Optimize All Rewards
Viaarxiv icon

Efficient Learning in Non-Stationary Linear Markov Decision Processes

Add code
Bookmark button
Alert button
Oct 24, 2020
Ahmed Touati, Pascal Vincent

Figure 1 for Efficient Learning in Non-Stationary Linear Markov Decision Processes
Viaarxiv icon

Maximum Reward Formulation In Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 08, 2020
Sai Krishna Gottipati, Yashaswi Pathak, Rohan Nuttall, Sahir, Raviteja Chunduru, Ahmed Touati, Sriram Ganapathi Subramanian, Matthew E. Taylor, Sarath Chandar

Figure 1 for Maximum Reward Formulation In Reinforcement Learning
Figure 2 for Maximum Reward Formulation In Reinforcement Learning
Figure 3 for Maximum Reward Formulation In Reinforcement Learning
Figure 4 for Maximum Reward Formulation In Reinforcement Learning
Viaarxiv icon

Sharp Analysis of Smoothed Bellman Error Embedding

Add code
Bookmark button
Alert button
Jul 07, 2020
Ahmed Touati, Pascal Vincent

Viaarxiv icon

TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?

Add code
Bookmark button
Alert button
Jul 06, 2020
Joshua Romoff, Peter Henderson, David Kanaa, Emmanuel Bengio, Ahmed Touati, Pierre-Luc Bacon, Joelle Pineau

Figure 1 for TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Figure 2 for TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Figure 3 for TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Figure 4 for TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Viaarxiv icon