Remi Tachet des Combes

Understanding and Addressing the Pitfalls of Bisimulation-based Representations in Offline Reinforcement Learning

Oct 26, 2023
Hongyu Zang, Xin Li, Leiji Zhang, Yang Liu, Baigui Sun, Riashat Islam, Remi Tachet des Combes, Romain Laroche

Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information

Oct 31, 2022
Riashat Islam, Manan Tomar, Alex Lamb, Yonathan Efroni, Hongyu Zang, Aniket Didolkar, Dipendra Misra, Xin Li, Harm van Seijen, Remi Tachet des Combes, John Langford

Incorporating Explicit Uncertainty Estimates into Deep Offline Reinforcement Learning

Jun 02, 2022
David Brandfonbrener, Remi Tachet des Combes, Romain Laroche

Non-Markovian policies occupancy measures

May 27, 2022
Romain Laroche, Remi Tachet des Combes, Jacob Buckman

On the Regularity of Attention

Feb 10, 2021
James Vuckovic, Aristide Baratin, Remi Tachet des Combes

A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms

Oct 02, 2020
Shangtong Zhang, Romain Laroche, Harm van Seijen, Shimon Whiteson, Remi Tachet des Combes

A Mathematical Theory of Attention

Jul 06, 2020
James Vuckovic, Aristide Baratin, Remi Tachet des Combes

Deep Reinforcement and InfoMax Learning

Jun 12, 2020
Bogdan Mazoure, Remi Tachet des Combes, Thang Doan, Philip Bachman, R Devon Hjelm

Domain Adaptation with Conditional Distribution Matching and Generalized Label Shift

Mar 10, 2020
Remi Tachet des Combes, Han Zhao, Yu-Xiang Wang, Geoff Gordon

A Reduction from Reinforcement Learning to No-Regret Online Learning

Jan 01, 2020
Ching-An Cheng, Remi Tachet des Combes, Byron Boots, Geoff Gordon
