Théo Vincent

Bridging the Performance Gap Between Target-Free and Target-Based Reinforcement Learning With Iterated Q-Learning

Jun 04, 2025
Via arXiv

Deep Reinforcement Learning Agents are not even close to Human Intelligence

May 27, 2025
Via arXiv

Eau De $Q$-Network: Adaptive Distillation of Neural Networks in Deep Reinforcement Learning

Mar 03, 2025
Via arXiv

Adaptive $Q$-Network: On-the-fly Target Selection for Deep Reinforcement Learning

May 25, 2024
Via arXiv

Iterated $Q$-Network: Beyond the One-Step Bellman Operator

Mar 04, 2024
Via arXiv

Parameterized Projected Bellman Operator

Dec 20, 2023
Via arXiv