Quentin Delfosse

Kintsugi: Learning Policies by Repairing Executable Knowledge Bases

May 10, 2026

LLMs Gaming Verifiers: RLVR can Lead to Reward Hacking

Apr 16, 2026

STORM: Segment, Track, and Object Re-Localization from a Single 3D Model

Nov 12, 2025

Deep Reinforcement Learning Agents are not even close to Human Intelligence

May 27, 2025

Better Decisions through the Right Causal World Model

Apr 09, 2025

Evaluating Interpretable Reinforcement Learning by Distilling Policies into Programs

Mar 11, 2025

Interpretable end-to-end Neurosymbolic Reinforcement Learning agents

Oct 18, 2024

BlendRL: A Framework for Merging Symbolic and Neural Policy Learning

Oct 15, 2024

OCALM: Object-Centric Assessment with Language Models

Jun 24, 2024

EXPIL: Explanatory Predicate Invention for Learning in Games

Jun 10, 2024