Picture for Flemming Kondrup

Flemming Kondrup

Noticing the Watcher: LLM Agents Can Infer CoT Monitoring from Blocking Feedback

Add code
Mar 14, 2026
Viaarxiv icon

Cracking the Code of Action: a Generative Approach to Affordances for Reinforcement Learning

Add code
Apr 24, 2025
Viaarxiv icon

Forecaster: Towards Temporally Abstract Tree-Search Planning from Pixels

Add code
Oct 16, 2023
Viaarxiv icon

Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning

Add code
Oct 05, 2022
Viaarxiv icon