Picture for Serge Thilges

Serge Thilges

PAWS: Preference Learning with Advantage-Weighted Segments

Add code
Jun 10, 2026
Viaarxiv icon

Open the Black Box: Step-based Policy Updates for Temporally-Correlated Episodic Reinforcement Learning

Add code
Jan 21, 2024
Viaarxiv icon