Simon Holk

FLoRA: Sample-Efficient Preference-based RL via Low-Rank Style Adaptation of Reward Functions

Apr 14, 2025
Via arXiv

PREDILECT: Preferences Delineated with Zero-Shot Language-based Reasoning in Reinforcement Learning

Feb 23, 2024
Via arXiv