FLoRA: Sample-Efficient Preference-based RL via Low-Rank Style Adaptation of Reward Functions

Add code
Apr 14, 2025
Figure 1 for FLoRA: Sample-Efficient Preference-based RL via Low-Rank Style Adaptation of Reward Functions
Figure 2 for FLoRA: Sample-Efficient Preference-based RL via Low-Rank Style Adaptation of Reward Functions
Figure 3 for FLoRA: Sample-Efficient Preference-based RL via Low-Rank Style Adaptation of Reward Functions
Figure 4 for FLoRA: Sample-Efficient Preference-based RL via Low-Rank Style Adaptation of Reward Functions

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: