Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Valeria Capretti

Efficient Reinforcement Learning from Human Feedback via Bayesian Preference Inference

Nov 06, 2025

Matteo Cercola, Valeria Capretti, Simone Formentin

Figure 1 for Efficient Reinforcement Learning from Human Feedback via Bayesian Preference Inference

Figure 2 for Efficient Reinforcement Learning from Human Feedback via Bayesian Preference Inference

Figure 3 for Efficient Reinforcement Learning from Human Feedback via Bayesian Preference Inference

Figure 4 for Efficient Reinforcement Learning from Human Feedback via Bayesian Preference Inference

Abstract:Learning from human preferences is a cornerstone of aligning machine learning models with subjective human judgments. Yet, collecting such preference data is often costly and time-consuming, motivating the need for more efficient learning paradigms. Two established approaches offer complementary advantages: RLHF scales effectively to high-dimensional tasks such as LLM fine-tuning, while PBO achieves greater sample efficiency through active querying. We propose a hybrid framework that unifies RLHF's scalability with PBO's query efficiency by integrating an acquisition-driven module into the RLHF pipeline, thereby enabling active and sample-efficient preference gathering. We validate the proposed approach on two representative domains: (i) high-dimensional preference optimization and (ii) LLM fine-tuning. Experimental results demonstrate consistent improvements in both sample efficiency and overall performance across these tasks.

Via

Access Paper or Ask Questions