Picture for Mason Nakamura

Mason Nakamura

Aligning LLMs on a Budget: Inference-Time Alignment with Heuristic Reward Models

Add code
Aug 07, 2025
Viaarxiv icon

MAPLE: A Framework for Active Preference Learning Guided by Large Language Models

Add code
Dec 10, 2024
Viaarxiv icon