Alert button

Q-Probe: A Lightweight Approach to Reward Maximization for Language Models

Add code
Bookmark button
Alert button
Feb 22, 2024
Kenneth Li, Samy Jelassi, Hugh Zhang, Sham Kakade, Martin Wattenberg, David Brandfonbrener

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: