Reward Learning as Doubly Nonparametric Bandits: Optimal Design and Scaling Laws

Feb 23, 2023
Kush Bhatia, Wenshuo Guo, Jacob Steinhardt

Figure 1 for Reward Learning as Doubly Nonparametric Bandits: Optimal Design and Scaling Laws
Figure 2 for Reward Learning as Doubly Nonparametric Bandits: Optimal Design and Scaling Laws

View paper on arXiv
