Vikramjeet Das

Sample Efficient Reinforcement Learning from Human Feedback via Active Exploration

Dec 01, 2023
Viraj Mehta, Vikramjeet Das, Ojash Neopane, Yijia Dai, Ilija Bogunovic, Jeff Schneider, Willie Neiswanger

Via arXiv

Kernelized Offline Contextual Dueling Bandits

Jul 21, 2023
Viraj Mehta, Ojash Neopane, Vikramjeet Das, Sen Lin, Jeff Schneider, Willie Neiswanger

Via arXiv