Picture for Max Wilcoxson

Max Wilcoxson

Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration

Add code
Oct 23, 2024
Figure 1 for Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Figure 2 for Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Figure 3 for Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Figure 4 for Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Viaarxiv icon

Polynomial Regression as a Task for Understanding In-context Learning Through Finetuning and Alignment

Add code
Jul 27, 2024
Viaarxiv icon