Ian Osband

Tony

Delightful Exploration

May 13, 2026

Delightful Gradients Accelerate Corner Escape

May 12, 2026

Delightful Distributed Policy Gradient

Mar 20, 2026

Does This Gradient Spark Joy?

Mar 20, 2026

Delightful Policy Gradient

Mar 15, 2026

OpenAI o1 System Card

Dec 21, 2024

GPT-4o System Card

Oct 25, 2024

Approximate Thompson Sampling via Epistemic Neural Networks

Feb 18, 2023

Fine-Tuning Language Models via Epistemic Neural Networks

Nov 03, 2022

Robustness of Epinets against Distributional Shifts

Jul 01, 2022