Picture for Tom Bewley

Tom Bewley

Voxtral

Add code
Jul 17, 2025
Viaarxiv icon

Zero-Shot Reinforcement Learning Under Partial Observability

Add code
Jun 18, 2025
Viaarxiv icon

Sequential Harmful Shift Detection Without Labels

Add code
Dec 17, 2024
Figure 1 for Sequential Harmful Shift Detection Without Labels
Figure 2 for Sequential Harmful Shift Detection Without Labels
Figure 3 for Sequential Harmful Shift Detection Without Labels
Figure 4 for Sequential Harmful Shift Detection Without Labels
Viaarxiv icon

Interpreting Language Reward Models via Contrastive Explanations

Add code
Nov 25, 2024
Figure 1 for Interpreting Language Reward Models via Contrastive Explanations
Figure 2 for Interpreting Language Reward Models via Contrastive Explanations
Figure 3 for Interpreting Language Reward Models via Contrastive Explanations
Figure 4 for Interpreting Language Reward Models via Contrastive Explanations
Viaarxiv icon

Counterfactual Metarules for Local and Global Recourse

Add code
May 29, 2024
Figure 1 for Counterfactual Metarules for Local and Global Recourse
Figure 2 for Counterfactual Metarules for Local and Global Recourse
Figure 3 for Counterfactual Metarules for Local and Global Recourse
Figure 4 for Counterfactual Metarules for Local and Global Recourse
Viaarxiv icon

Conservative World Models

Add code
Sep 26, 2023
Figure 1 for Conservative World Models
Figure 2 for Conservative World Models
Figure 3 for Conservative World Models
Figure 4 for Conservative World Models
Viaarxiv icon

Learning Interpretable Models of Aircraft Handling Behaviour by Reinforcement Learning from Human Feedback

Add code
May 26, 2023
Figure 1 for Learning Interpretable Models of Aircraft Handling Behaviour by Reinforcement Learning from Human Feedback
Figure 2 for Learning Interpretable Models of Aircraft Handling Behaviour by Reinforcement Learning from Human Feedback
Figure 3 for Learning Interpretable Models of Aircraft Handling Behaviour by Reinforcement Learning from Human Feedback
Figure 4 for Learning Interpretable Models of Aircraft Handling Behaviour by Reinforcement Learning from Human Feedback
Viaarxiv icon

Reward Learning with Trees: Methods and Evaluation

Add code
Oct 03, 2022
Figure 1 for Reward Learning with Trees: Methods and Evaluation
Figure 2 for Reward Learning with Trees: Methods and Evaluation
Figure 3 for Reward Learning with Trees: Methods and Evaluation
Figure 4 for Reward Learning with Trees: Methods and Evaluation
Viaarxiv icon

Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning

Add code
May 30, 2022
Figure 1 for Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning
Figure 2 for Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning
Figure 3 for Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning
Figure 4 for Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning
Viaarxiv icon

Summarising and Comparing Agent Dynamics with Contrastive Spatiotemporal Abstraction

Add code
Jan 17, 2022
Figure 1 for Summarising and Comparing Agent Dynamics with Contrastive Spatiotemporal Abstraction
Figure 2 for Summarising and Comparing Agent Dynamics with Contrastive Spatiotemporal Abstraction
Figure 3 for Summarising and Comparing Agent Dynamics with Contrastive Spatiotemporal Abstraction
Figure 4 for Summarising and Comparing Agent Dynamics with Contrastive Spatiotemporal Abstraction
Viaarxiv icon