Alert button
Picture for Tom Bewley

Tom Bewley

Alert button

Conservative World Models

Add code
Bookmark button
Alert button
Sep 26, 2023
Scott Jeen, Tom Bewley, Jonathan M. Cullen

Figure 1 for Conservative World Models
Figure 2 for Conservative World Models
Figure 3 for Conservative World Models
Figure 4 for Conservative World Models
Viaarxiv icon

Learning Interpretable Models of Aircraft Handling Behaviour by Reinforcement Learning from Human Feedback

Add code
Bookmark button
Alert button
May 26, 2023
Tom Bewley, Jonathan Lawry, Arthur Richards

Figure 1 for Learning Interpretable Models of Aircraft Handling Behaviour by Reinforcement Learning from Human Feedback
Figure 2 for Learning Interpretable Models of Aircraft Handling Behaviour by Reinforcement Learning from Human Feedback
Figure 3 for Learning Interpretable Models of Aircraft Handling Behaviour by Reinforcement Learning from Human Feedback
Figure 4 for Learning Interpretable Models of Aircraft Handling Behaviour by Reinforcement Learning from Human Feedback
Viaarxiv icon

Reward Learning with Trees: Methods and Evaluation

Add code
Bookmark button
Alert button
Oct 03, 2022
Tom Bewley, Jonathan Lawry, Arthur Richards, Rachel Craddock, Ian Henderson

Figure 1 for Reward Learning with Trees: Methods and Evaluation
Figure 2 for Reward Learning with Trees: Methods and Evaluation
Figure 3 for Reward Learning with Trees: Methods and Evaluation
Figure 4 for Reward Learning with Trees: Methods and Evaluation
Viaarxiv icon

Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning

Add code
Bookmark button
Alert button
May 30, 2022
Joseph Early, Tom Bewley, Christine Evers, Sarvapali Ramchurn

Figure 1 for Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning
Figure 2 for Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning
Figure 3 for Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning
Figure 4 for Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning
Viaarxiv icon

Summarising and Comparing Agent Dynamics with Contrastive Spatiotemporal Abstraction

Add code
Bookmark button
Alert button
Jan 17, 2022
Tom Bewley, Jonathan Lawry, Arthur Richards

Figure 1 for Summarising and Comparing Agent Dynamics with Contrastive Spatiotemporal Abstraction
Figure 2 for Summarising and Comparing Agent Dynamics with Contrastive Spatiotemporal Abstraction
Figure 3 for Summarising and Comparing Agent Dynamics with Contrastive Spatiotemporal Abstraction
Figure 4 for Summarising and Comparing Agent Dynamics with Contrastive Spatiotemporal Abstraction
Viaarxiv icon

Interpretable Preference-based Reinforcement Learning with Tree-Structured Reward Functions

Add code
Bookmark button
Alert button
Dec 20, 2021
Tom Bewley, Freddy Lecue

Figure 1 for Interpretable Preference-based Reinforcement Learning with Tree-Structured Reward Functions
Figure 2 for Interpretable Preference-based Reinforcement Learning with Tree-Structured Reward Functions
Figure 3 for Interpretable Preference-based Reinforcement Learning with Tree-Structured Reward Functions
Figure 4 for Interpretable Preference-based Reinforcement Learning with Tree-Structured Reward Functions
Viaarxiv icon

TripleTree: A Versatile Interpretable Representation of Black Box Agents and their Environments

Add code
Bookmark button
Alert button
Sep 21, 2020
Tom Bewley, Jonathan Lawry

Figure 1 for TripleTree: A Versatile Interpretable Representation of Black Box Agents and their Environments
Figure 2 for TripleTree: A Versatile Interpretable Representation of Black Box Agents and their Environments
Figure 3 for TripleTree: A Versatile Interpretable Representation of Black Box Agents and their Environments
Figure 4 for TripleTree: A Versatile Interpretable Representation of Black Box Agents and their Environments
Viaarxiv icon

Am I Building a White Box Agent or Interpreting a Black Box Agent?

Add code
Bookmark button
Alert button
Jul 08, 2020
Tom Bewley

Viaarxiv icon

Modelling Agent Policies with Interpretable Imitation Learning

Add code
Bookmark button
Alert button
Jun 19, 2020
Tom Bewley, Jonathan Lawry, Arthur Richards

Figure 1 for Modelling Agent Policies with Interpretable Imitation Learning
Figure 2 for Modelling Agent Policies with Interpretable Imitation Learning
Figure 3 for Modelling Agent Policies with Interpretable Imitation Learning
Viaarxiv icon