Picture for Tom Bewley

Tom Bewley

ShapShift: Explaining Model Prediction Shifts with Subgroup Conditional Shapley Values

Add code
Apr 13, 2026
Viaarxiv icon

Voxtral TTS

Add code
Mar 26, 2026
Viaarxiv icon

Voxtral Realtime

Add code
Feb 11, 2026
Viaarxiv icon

Ministral 3

Add code
Jan 13, 2026
Viaarxiv icon

Voxtral

Add code
Jul 17, 2025
Viaarxiv icon

Zero-Shot Reinforcement Learning Under Partial Observability

Add code
Jun 18, 2025
Viaarxiv icon

Sequential Harmful Shift Detection Without Labels

Add code
Dec 17, 2024
Figure 1 for Sequential Harmful Shift Detection Without Labels
Figure 2 for Sequential Harmful Shift Detection Without Labels
Figure 3 for Sequential Harmful Shift Detection Without Labels
Figure 4 for Sequential Harmful Shift Detection Without Labels
Viaarxiv icon

Interpreting Language Reward Models via Contrastive Explanations

Add code
Nov 25, 2024
Figure 1 for Interpreting Language Reward Models via Contrastive Explanations
Figure 2 for Interpreting Language Reward Models via Contrastive Explanations
Figure 3 for Interpreting Language Reward Models via Contrastive Explanations
Figure 4 for Interpreting Language Reward Models via Contrastive Explanations
Viaarxiv icon

Counterfactual Metarules for Local and Global Recourse

Add code
May 29, 2024
Figure 1 for Counterfactual Metarules for Local and Global Recourse
Figure 2 for Counterfactual Metarules for Local and Global Recourse
Figure 3 for Counterfactual Metarules for Local and Global Recourse
Figure 4 for Counterfactual Metarules for Local and Global Recourse
Viaarxiv icon

Conservative World Models

Add code
Sep 26, 2023
Figure 1 for Conservative World Models
Figure 2 for Conservative World Models
Figure 3 for Conservative World Models
Figure 4 for Conservative World Models
Viaarxiv icon