Picture for William Bankes

William Bankes

LLM-WikiRace: Benchmarking Long-term Planning and Reasoning over Real-World Knowledge Graphs

Add code
Feb 18, 2026
Viaarxiv icon

LLMs Encode Their Failures: Predicting Success from Pre-Generation Activations

Add code
Feb 10, 2026
Viaarxiv icon

Detecting High-Stakes Interactions with Activation Probes

Add code
Jun 12, 2025
Figure 1 for Detecting High-Stakes Interactions with Activation Probes
Figure 2 for Detecting High-Stakes Interactions with Activation Probes
Figure 3 for Detecting High-Stakes Interactions with Activation Probes
Figure 4 for Detecting High-Stakes Interactions with Activation Probes
Viaarxiv icon

Robust Multi-Objective Controlled Decoding of Large Language Models

Add code
Mar 11, 2025
Figure 1 for Robust Multi-Objective Controlled Decoding of Large Language Models
Figure 2 for Robust Multi-Objective Controlled Decoding of Large Language Models
Figure 3 for Robust Multi-Objective Controlled Decoding of Large Language Models
Figure 4 for Robust Multi-Objective Controlled Decoding of Large Language Models
Viaarxiv icon

Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift

Add code
Jul 26, 2024
Figure 1 for Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift
Figure 2 for Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift
Figure 3 for Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift
Figure 4 for Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift
Viaarxiv icon

REDUCR: Robust Data Downsampling Using Class Priority Reweighting

Add code
Dec 01, 2023
Viaarxiv icon