Picture for Edward James Young

Edward James Young

A transformer architecture alteration to incentivise externalised reasoning

Add code
Mar 22, 2026
Viaarxiv icon

Questionnaire Responses Do not Capture the Safety of AI Agents

Add code
Mar 15, 2026
Viaarxiv icon

Diagnosing Pathological Chain-of-Thought in Reasoning Models

Add code
Feb 14, 2026
Viaarxiv icon

KL-Regularised Q-Learning: A Token-level Action-Value perspective on Online RLHF

Add code
Aug 23, 2025
Figure 1 for KL-Regularised Q-Learning: A Token-level Action-Value perspective on Online RLHF
Figure 2 for KL-Regularised Q-Learning: A Token-level Action-Value perspective on Online RLHF
Figure 3 for KL-Regularised Q-Learning: A Token-level Action-Value perspective on Online RLHF
Figure 4 for KL-Regularised Q-Learning: A Token-level Action-Value perspective on Online RLHF
Viaarxiv icon

AgentMisalignment: Measuring the Propensity for Misaligned Behaviour in LLM-Based Agents

Add code
Jun 04, 2025
Viaarxiv icon

Reinforcement Learning applied to Insurance Portfolio Pursuit

Add code
Aug 02, 2024
Figure 1 for Reinforcement Learning applied to Insurance Portfolio Pursuit
Figure 2 for Reinforcement Learning applied to Insurance Portfolio Pursuit
Viaarxiv icon