Picture for Edward James Young

Edward James Young

KL-Regularised Q-Learning: A Token-level Action-Value perspective on Online RLHF

Add code
Aug 23, 2025
Viaarxiv icon

AgentMisalignment: Measuring the Propensity for Misaligned Behaviour in LLM-Based Agents

Add code
Jun 04, 2025
Viaarxiv icon

Reinforcement Learning applied to Insurance Portfolio Pursuit

Add code
Aug 02, 2024
Figure 1 for Reinforcement Learning applied to Insurance Portfolio Pursuit
Figure 2 for Reinforcement Learning applied to Insurance Portfolio Pursuit
Viaarxiv icon