Picture for Axel Højmark

Axel Højmark

Stress Testing Deliberative Alignment for Anti-Scheming Training

Add code
Sep 19, 2025
Viaarxiv icon

Forecasting Frontier Language Model Agent Capabilities

Add code
Feb 21, 2025
Figure 1 for Forecasting Frontier Language Model Agent Capabilities
Figure 2 for Forecasting Frontier Language Model Agent Capabilities
Figure 3 for Forecasting Frontier Language Model Agent Capabilities
Figure 4 for Forecasting Frontier Language Model Agent Capabilities
Viaarxiv icon

Analyzing Probabilistic Methods for Evaluating Agent Capabilities

Add code
Sep 24, 2024
Figure 1 for Analyzing Probabilistic Methods for Evaluating Agent Capabilities
Figure 2 for Analyzing Probabilistic Methods for Evaluating Agent Capabilities
Viaarxiv icon