Murdock Aubry

Trust the Batch, On- or Off-Policy: Adaptive Policy Optimization for RL Post-Training

May 12, 2026

Transformer Alignment in Large Language Models

Jul 10, 2024