Xiaoliang Peng

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Jan 15, 2026
Via arXiv

Reinforcement Learning from User Feedback

May 20, 2025
Via arXiv