Stephane Collot

Jack

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Jan 15, 2026
Via arXiv

Balanced Accuracy: The Right Metric for Evaluating LLM Judges -- Explained through Youden's J statistic

Dec 08, 2025
Via arXiv

The Llama 3 Herd of Models

Jul 31, 2024
Via arXiv