Anastasios N. Angelopoulos

Music Arena: Live Evaluation for Text-to-Music

Jul 28, 2025
Via arXiv

Cost-Optimal Active AI Model Evaluation

Jun 09, 2025
Via arXiv

Search Arena: Analyzing Search-Augmented LLMs

Jun 05, 2025
Via arXiv

Prompt-to-Leaderboard

Feb 20, 2025
Via arXiv

Gradient Equilibrium in Online Learning: Theory and Applications

Jan 14, 2025
Via arXiv

Theoretical Foundations of Conformal Prediction

Nov 18, 2024
Via arXiv

How to Evaluate Reward Models for RLHF

Oct 18, 2024
Via arXiv

Data-Adaptive Tradeoffs among Multiple Risks in Distribution-Free Prediction

Mar 28, 2024
Via arXiv

AutoEval Done Right: Using Synthetic Data for Model Evaluation

Mar 09, 2024
Via arXiv

Wavefront Randomization Improves Deconvolution

Feb 13, 2024
Via arXiv