Picture for Sanmi Koyejo

Sanmi Koyejo

Stanford University

SCENEBench: An Audio Understanding Benchmark Grounded in Assistive and Industrial Use Cases

Add code
Mar 10, 2026
Viaarxiv icon

When AI Benchmarks Plateau: A Systematic Study of Benchmark Saturation

Add code
Feb 18, 2026
Viaarxiv icon

Discovering Implicit Large Language Model Alignment Objectives

Add code
Feb 17, 2026
Viaarxiv icon

ALMo: Interactive Aim-Limit-Defined, Multi-Objective System for Personalized High-Dose-Rate Brachytherapy Treatment Planning and Visualization for Cervical Cancer

Add code
Feb 14, 2026
Viaarxiv icon

Attention Head Entropy of LLMs Predicts Answer Correctness

Add code
Feb 14, 2026
Viaarxiv icon

Latent Adversarial Regularization for Offline Preference Optimization

Add code
Jan 29, 2026
Viaarxiv icon

Neural Nonmyopic Bayesian Optimization in Dynamic Cost Settings

Add code
Jan 10, 2026
Viaarxiv icon

Quantifying the Effect of Test Set Contamination on Generative Evaluations

Add code
Jan 07, 2026
Viaarxiv icon

Extracting books from production language models

Add code
Jan 06, 2026
Viaarxiv icon

End-to-End Test-Time Training for Long Context

Add code
Dec 31, 2025
Viaarxiv icon