Picture for Sanmi Koyejo

Sanmi Koyejo

Stanford University

Discovering Implicit Large Language Model Alignment Objectives

Add code
Feb 17, 2026
Viaarxiv icon

Attention Head Entropy of LLMs Predicts Answer Correctness

Add code
Feb 14, 2026
Viaarxiv icon

ALMo: Interactive Aim-Limit-Defined, Multi-Objective System for Personalized High-Dose-Rate Brachytherapy Treatment Planning and Visualization for Cervical Cancer

Add code
Feb 14, 2026
Viaarxiv icon

Latent Adversarial Regularization for Offline Preference Optimization

Add code
Jan 29, 2026
Viaarxiv icon

Neural Nonmyopic Bayesian Optimization in Dynamic Cost Settings

Add code
Jan 10, 2026
Viaarxiv icon

Quantifying the Effect of Test Set Contamination on Generative Evaluations

Add code
Jan 07, 2026
Viaarxiv icon

Extracting books from production language models

Add code
Jan 06, 2026
Viaarxiv icon

End-to-End Test-Time Training for Long Context

Add code
Dec 31, 2025
Viaarxiv icon

CURE: Cultural Understanding and Reasoning Evaluation - A Framework for "Thick" Culture Alignment Evaluation in LLMs

Add code
Nov 15, 2025
Viaarxiv icon

Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations

Add code
Nov 06, 2025
Figure 1 for Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations
Figure 2 for Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations
Figure 3 for Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations
Figure 4 for Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations
Viaarxiv icon