Sanmi Koyejo

Stanford University

CURE: Cultural Understanding and Reasoning Evaluation - A Framework for "Thick" Culture Alignment Evaluation in LLMs

Nov 15, 2025
Via arXiv

Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations

Nov 06, 2025
Via arXiv

Understanding Adversarial Transfer: Why Representation-Space Attacks Fail Where Data-Space Attacks Succeed

Oct 01, 2025
Via arXiv

UQ: Assessing Language Models on Unsolved Questions

Aug 25, 2025
Via arXiv

Algorithmic Fairness amid Social Determinants: Reflection, Characterization, and Approach

Aug 10, 2025
Via arXiv

Let's Measure Information Step-by-Step: LLM-Based Evaluation Beyond Vibes

Aug 07, 2025
Via arXiv

From Passive to Active Reasoning: Can Large Language Models Ask the Right Questions under Incomplete Information?

Jun 09, 2025
Via arXiv

Certified Unlearning for Neural Networks

Jun 08, 2025
Via arXiv

The Optimization Paradox in Clinical AI Multi-Agent Systems

Jun 06, 2025
Via arXiv

Understanding challenges to the interpretation of disaggregated evaluations of algorithmic fairness

Jun 04, 2025
Via arXiv