Emily Alsentzer

Massachusetts Institute of Technology, Harvard Medical School

Clinician input steers frontier AI models toward both accurate and harmful decisions

Mar 14, 2026
Via arXiv

AI-generated data contamination erodes pathological variability and diagnostic reliability

Jan 21, 2026
Via arXiv

Large Language Models for Large-Scale, Rigorous Qualitative Analysis in Applied Health Services Research

Jan 20, 2026
Via arXiv

Training-Free Adaptation of New-Generation LLMs using Legacy Clinical Models

Jan 06, 2026
Via arXiv

Monitoring Deployed AI Systems in Health Care

Dec 09, 2025
Via arXiv

Retrieval-Augmented Guardrails for AI-Drafted Patient-Portal Messages: Error Taxonomy Construction and Large-Scale Evaluation

Sep 26, 2025
Via arXiv

MedHELM: Holistic Evaluation of Large Language Models for Medical Tasks

May 26, 2025
Via arXiv

BRIDGE: Benchmarking Large Language Models for Understanding Real-world Clinical Practice Text

May 01, 2025
Via arXiv

TIMER: Temporal Instruction Modeling and Evaluation for Longitudinal Clinical Records

Mar 06, 2025
Via arXiv

Identifying Reasons for Contraceptive Switching from Real-World Data Using Large Language Models

Feb 06, 2024
Via arXiv