Picture for Chenhao Tan

Chenhao Tan

From Feedback to Checklists: Grounded Evaluation of AI-Generated Clinical Notes

Add code
Jul 23, 2025
Viaarxiv icon

AbsenceBench: Language Models Can't Tell What's Missing

Add code
Jun 13, 2025
Viaarxiv icon

The Curious Language Model: Strategic Test-Time Information Acquisition

Add code
Jun 10, 2025
Viaarxiv icon

CLEAR: A Clinically-Grounded Tabular Framework for Radiology Report Evaluation

Add code
May 22, 2025
Viaarxiv icon

Concept Incongruence: An Exploration of Time and Death in Role Playing

Add code
May 20, 2025
Viaarxiv icon

HyPerAlign: Hypotheses-driven Personalized Alignment

Add code
Apr 29, 2025
Viaarxiv icon

HypoBench: Towards Systematic and Principled Benchmarking for Hypothesis Generation

Add code
Apr 15, 2025
Viaarxiv icon

HypoEval: Hypothesis-Guided Evaluation for Natural Language Generation

Add code
Apr 09, 2025
Viaarxiv icon

On the Effectiveness and Generalization of Race Representations for Debiasing High-Stakes Decisions

Add code
Apr 07, 2025
Viaarxiv icon

CaseSumm: A Large-Scale Dataset for Long-Context Summarization from U.S. Supreme Court Opinions

Add code
Dec 30, 2024
Viaarxiv icon