Picture for Edward Choi

Edward Choi

K-FinHallu: A Hallucination Detection Benchmark for Multi-Turn RAG in Korean Finance

Add code
May 28, 2026
Viaarxiv icon

Towards Error-Free EHRs: Reasoning-Intensive Consistency Verification Between Clinical Notes and Structured Tables in Electronic Health Records

Add code
May 26, 2026
Viaarxiv icon

On the limits and opportunities of AI reviewers: Reviewing the reviews of Nature-family papers with 45 expert scientists

Add code
May 20, 2026
Viaarxiv icon

PICon: A Multi-Turn Interrogation Framework for Evaluating Persona Agent Consistency

Add code
Mar 26, 2026
Viaarxiv icon

ECG-Reasoning-Benchmark: A Benchmark for Evaluating Clinical Reasoning Capabilities in ECG Interpretation

Add code
Mar 15, 2026
Viaarxiv icon

CXReasonAgent: Evidence-Grounded Diagnostic Reasoning Agent for Chest X-rays

Add code
Feb 26, 2026
Viaarxiv icon

KorMedMCQA-V: A Multimodal Benchmark for Evaluating Vision-Language Models on the Korean Medical Licensing Examination

Add code
Feb 14, 2026
Viaarxiv icon

H-AdminSim: A Multi-Agent Simulator for Realistic Hospital Administrative Workflows with FHIR Integration

Add code
Feb 05, 2026
Viaarxiv icon

ECG-Agent: On-Device Tool-Calling Agent for ECG Multi-Turn Dialogue

Add code
Jan 28, 2026
Viaarxiv icon

Instruction-Guided Lesion Segmentation for Chest X-rays with Automatically Generated Large-Scale Dataset

Add code
Nov 19, 2025
Viaarxiv icon