MedQA USMLE


Measuring Black-Box Confidence via Reasoning Trajectories: Geometry, Coverage, and Verbalization

May 07, 2026

Domain Fine-Tuning vs. Retrieval-Augmented Generation for Medical Multiple-Choice Question Answering: A Controlled Comparison at the 4B-Parameter Scale

Apr 26, 2026

A Systematic Study of Retrieval Pipeline Design for Retrieval-Augmented Medical Question Answering

Apr 08, 2026

Multi-Agent Reasoning with Consistency Verification Improves Uncertainty Calibration in Medical MCQA

Mar 25, 2026

To Reason or Not to: Selective Chain-of-Thought in Medical Question Answering

Feb 23, 2026

Capabilities of GPT-5 on Multimodal Medical Reasoning

Aug 13, 2025

WiNGPT-3.0 Technical Report

May 23, 2025

Disentangling Reasoning and Knowledge in Medical Large Language Models

May 16, 2025

Open-Medical-R1: How to Choose Data for RLVR Training at Medicine Domain

Apr 16, 2025

AfriMed-QA: A Pan-African, Multi-Specialty, Medical Question-Answering Benchmark Dataset

Nov 23, 2024