Shuang Zhou

Alphabetical order by last name

General365: Benchmarking General Reasoning in Large Language Models Across Diverse and Challenging Tasks

Apr 13, 2026
Via arXiv

PRIME: Prototype-Driven Multimodal Pretraining for Cancer Prognosis with Missing Modalities

Apr 05, 2026
Via arXiv

EpiScreen: Early Epilepsy Detection from Electronic Health Records with Large Language Models

Mar 30, 2026
Via arXiv

MedCL-Bench: Benchmarking Stability-Efficiency Trade-offs and Scaling in Biomedical Continual Learning

Mar 17, 2026
Via arXiv

HeartAgent: An Autonomous Agent System for Explainable Differential Diagnosis in Cardiology

Mar 11, 2026
Via arXiv

To Reason or Not to: Selective Chain-of-Thought in Medical Question Answering

Feb 23, 2026
Via arXiv

LongCat-Flash-Thinking-2601 Technical Report

Jan 23, 2026
Via arXiv

MeCaMIL: Causality-Aware Multiple Instance Learning for Fair and Interpretable Whole Slide Image Diagnosis

Nov 14, 2025
Via arXiv

AMO-Bench: Large Language Models Still Struggle in High School Math Competitions

Oct 30, 2025
Via arXiv

Automating Expert-Level Medical Reasoning Evaluation of Large Language Models

Jul 10, 2025
Via arXiv