Kun Zhao

for the Alzheimer's Disease Neuroimaging Initiative

Listen, Look, Drive: Coupling Audio Instructions for User-aware VLA-based Autonomous Driving

Jan 17, 2026
Via arXiv

FigEx2: Visual-Conditioned Panel Detection and Captioning for Scientific Compound Figures

Jan 12, 2026
Via arXiv

Aligning Findings with Diagnosis: A Self-Consistent Reinforcement Learning Framework for Trustworthy Radiology Reporting

Jan 06, 2026
Via arXiv

R-GenIMA: Integrating Neuroimaging and Genetics with Interpretable Multimodal AI for Alzheimer's Disease Progression

Dec 22, 2025
Via arXiv

Why Text Prevails: Vision May Undermine Multimodal Medical Decision Making

Dec 15, 2025
Via arXiv

DRE: An Effective Dual-Refined Method for Integrating Small and Large Language Models in Open-Domain Dialogue Evaluation

Jun 04, 2025
Via arXiv

LMFormer: Lane based Motion Prediction Transformer

Apr 14, 2025
Via arXiv

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

Apr 07, 2025
Via arXiv

One Snapshot is All You Need: A Generalized Method for mmWave Signal Generation

Mar 27, 2025
Via arXiv

GRAPHMOE: Amplifying Cognitive Depth of Mixture-of-Experts Network via Introducing Self-Rethinking Mechanism

Jan 14, 2025
Via arXiv