Picture for Zongyuan Ge

Zongyuan Ge

Fundus Image-based Glaucoma Screening via Retinal Knowledge-Oriented Dynamic Multi-Level Feature Integration

Add code
Apr 14, 2026
Viaarxiv icon

Cognitive Pivot Points and Visual Anchoring: Unveiling and Rectifying Hallucinations in Multimodal Reasoning Models

Add code
Apr 11, 2026
Viaarxiv icon

Do No Harm: Exposing Hidden Vulnerabilities of LLMs via Persona-based Client Simulation Attack in Psychological Counseling

Add code
Apr 06, 2026
Viaarxiv icon

Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development

Add code
Mar 29, 2026
Viaarxiv icon

MuSEAgent: A Multimodal Reasoning Agent with Stateful Experiences

Add code
Mar 29, 2026
Viaarxiv icon

Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding

Add code
Mar 09, 2026
Viaarxiv icon

OPGAgent: An Agent for Auditable Dental Panoramic X-ray Interpretation

Add code
Feb 28, 2026
Viaarxiv icon

LATA: Laplacian-Assisted Transductive Adaptation for Conformal Uncertainty in Medical VLMs

Add code
Feb 19, 2026
Viaarxiv icon

A Vision-Language Foundation Model for Zero-shot Clinical Collaboration and Automated Concept Discovery in Dermatology

Add code
Feb 11, 2026
Viaarxiv icon

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Add code
Feb 09, 2026
Viaarxiv icon