Picture for Zongyuan Ge

Zongyuan Ge

LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence

Add code
May 25, 2026
Viaarxiv icon

Beyond Binary Success: A Diagnostic Meta-Evaluation Framework for Fine-Grained Manipulation

Add code
May 19, 2026
Viaarxiv icon

DermAgent: A Self-Reflective Agentic System for Dermatological Image Analysis with Multi-Tool Reasoning and Traceable Decision-Making

Add code
May 14, 2026
Viaarxiv icon

Fundus Image-based Glaucoma Screening via Retinal Knowledge-Oriented Dynamic Multi-Level Feature Integration

Add code
Apr 14, 2026
Viaarxiv icon

Cognitive Pivot Points and Visual Anchoring: Unveiling and Rectifying Hallucinations in Multimodal Reasoning Models

Add code
Apr 11, 2026
Viaarxiv icon

Do No Harm: Exposing Hidden Vulnerabilities of LLMs via Persona-based Client Simulation Attack in Psychological Counseling

Add code
Apr 06, 2026
Viaarxiv icon

MuSEAgent: A Multimodal Reasoning Agent with Stateful Experiences

Add code
Mar 29, 2026
Viaarxiv icon

Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development

Add code
Mar 29, 2026
Viaarxiv icon

Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding

Add code
Mar 09, 2026
Viaarxiv icon

OPGAgent: An Agent for Auditable Dental Panoramic X-ray Interpretation

Add code
Feb 28, 2026
Viaarxiv icon