Picture for Zongyuan Ge

Zongyuan Ge

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers

Add code
Aug 28, 2025
Viaarxiv icon

Controllable Skin Synthesis via Lesion-Focused Vector Autoregression Model

Add code
Aug 27, 2025
Viaarxiv icon

RationalVLA: A Rational Vision-Language-Action Model with Dual System

Add code
Jun 12, 2025
Viaarxiv icon

APTOS-2024 challenge report: Generation of synthetic 3D OCT images from fundus photographs

Add code
Jun 09, 2025
Viaarxiv icon

Towards Dynamic 3D Reconstruction of Hand-Instrument Interaction in Ophthalmic Surgery

Add code
May 23, 2025
Viaarxiv icon

Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding

Add code
May 22, 2025
Viaarxiv icon

MAKE: Multi-Aspect Knowledge-Enhanced Vision-Language Pretraining for Zero-shot Dermatological Assessment

Add code
May 14, 2025
Viaarxiv icon

Ophora: A Large-Scale Data-Driven Text-Guided Ophthalmic Surgical Video Generation Model

Add code
May 13, 2025
Viaarxiv icon

Enhancing Fundus Image-based Glaucoma Screening via Dynamic Global-Local Feature Integration

Add code
Apr 01, 2025
Viaarxiv icon

ScalingNoise: Scaling Inference-Time Search for Generating Infinite Videos

Add code
Mar 20, 2025
Viaarxiv icon