Picture for Zuozhu Liu

Zuozhu Liu

How Far Are Video Models from True Multimodal Reasoning?

Add code
Apr 21, 2026
Viaarxiv icon

PBE-UNet: A light weight Progressive Boundary-Enhanced U-Net with Scale-Aware Aggregation for Ultrasound Image Segmentation

Add code
Apr 15, 2026
Viaarxiv icon

Scaling Video Pretraining for Surgical Foundation Models

Add code
Apr 02, 2026
Viaarxiv icon

HICT: High-precision 3D CBCT reconstruction from a single X-ray

Add code
Apr 01, 2026
Viaarxiv icon

Optimsyn: Influence-Guided Rubrics Optimization for Synthetic Data Generation

Add code
Apr 01, 2026
Viaarxiv icon

AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents

Add code
Mar 29, 2026
Viaarxiv icon

Learning Transferable Temporal Primitives for Video Reasoning via Synthetic Videos

Add code
Mar 18, 2026
Viaarxiv icon

IOSVLM: A 3D Vision-Language Model for Unified Dental Diagnosis from Intraoral Scans

Add code
Mar 17, 2026
Viaarxiv icon

Bridging the Skill Gap in Clinical CBCT Interpretation with CBCTRepD

Add code
Mar 11, 2026
Viaarxiv icon

GCAgent: Enhancing Group Chat Communication through Dialogue Agents System

Add code
Mar 05, 2026
Viaarxiv icon