Picture for Zhongyu Wei

Zhongyu Wei

Fudan University

Doc-V*:Coarse-to-Fine Interactive Visual Reasoning for Multi-Page Document VQA

Add code
Apr 15, 2026
Viaarxiv icon

MedRCube: A Multidimensional Framework for Fine-Grained and In-Depth Evaluation of MLLMs in Medical Imaging

Add code
Apr 15, 2026
Viaarxiv icon

SpatialAnt: Autonomous Zero-Shot Robot Navigation via Active Scene Reconstruction and Visual Anticipation

Add code
Mar 27, 2026
Viaarxiv icon

LifeSim: Long-Horizon User Life Simulator for Personalized Assistant Evaluation

Add code
Mar 12, 2026
Viaarxiv icon

Rethinking the Efficiency and Effectiveness of Reinforcement Learning for Radiology Report Generation

Add code
Mar 04, 2026
Viaarxiv icon

ExpertWeaver: Unlocking the Inherent MoE in Dense LLMs with GLU Activation Patterns

Add code
Feb 17, 2026
Viaarxiv icon

DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing

Add code
Feb 13, 2026
Viaarxiv icon

Towards a Science of Collective AI: LLM-based Multi-Agent Systems Need a Transition from Blind Trial-and-Error to Rigorous Science

Add code
Feb 05, 2026
Viaarxiv icon

UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing

Add code
Feb 02, 2026
Viaarxiv icon

CURP: Codebook-based Continuous User Representation for Personalized Generation with LLMs

Add code
Jan 31, 2026
Viaarxiv icon