Shuang Chen

Towards On-Policy Data Evolution for Visual-Native Multimodal Deep Search Agents

May 11, 2026
Via arXiv

4DThinker: Thinking with 4D Imagery for Dynamic Spatial Understanding

May 07, 2026
Via arXiv

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

May 06, 2026
Via arXiv

Diffusion Model as a Generalist Segmentation Learner

Apr 27, 2026
Via arXiv

Motion-Adaptive Multi-Scale Temporal Modelling with Skeleton-Constrained Spatial Graphs for Efficient 3D Human Pose Estimation

Apr 04, 2026
Via arXiv

Agentic-MME: What Agentic Capability Really Brings to Multimodal Intelligence?

Apr 03, 2026
Via arXiv

Revealing the Learning Dynamics of Long-Context Continual Pre-training

Apr 03, 2026
Via arXiv

Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis

Apr 01, 2026
Via arXiv

Gen-Searcher: Reinforcing Agentic Search for Image Generation

Mar 30, 2026
Via arXiv

UniMotion: A Unified Framework for Motion-Text-Vision Understanding and Generation

Mar 23, 2026
Via arXiv