Picture for Xin Chen

Xin Chen

Univ. California, Santa Barbara

Seedance 2.0: Advancing Video Generation for World Complexity

Add code
Apr 15, 2026
Viaarxiv icon

Search-MIND: Training-Free Multi-Modal Medical Image Registration

Add code
Apr 10, 2026
Viaarxiv icon

OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks

Add code
Apr 09, 2026
Viaarxiv icon

MARS-Dragonfly: Agile and Robust Flight Control of Modular Aerial Robot Systems

Add code
Apr 07, 2026
Viaarxiv icon

NS-RGS: Newton-Schulz based Riemannian gradient method for orthogonal group synchronization

Add code
Apr 07, 2026
Viaarxiv icon

Q-Mask: Query-driven Causal Masks for Text Anchoring in OCR-Oriented Vision-Language Models

Add code
Mar 31, 2026
Viaarxiv icon

Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development

Add code
Mar 29, 2026
Viaarxiv icon

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Add code
Mar 26, 2026
Viaarxiv icon

Neuron-Aware Data Selection In Instruction Tuning For Large Language Models

Add code
Mar 13, 2026
Viaarxiv icon

Fish Audio S2 Technical Report

Add code
Mar 11, 2026
Viaarxiv icon