Chao Zhang

refer to the report for detailed contributions

video-SALMONN-R$^3$: Learning to ReWatch, ReAsk, and ReAnswer for Efficient Video Understanding

Jun 23, 2026

Mem-World: Memory-Augmented Action-Conditioned World Models for Persistent Robot Manipulation

Jun 18, 2026

Route-Constrained Robust Fusion Estimation for MEMS/GNSS Integrated Navigation of Unmanned Ground Vehicles in GNSS Degraded Environments

Jun 18, 2026

Co-VLA: Coordination-Aware Structured Action Modeling for Dual-Arm Vision-Language-Action Systems

Jun 18, 2026

Augmenting Dysarthric Speech Severity Assessment with MOS Supervision

Jun 17, 2026

Wasserstein Convergence of ODE-Based Samplers in Decentralized Diffusion Model via Velocity Field Decomposition

Jun 14, 2026

Ling and Ring 2.6 Technical Report: Efficient and Instant Agentic Intelligence at Trillion-Parameter Scale

Jun 13, 2026

Towards Diverse Scientific Hypothesis Search with Large Language Models

Jun 09, 2026

TRL-Bench: Standardizing Cross-Paradigm Representation-Level Evaluation of Tabular Encoders

Jun 08, 2026

MotionEnhancer: Leveraging Video Diffusion for Motion-Enhanced Vision-Language Models

Jun 05, 2026