Yanyong Zhang

Learning Surgical Robotic Manipulation with 3D Spatial Priors

Mar 04, 2026

End-to-End Simultaneous Dysarthric Speech Reconstruction with Frame-Level Adaptor and Multiple Wait-k Knowledge Distillation

Mar 02, 2026

DARS: Dysarthria-Aware Rhythm-Style Synthesis for ASR Enhancement

Mar 02, 2026

DreamWorld: Unified World Modeling in Video Generation

Feb 28, 2026

FAVLA: A Force-Adaptive Fast-Slow VLA model for Contact-Rich Robotic Manipulation

Feb 27, 2026

SoPE: Spherical Coordinate-Based Positional Embedding for Enhancing Spatial Perception of 3D LVLMs

Feb 26, 2026

DAGS-SLAM: Dynamic-Aware 3DGS SLAM via Spatiotemporal Motion Probability and Uncertainty-Aware Scheduling

Feb 25, 2026

C^2ROPE: Causal Continuous Rotary Positional Encoding for 3D Large Multimodal-Models Reasoning

Feb 16, 2026

ConsisDrive: Identity-Preserving Driving World Models for Video Generation by Instance Mask

Feb 03, 2026

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

Feb 03, 2026