Picture for Xiao Chen

Xiao Chen

Imagine2Real: Towards Zero-shot Humanoid-Object Interaction via Video Generative Priors

Add code
May 21, 2026
Viaarxiv icon

CCD-Level and Load-Aware Thread Orchestration for In-Memory Vector ANNS on Multi-Core CPUs

Add code
May 11, 2026
Viaarxiv icon

Scaling the Long Video Understanding of Multimodal Large Language Models via Visual Memory Mechanism

Add code
Mar 31, 2026
Viaarxiv icon

GeoTikzBridge: Advancing Multimodal Code Generation for Geometric Perception and Reasoning

Add code
Mar 24, 2026
Viaarxiv icon

Speech-Omni-Lite: Portable Speech Interfaces for Vision-Language Models

Add code
Mar 10, 2026
Viaarxiv icon

RADAR: Benchmarking Vision-Language-Action Generalization via Real-World Dynamics, Spatial-Physical Intelligence, and Autonomous Evaluation

Add code
Feb 11, 2026
Viaarxiv icon

FedAdaVR: Adaptive Variance Reduction for Robust Federated Learning under Limited Client Participation

Add code
Jan 29, 2026
Viaarxiv icon

PROST-LLM: Progressively Enhancing the Speech-to-Speech Translation Capability in LLMs

Add code
Jan 23, 2026
Viaarxiv icon

DSA-Tokenizer: Disentangled Semantic-Acoustic Tokenization via Flow Matching-based Hierarchical Fusion

Add code
Jan 15, 2026
Viaarxiv icon

LLHA-Net: A Hierarchical Attention Network for Two-View Correspondence Learning

Add code
Dec 31, 2025
Viaarxiv icon