Picture for Yao Zhang

Yao Zhang

Shanghai AI Laboratory, China

RoboSemanticBench: Diagnosing Semantic Grounding in Action Prediction for VLA Models

Add code
Jun 01, 2026
Viaarxiv icon

ProactiveLLM: Learning Active Interaction for Streaming Large Language Models

Add code
May 30, 2026
Viaarxiv icon

Multi-Fidelity Quantile Regression

Add code
May 11, 2026
Viaarxiv icon

Learning Generalizable Multimodal Representations for Software Vulnerability Detection

Add code
Apr 28, 2026
Viaarxiv icon

Encoder-Free Human Motion Understanding via Structured Motion Descriptions

Add code
Apr 23, 2026
Viaarxiv icon

FAVE: Flow-based Average Velocity Establishment for Sequential Recommendation

Add code
Apr 06, 2026
Viaarxiv icon

Benchmarking and Mechanistic Analysis of Vision-Language Models for Cross-Depiction Assembly Instruction Alignment

Add code
Apr 01, 2026
Viaarxiv icon

LingoMotion: An Interpretable and Unambiguous Symbolic Representation for Human Motion

Add code
Mar 13, 2026
Viaarxiv icon

NanoVDR: Distilling a 2B Vision-Language Retriever into a 70M Text-Only Encoder for Visual Document Retrieval

Add code
Mar 13, 2026
Viaarxiv icon

Fine-grained Motion Retrieval via Joint-Angle Motion Images and Token-Patch Late Interaction

Add code
Mar 10, 2026
Viaarxiv icon