Xiaoyu Chen

TableVision: A Large-Scale Benchmark for Spatially Grounded Reasoning over Complex Hierarchical Tables

Apr 04, 2026
Via arXiv

Learning Additively Compositional Latent Actions for Embodied AI

Apr 03, 2026
Via arXiv

UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?

Mar 03, 2026
Via arXiv

BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation

Feb 11, 2026
Via arXiv

VideoAfford: Grounding 3D Affordance from Human-Object-Interaction Videos via Multimodal Large Language Model

Feb 10, 2026
Via arXiv

VLM4VLA: Revisiting Vision-Language-Models in Vision-Language-Action Models

Jan 06, 2026
Via arXiv

How Do VLAs Effectively Inherit from VLMs?

Nov 10, 2025
Via arXiv

Align-then-Slide: A complete evaluation framework for Ultra-Long Document-Level Machine Translation

Sep 04, 2025
Via arXiv

Generative Annotation for ASR Named Entity Correction

Aug 28, 2025
Via arXiv

RubikSQL: Lifelong Learning Agentic Knowledge Base as an Industrial NL2SQL System

Aug 25, 2025
Via arXiv