Picture for Xiaoyu Chen

Xiaoyu Chen

UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?

Add code
Mar 03, 2026
Viaarxiv icon

BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation

Add code
Feb 11, 2026
Viaarxiv icon

VideoAfford: Grounding 3D Affordance from Human-Object-Interaction Videos via Multimodal Large Language Model

Add code
Feb 10, 2026
Viaarxiv icon

VLM4VLA: Revisiting Vision-Language-Models in Vision-Language-Action Models

Add code
Jan 06, 2026
Viaarxiv icon

How Do VLAs Effectively Inherit from VLMs?

Add code
Nov 10, 2025
Viaarxiv icon

Align-then-Slide: A complete evaluation framework for Ultra-Long Document-Level Machine Translation

Add code
Sep 04, 2025
Figure 1 for Align-then-Slide: A complete evaluation framework for Ultra-Long Document-Level Machine Translation
Figure 2 for Align-then-Slide: A complete evaluation framework for Ultra-Long Document-Level Machine Translation
Figure 3 for Align-then-Slide: A complete evaluation framework for Ultra-Long Document-Level Machine Translation
Figure 4 for Align-then-Slide: A complete evaluation framework for Ultra-Long Document-Level Machine Translation
Viaarxiv icon

Generative Annotation for ASR Named Entity Correction

Add code
Aug 28, 2025
Figure 1 for Generative Annotation for ASR Named Entity Correction
Figure 2 for Generative Annotation for ASR Named Entity Correction
Figure 3 for Generative Annotation for ASR Named Entity Correction
Figure 4 for Generative Annotation for ASR Named Entity Correction
Viaarxiv icon

RubikSQL: Lifelong Learning Agentic Knowledge Base as an Industrial NL2SQL System

Add code
Aug 25, 2025
Figure 1 for RubikSQL: Lifelong Learning Agentic Knowledge Base as an Industrial NL2SQL System
Figure 2 for RubikSQL: Lifelong Learning Agentic Knowledge Base as an Industrial NL2SQL System
Figure 3 for RubikSQL: Lifelong Learning Agentic Knowledge Base as an Industrial NL2SQL System
Figure 4 for RubikSQL: Lifelong Learning Agentic Knowledge Base as an Industrial NL2SQL System
Viaarxiv icon

Large Foundation Model for Ads Recommendation

Add code
Aug 20, 2025
Figure 1 for Large Foundation Model for Ads Recommendation
Figure 2 for Large Foundation Model for Ads Recommendation
Figure 3 for Large Foundation Model for Ads Recommendation
Figure 4 for Large Foundation Model for Ads Recommendation
Viaarxiv icon

SSEmb: A Joint Structural and Semantic Embedding Framework for Mathematical Formula Retrieval

Add code
Aug 06, 2025
Viaarxiv icon