Yucheng Hu

BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation

Feb 11, 2026
Via arXiv

DyTopo: Dynamic Topology Routing for Multi-Agent Reasoning via Semantic Matching

Feb 05, 2026
Via arXiv

CLM-Bench: Benchmarking and Analyzing Cross-lingual Misalignment of LLMs in Knowledge Editing

Jan 24, 2026
Via arXiv

VLM4VLA: Revisiting Vision-Language-Models in Vision-Language-Action Models

Jan 06, 2026
Via arXiv

Rethinking Agent Design: From Top-Down Workflows to Bottom-Up Skill Evolution

May 23, 2025
Via arXiv

UP-VLA: A Unified Understanding and Prediction Model for Embodied Agent

Jan 31, 2025
Via arXiv

Improving Vision-Language-Action Model with Online Reinforcement Learning

Jan 28, 2025
Via arXiv

Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations

Dec 19, 2024
Via arXiv

Prediction with Action: Visual Policy Learning via Joint Denoising Process

Nov 27, 2024
Via arXiv

RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language Processing

Apr 30, 2024
Via arXiv