Yutong Hu

Assistron: Bayesian Shared Autonomy with Off-the-shelf Vision-Language-Action Models

Jun 22, 2026
Via arXiv

MAPS: Multi-Anchor Projection Similarity for Joint Vision-Language Geo-Localization

Jun 21, 2026
Via arXiv

FF-JEPA: Long-Horizon Planning in World Models with Latent Planners

Jun 08, 2026
Via arXiv

ELVIS: Ensemble-Calibrated Latent Imagination for Long-Horizon Visual MPC

May 06, 2026
Via arXiv

AR-VLA: True Autoregressive Action Expert for Vision-Language-Action Models

Mar 10, 2026
Via arXiv

Global Cross-Modal Geo-Localization: A Million-Scale Dataset and a Physical Consistency Learning Framework

Mar 09, 2026
Via arXiv

Train a Multi-Task Diffusion Policy on RLBench-18 in One Day with One GPU

May 14, 2025
Via arXiv

M$^3$PC: Test-time Model Predictive Control for Pretrained Masked Trajectory Model

Dec 07, 2024
Via arXiv

Only One Relation Possible? Modeling the Ambiguity in Event Temporal Relation Extraction

Aug 14, 2024
Via arXiv

ELLA: Empowering LLMs for Interpretable, Accurate and Informative Legal Advice

Aug 13, 2024
Via arXiv