Zhe Hu

A Visually Impaired Assistance Benchmark for VLM-as-a-Judge Evaluation

May 29, 2026, via arXiv

RoHIL: Robust Human-in-the-Loop Robotic Reinforcement Learning Against Illumination Variations

May 19, 2026, via arXiv

An evaluation framework for sparse 4D (3D + time) imaging reconstruction via bootstrapped cross-validation

May 18, 2026, via arXiv

Overcoming Dynamics-Blindness: Training-Free Pace-and-Path Correction for VLA Models

May 14, 2026, via arXiv

Goal2Skill: Long-Horizon Manipulation with Adaptive Planning and Reflection

Apr 15, 2026, via arXiv

4D reconstruction of alumina laser melt pools at 25 kHz via operando X-ray multi-projection imaging

Mar 15, 2026, via arXiv

Guided by the Plan: Enhancing Faithful Autoregressive Text-to-Audio Generation with Guided Decoding

Jan 18, 2026, via arXiv

Exploring Scale Shift in Crowd Localization under the Context of Domain Generalization

Oct 22, 2025, via arXiv

Language Model Based Text-to-Audio Generation: Anti-Causally Aligned Collaborative Residual Transformers

Oct 06, 2025, via arXiv

Physics-informed 4D X-ray image reconstruction from ultra-sparse spatiotemporal data

Apr 04, 2025, via arXiv