Zhihao Zhan

TAG: Target-Agnostic Guidance for Stable Object-Centric Inference in Vision-Language-Action Models

Mar 25, 2026

On the Derivation of Tightly-Coupled LiDAR-Inertial Odometry with VoxelMap

Mar 16, 2026

AtomicVLA: Unlocking the Potential of Atomic Skill Learning in Robots

Mar 08, 2026

RADAR: Benchmarking Vision-Language-Action Generalization via Real-World Dynamics, Spatial-Physical Intelligence, and Autonomous Evaluation

Feb 11, 2026

Thermal odometry and dense mapping using learned odometry and Gaussian splatting

Feb 07, 2026

DC-VLAQ: Query-Residual Aggregation for Robust Visual Place Recognition

Jan 19, 2026

Stable Language Guidance for Vision-Language-Action Models

Jan 07, 2026

High-Quality Spatial Reconstruction and Orthoimage Generation Using Efficient 2D Gaussian Splatting

Mar 25, 2025

Video Super-Resolution: All You Need is a Video Diffusion Model

Mar 05, 2025

A Multi-Sensor Fusion Approach for Rapid Orthoimage Generation in Large-Scale UAV Mapping

Mar 05, 2025