Hong Zhang

GraphCoT-VLA: A 3D Spatial-Aware Reasoning Vision-Language-Action Model for Robotic Manipulation with Ambiguous Instructions

Aug 11, 2025 (via arXiv)

Propagating Sparse Depth via Depth Foundation Model for Out-of-Distribution Depth Completion

Aug 07, 2025 (via arXiv)

JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction

Jul 23, 2025 (via arXiv)

TrackingMiM: Efficient Mamba-in-Mamba Serialization for Real-time UAV Object Tracking

Jul 02, 2025 (via arXiv)

AI Assistants to Enhance and Exploit the PETSc Knowledge Base

Jun 25, 2025 (via arXiv)

SuperPlace: The Renaissance of Classical Feature Aggregation for Visual Place Recognition in the Era of Foundation Models

Jun 16, 2025 (via arXiv)

EmbodiedPlace: Learning Mixture-of-Features with Embodied Constraints for Visual Place Recognition

Jun 16, 2025 (via arXiv)

MT-PCR: A Hybrid Mamba-Transformer with Spatial Serialization for Hierarchical Point Cloud Registration

Jun 16, 2025 (via arXiv)

TPT-Bench: A Large-Scale, Long-Term and Robot-Egocentric Dataset for Benchmarking Target Person Tracking

May 12, 2025 (via arXiv)

Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing

Apr 30, 2025 (via arXiv)