Picture for Jiazhao Zhang

Jiazhao Zhang

RemixFusion: Residual-based Mixed Representation for Large-scale Online RGB-D Reconstruction

Add code
Jul 23, 2025
Viaarxiv icon

BoxFusion: Reconstruction-Free Open-Vocabulary 3D Object Detection via Real-Time Multi-View Box Fusion

Add code
Jun 18, 2025
Viaarxiv icon

OctoNav: Towards Generalist Embodied Navigation

Add code
Jun 11, 2025
Viaarxiv icon

TrackVLA: Embodied Visual Tracking in the Wild

Add code
May 29, 2025
Viaarxiv icon

RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning

Add code
Apr 26, 2025
Viaarxiv icon

OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging

Add code
Mar 03, 2025
Viaarxiv icon

SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation

Add code
Feb 18, 2025
Viaarxiv icon

Neural Observation Field Guided Hybrid Optimization of Camera Placement

Add code
Dec 11, 2024
Figure 1 for Neural Observation Field Guided Hybrid Optimization of Camera Placement
Figure 2 for Neural Observation Field Guided Hybrid Optimization of Camera Placement
Figure 3 for Neural Observation Field Guided Hybrid Optimization of Camera Placement
Figure 4 for Neural Observation Field Guided Hybrid Optimization of Camera Placement
Viaarxiv icon

CogNav: Cognitive Process Modeling for Object Goal Navigation with LLMs

Add code
Dec 11, 2024
Figure 1 for CogNav: Cognitive Process Modeling for Object Goal Navigation with LLMs
Figure 2 for CogNav: Cognitive Process Modeling for Object Goal Navigation with LLMs
Figure 3 for CogNav: Cognitive Process Modeling for Object Goal Navigation with LLMs
Figure 4 for CogNav: Cognitive Process Modeling for Object Goal Navigation with LLMs
Viaarxiv icon

Uni-NaVid: A Video-based Vision-Language-Action Model for Unifying Embodied Navigation Tasks

Add code
Dec 09, 2024
Viaarxiv icon