Hao Zhang

Refer to the report for detailed contributions.

Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence

Apr 27, 2026
Via arXiv

X2-N: A Transformable Wheel-legged Humanoid Robot with Dual-mode Locomotion and Manipulation

Apr 23, 2026
Via arXiv

Hybrid Latent Reasoning with Decoupled Policy Optimization

Apr 22, 2026
Via arXiv

SegTTA: Training-Free Test-Time Augmentation for Zero-Shot Medical Imaging Segmentation

Apr 19, 2026
Via arXiv

Learning Versatile Humanoid Manipulation with Touch Dreaming

Apr 14, 2026
Via arXiv

EMMa: End-Effector Stability-Oriented Mobile Manipulation for Tracked Rescue Robots

Apr 09, 2026
Via arXiv

Physics-Informed Untrained Learning for RGB-Guided Superresolution Single-Pixel Hyperspectral Imaging

Apr 04, 2026
Via arXiv

DeCo-DETR: Decoupled Cognition DETR for efficient Open-Vocabulary Object Detection

Apr 03, 2026
Via arXiv

ProtoFlow: Mitigating Forgetting in Class-Incremental Remote Sensing Segmentation via Low-Curvature Prototype Flow

Apr 03, 2026
Via arXiv

LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model

Apr 02, 2026
Via arXiv