Picture for Gao Huang

Gao Huang

Emulating Human-like Adaptive Vision for Efficient and Flexible Machine Visual Perception

Add code
Sep 18, 2025
Viaarxiv icon

UltraHiT: A Hierarchical Transformer Architecture for Generalizable Internal Carotid Artery Robotic Ultrasonography

Add code
Sep 17, 2025
Viaarxiv icon

MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation

Add code
Aug 26, 2025
Figure 1 for MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation
Figure 2 for MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation
Figure 3 for MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation
Figure 4 for MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation
Viaarxiv icon

Video Perception Models for 3D Scene Synthesis

Add code
Jun 25, 2025
Viaarxiv icon

DenseGrounding: Improving Dense Language-Vision Semantics for Ego-Centric 3D Visual Grounding

Add code
May 08, 2025
Viaarxiv icon

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Add code
May 07, 2025
Viaarxiv icon

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Add code
Apr 18, 2025
Viaarxiv icon

CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning

Add code
Apr 18, 2025
Viaarxiv icon

EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance

Add code
Apr 17, 2025
Viaarxiv icon

DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation

Add code
Apr 09, 2025
Viaarxiv icon