Gao Huang

UltraDP: Generalizable Carotid Ultrasound Scanning with Force-Aware Diffusion Policy

Nov 19, 2025
Via arXiv

Step by Step Network

Nov 18, 2025
Via arXiv

SpatialActor: Exploring Disentangled Spatial Representations for Robust Robotic Manipulation

Nov 12, 2025
Via arXiv

Are My Optimized Prompts Compromised? Exploring Vulnerabilities of LLM-based Optimizers

Oct 16, 2025
Via arXiv

Emulating Human-like Adaptive Vision for Efficient and Flexible Machine Visual Perception

Sep 18, 2025
Via arXiv

UltraHiT: A Hierarchical Transformer Architecture for Generalizable Internal Carotid Artery Robotic Ultrasonography

Sep 17, 2025
Via arXiv

MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation

Aug 26, 2025
Via arXiv

Video Perception Models for 3D Scene Synthesis

Jun 25, 2025
Via arXiv

DenseGrounding: Improving Dense Language-Vision Semantics for Ego-Centric 3D Visual Grounding

May 08, 2025
Via arXiv

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

May 07, 2025
Via arXiv