Hao Ding

Brian

Open-H-Embodiment: A Large-Scale Dataset for Enabling Foundation Models in Medical Robotics

Apr 22, 2026
Via arXiv

Compressing Sequences in the Latent Embedding Space: $K$-Token Merging for Large Language Models

Apr 16, 2026
Via arXiv

AffordTissue: Dense Affordance Prediction for Tool-Action Specific Tissue Interaction

Apr 01, 2026
Via arXiv

SAW: Toward a Surgical Action World Model via Controllable and Scalable Video Generation

Mar 13, 2026
Via arXiv

Towards Controllable Video Synthesis of Routine and Rare OR Events

Feb 24, 2026
Via arXiv

TikArt: Aperture-Guided Observation for Fine-Grained Visual Reasoning via Reinforcement Learning

Feb 16, 2026
Via arXiv

Kimi K2.5: Visual Agentic Intelligence

Feb 02, 2026
Via arXiv

BronchOpt: Vision-Based Pose Optimization with Fine-Tuned Foundation Models for Accurate Bronchoscopy Navigation

Nov 12, 2025
Via arXiv

TwinOR: Photorealistic Digital Twins of Dynamic Operating Rooms for Embodied AI Research

Nov 10, 2025
Via arXiv

Did you just see that? Arbitrary view synthesis for egocentric replay of operating room workflows from ambient sensors

Oct 06, 2025
Via arXiv