Picture for Qi Dou

Qi Dou

for the ALFA study

Surg$Σ$: A Spectrum of Large-Scale Multimodal Data and Foundation Models for Surgical Intelligence

Add code
Mar 17, 2026
Viaarxiv icon

Generalized Recognition of Basic Surgical Actions Enables Skill Assessment and Vision-Language-Model-based Surgical Planning

Add code
Mar 13, 2026
Viaarxiv icon

Surg-R1: A Hierarchical Reasoning Foundation Model for Scalable and Interpretable Surgical Decision Support with Multi-Center Clinical Validation

Add code
Mar 12, 2026
Viaarxiv icon

CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video

Add code
Mar 04, 2026
Viaarxiv icon

The Dresden Dataset for 4D Reconstruction of Non-Rigid Abdominal Surgical Scenes

Add code
Mar 03, 2026
Viaarxiv icon

Real-time Monocular 2D and 3D Perception of Endoluminal Scenes for Controlling Flexible Robotic Endoscopic Instruments

Add code
Feb 16, 2026
Viaarxiv icon

ARport: An Augmented Reality System for Markerless Image-Guided Port Placement in Robotic Surgery

Add code
Feb 15, 2026
Viaarxiv icon

Concepts from Representations: Post-hoc Concept Bottleneck Models via Sparse Decomposition of Visual Representations

Add code
Jan 18, 2026
Viaarxiv icon

DSTED: Decoupling Temporal Stabilization and Discriminative Enhancement for Surgical Workflow Recognition

Add code
Dec 22, 2025
Figure 1 for DSTED: Decoupling Temporal Stabilization and Discriminative Enhancement for Surgical Workflow Recognition
Figure 2 for DSTED: Decoupling Temporal Stabilization and Discriminative Enhancement for Surgical Workflow Recognition
Figure 3 for DSTED: Decoupling Temporal Stabilization and Discriminative Enhancement for Surgical Workflow Recognition
Figure 4 for DSTED: Decoupling Temporal Stabilization and Discriminative Enhancement for Surgical Workflow Recognition
Viaarxiv icon

CP-Env: Evaluating Large Language Models on Clinical Pathways in a Controllable Hospital Environment

Add code
Dec 12, 2025
Viaarxiv icon