Qi Dou

for the ALFA study

DSTED: Decoupling Temporal Stabilization and Discriminative Enhancement for Surgical Workflow Recognition

Dec 22, 2025

CP-Env: Evaluating Large Language Models on Clinical Pathways in a Controllable Hospital Environment

Dec 12, 2025

Vulnerable Agent Identification in Large-Scale Multi-Agent Reinforcement Learning

Sep 18, 2025

Toward Robust Medical Fairness: Debiased Dual-Modal Alignment via Text-Guided Attribute-Disentangled Prompt Learning for Vision-Language Models

Aug 26, 2025

ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing

Aug 14, 2025

ClipGS: Clippable Gaussian Splatting for Interactive Cinematic Visualization of Volumetric Medical Data

Jul 09, 2025

Toward Reliable AR-Guided Surgical Navigation: Interactive Deformation Modeling with Data-Driven Biomechanics and Prompts

Jun 11, 2025

SAP-Bench: Benchmarking Multimodal Large Language Models in Surgical Action Planning

Jun 08, 2025

Learning dissection trajectories from expert surgical videos via imitation learning with equivariant diffusion

Jun 05, 2025

Medical Large Vision Language Models with Multi-Image Visual Ability

May 25, 2025