Picture for Yafei Ou

Yafei Ou

Step-CoT: Stepwise Visual Chain-of-Thought for Medical Visual Question Answering

Add code
Mar 14, 2026
Viaarxiv icon

PipeMFL-240K: A Large-scale Dataset and Benchmark for Object Detection in Pipeline Magnetic Flux Leakage Imaging

Add code
Feb 04, 2026
Viaarxiv icon

Experience-Driven Multi-Agent Systems Are Training-free Context-aware Earth Observers

Add code
Jan 30, 2026
Viaarxiv icon

CRESSim-MPM: A Material Point Method Library for Surgical Soft Body Simulation with Cutting and Suturing

Add code
Feb 25, 2025
Figure 1 for CRESSim-MPM: A Material Point Method Library for Surgical Soft Body Simulation with Cutting and Suturing
Figure 2 for CRESSim-MPM: A Material Point Method Library for Surgical Soft Body Simulation with Cutting and Suturing
Figure 3 for CRESSim-MPM: A Material Point Method Library for Surgical Soft Body Simulation with Cutting and Suturing
Figure 4 for CRESSim-MPM: A Material Point Method Library for Surgical Soft Body Simulation with Cutting and Suturing
Viaarxiv icon

Layer Separation: Adjustable Joint Space Width Images Synthesis in Conventional Radiography

Add code
Feb 04, 2025
Viaarxiv icon

Learning Autonomous Surgical Irrigation and Suction with the da Vinci Research Kit Using Reinforcement Learning

Add code
Nov 21, 2024
Figure 1 for Learning Autonomous Surgical Irrigation and Suction with the da Vinci Research Kit Using Reinforcement Learning
Figure 2 for Learning Autonomous Surgical Irrigation and Suction with the da Vinci Research Kit Using Reinforcement Learning
Figure 3 for Learning Autonomous Surgical Irrigation and Suction with the da Vinci Research Kit Using Reinforcement Learning
Figure 4 for Learning Autonomous Surgical Irrigation and Suction with the da Vinci Research Kit Using Reinforcement Learning
Viaarxiv icon

BLS-GAN: A Deep Layer Separation Framework for Eliminating Bone Overlap in Conventional Radiographs

Add code
Sep 11, 2024
Viaarxiv icon

From Decision to Action in Surgical Autonomy: Multi-Modal Large Language Models for Robot-Assisted Blood Suction

Add code
Aug 14, 2024
Figure 1 for From Decision to Action in Surgical Autonomy: Multi-Modal Large Language Models for Robot-Assisted Blood Suction
Figure 2 for From Decision to Action in Surgical Autonomy: Multi-Modal Large Language Models for Robot-Assisted Blood Suction
Figure 3 for From Decision to Action in Surgical Autonomy: Multi-Modal Large Language Models for Robot-Assisted Blood Suction
Figure 4 for From Decision to Action in Surgical Autonomy: Multi-Modal Large Language Models for Robot-Assisted Blood Suction
Viaarxiv icon

Tri-VQA: Triangular Reasoning Medical Visual Question Answering for Multi-Attribute Analysis

Add code
Jun 21, 2024
Figure 1 for Tri-VQA: Triangular Reasoning Medical Visual Question Answering for Multi-Attribute Analysis
Figure 2 for Tri-VQA: Triangular Reasoning Medical Visual Question Answering for Multi-Attribute Analysis
Figure 3 for Tri-VQA: Triangular Reasoning Medical Visual Question Answering for Multi-Attribute Analysis
Figure 4 for Tri-VQA: Triangular Reasoning Medical Visual Question Answering for Multi-Attribute Analysis
Viaarxiv icon

MDA: An Interpretable Multi-Modal Fusion with Missing Modalities and Intrinsic Noise

Add code
Jun 15, 2024
Figure 1 for MDA: An Interpretable Multi-Modal Fusion with Missing Modalities and Intrinsic Noise
Figure 2 for MDA: An Interpretable Multi-Modal Fusion with Missing Modalities and Intrinsic Noise
Figure 3 for MDA: An Interpretable Multi-Modal Fusion with Missing Modalities and Intrinsic Noise
Figure 4 for MDA: An Interpretable Multi-Modal Fusion with Missing Modalities and Intrinsic Noise
Viaarxiv icon