Mengya Xu

SurgWorld: Learning Surgical Robot Policies from Videos via World Modeling

Dec 30, 2025

Comparative validation of surgical phase recognition, instrument keypoint estimation, and instrument instance segmentation in endoscopy: Results of the PhaKIR 2024 challenge

Jul 22, 2025

SAP-Bench: Benchmarking Multimodal Large Language Models in Surgical Action Planning

Jun 08, 2025

EndoARSS: Adapting Spatially-Aware Foundation Model for Efficient Activity Recognition and Semantic Segmentation in Endoscopic Surgery

Jun 07, 2025

ETSM: Automating Dissection Trajectory Suggestion and Confidence Map-Based Safety Margin Prediction for Robot-assisted Endoscopic Submucosal Dissection

Nov 28, 2024

PDZSeg: Adapting the Foundation Model for Dissection Zone Segmentation with Visual Prompts in Robot-assisted Endoscopic Submucosal Dissection

Nov 27, 2024

Benchmarking Robustness of Endoscopic Depth Estimation with Synthetically Corrupted Data

Sep 24, 2024

A Review of 3D Reconstruction Techniques for Deformable Tissues in Robotic Surgery

Aug 08, 2024

SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation

Aug 08, 2024

PitVQA: Image-grounded Text Embedding LLM for Visual Question Answering in Pituitary Surgery

May 22, 2024