Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ashwin Sridhar

Exploring Pre-training Across Domains for Few-Shot Surgical Skill Assessment

Sep 11, 2025

Dimitrios Anastasiou, Razvan Caramalau, Nazir Sirajudeen, Matthew Boal, Philip Edwards, Justin Collins, John Kelly, Ashwin Sridhar, Maxine Tran, Faiz Mumtaz(+4 more)

Figure 1 for Exploring Pre-training Across Domains for Few-Shot Surgical Skill Assessment

Figure 2 for Exploring Pre-training Across Domains for Few-Shot Surgical Skill Assessment

Figure 3 for Exploring Pre-training Across Domains for Few-Shot Surgical Skill Assessment

Figure 4 for Exploring Pre-training Across Domains for Few-Shot Surgical Skill Assessment

Abstract:Automated surgical skill assessment (SSA) is a central task in surgical computer vision. Developing robust SSA models is challenging due to the scarcity of skill annotations, which are time-consuming to produce and require expert consensus. Few-shot learning (FSL) offers a scalable alternative enabling model development with minimal supervision, though its success critically depends on effective pre-training. While widely studied for several surgical downstream tasks, pre-training has remained largely unexplored in SSA. In this work, we formulate SSA as a few-shot task and investigate how self-supervised pre-training strategies affect downstream few-shot SSA performance. We annotate a publicly available robotic surgery dataset with Objective Structured Assessment of Technical Skill (OSATS) scores, and evaluate various pre-training sources across three few-shot settings. We quantify domain similarity and analyze how domain gap and the inclusion of procedure-specific data into pre-training influence transferability. Our results show that small but domain-relevant datasets can outperform large scale, less aligned ones, achieving accuracies of 60.16%, 66.03%, and 73.65% in the 1-, 2-, and 5-shot settings, respectively. Moreover, incorporating procedure-specific data into pre-training with a domain-relevant external dataset significantly boosts downstream performance, with an average gain of +1.22% in accuracy and +2.28% in F1-score; however, applying the same strategy with less similar but large-scale sources can instead lead to performance degradation. Code and models are available at https://github.com/anastadimi/ssa-fsl.

* Accepted at MICCAI 2025 DEMI Workshop

Via

Access Paper or Ask Questions

Higher Order of Motion Magnification for Vessel Localisation in Surgical Video

Jun 13, 2018

Mirek Janatka, Ashwin Sridhar, John Kelly, Danail Stoyanov

Figure 1 for Higher Order of Motion Magnification for Vessel Localisation in Surgical Video

Figure 2 for Higher Order of Motion Magnification for Vessel Localisation in Surgical Video

Figure 3 for Higher Order of Motion Magnification for Vessel Localisation in Surgical Video

Figure 4 for Higher Order of Motion Magnification for Vessel Localisation in Surgical Video

Abstract:Locating vessels during surgery is critical for avoiding inadvertent damage, yet vasculature can be difficult to identify. Video motion magnification can potentially highlight vessels by exaggerating subtle motion embedded within the video to become perceivable to the surgeon. In this paper, we explore a physiological model of artery distension to extend motion magnification to incorporate higher orders of motion, leveraging the difference in acceleration over time (jerk) in pulsatile motion to highlight the vascular pulse wave. Our method is compared to first and second order motion based Eulerian video magnification algorithms. Using data from a surgical video retrieved during a robotic prostatectomy, we show that our method can accentuate cardio-physiological features and produce a more succinct and clearer video for motion magnification, with more similarities in areas without motion to the source video at large magnifications. We validate the approach with a Structure Similarity (SSIM) and Peak Signal to Noise Ratio (PSNR) assessment of three videos at an increasing working distance, using three different levels of optical magnification. Spatio-temporal cross sections are presented to show the effectiveness of our proposal and video samples are provided to demonstrates qualitatively our results.

* Accepted to the International Conference On Medical Image Computing & Computer Assisted Intervention, 2018

Via

Access Paper or Ask Questions