Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Matteo Ronchetti

DualTrack: Sensorless 3D Ultrasound needs Local and Global Context

Sep 11, 2025

Paul F. R. Wilson, Matteo Ronchetti, Rüdiger Göbl, Viktoria Markova, Sebastian Rosenzweig, Raphael Prevost, Parvin Mousavi, Oliver Zettinig

Abstract:Three-dimensional ultrasound (US) offers many clinical advantages over conventional 2D imaging, yet its widespread adoption is limited by the cost and complexity of traditional 3D systems. Sensorless 3D US, which uses deep learning to estimate a 3D probe trajectory from a sequence of 2D US images, is a promising alternative. Local features, such as speckle patterns, can help predict frame-to-frame motion, while global features, such as coarse shapes and anatomical structures, can situate the scan relative to anatomy and help predict its general shape. In prior approaches, global features are either ignored or tightly coupled with local feature extraction, restricting the ability to robustly model these two complementary aspects. We propose DualTrack, a novel dual-encoder architecture that leverages decoupled local and global encoders specialized for their respective scales of feature extraction. The local encoder uses dense spatiotemporal convolutions to capture fine-grained features, while the global encoder utilizes an image backbone (e.g., a 2D CNN or foundation model) and temporal attention layers to embed high-level anatomical features and long-range dependencies. A lightweight fusion module then combines these features to estimate the trajectory. Experimental results on a large public benchmark show that DualTrack achieves state-of-the-art accuracy and globally consistent 3D reconstructions, outperforming previous methods and yielding an average reconstruction error below 5 mm.

Via

Access Paper or Ask Questions

2D-3D Deformable Image Registration of Histology Slide and Micro-CT with ML-based Initialization

Oct 18, 2024

Junan Chen, Matteo Ronchetti, Verena Stehl, Van Nguyen, Muhannad Al Kallaa, Mahesh Thalwaththe Gedara, Claudia Lölkes, Stefan Moser, Maximilian Seidl, Matthias Wieczorek

Abstract:Recent developments in the registration of histology and micro-computed tomography ({\mu}CT) have broadened the perspective of pathological applications such as virtual histology based on {\mu}CT. This topic remains challenging because of the low image quality of soft tissue CT. Additionally, soft tissue samples usually deform during the histology slide preparation, making it difficult to correlate the structures between histology slide and {\mu}CT. In this work, we propose a novel 2D-3D multi-modal deformable image registration method. The method uses a machine learning (ML) based initialization followed by the registration. The registration is finalized by an analytical out-of-plane deformation refinement. The method is evaluated on datasets acquired from tonsil and tumor tissues. {\mu}CTs of both phase-contrast and conventional absorption modalities are investigated. The registration results from the proposed method are compared with those from intensity- and keypoint-based methods. The comparison is conducted using both visual and fiducial-based evaluations. The proposed method demonstrates superior performance compared to the other two methods.

* 12 pages, 4 figures

Via

Access Paper or Ask Questions

DISA: DIfferentiable Similarity Approximation for Universal Multimodal Registration

Jul 19, 2023

Matteo Ronchetti, Wolfgang Wein, Nassir Navab, Oliver Zettinig, Raphael Prevost

Figure 1 for DISA: DIfferentiable Similarity Approximation for Universal Multimodal Registration

Figure 2 for DISA: DIfferentiable Similarity Approximation for Universal Multimodal Registration

Figure 3 for DISA: DIfferentiable Similarity Approximation for Universal Multimodal Registration

Figure 4 for DISA: DIfferentiable Similarity Approximation for Universal Multimodal Registration

Abstract:Multimodal image registration is a challenging but essential step for numerous image-guided procedures. Most registration algorithms rely on the computation of complex, frequently non-differentiable similarity metrics to deal with the appearance discrepancy of anatomical structures between imaging modalities. Recent Machine Learning based approaches are limited to specific anatomy-modality combinations and do not generalize to new settings. We propose a generic framework for creating expressive cross-modal descriptors that enable fast deformable global registration. We achieve this by approximating existing metrics with a dot-product in the feature space of a small convolutional neural network (CNN) which is inherently differentiable can be trained without registered data. Our method is several orders of magnitude faster than local patch-based metrics and can be directly applied in clinical settings by replacing the similarity measure with the proposed one. Experiments on three different datasets demonstrate that our approach generalizes well beyond the training data, yielding a broad capture range even on unseen anatomies and modality pairs, without the need for specialized retraining. We make our training code and data publicly available.

* This preprint was submitted to MICCAI 2023. The Version of Record of this contribution will be published in Springer LNCS

Via

Access Paper or Ask Questions

PRO-TIP: Phantom for RObust automatic ultrasound calibration by TIP detection

Jun 13, 2022

Matteo Ronchetti, Julia Rackerseder, Maria Tirindelli, Mehrdad Salehi, Nassir Navab, Wolfgang Wein, Oliver Zettinig

Figure 1 for PRO-TIP: Phantom for RObust automatic ultrasound calibration by TIP detection

Figure 2 for PRO-TIP: Phantom for RObust automatic ultrasound calibration by TIP detection

Figure 3 for PRO-TIP: Phantom for RObust automatic ultrasound calibration by TIP detection

Figure 4 for PRO-TIP: Phantom for RObust automatic ultrasound calibration by TIP detection

Abstract:We propose a novel method to automatically calibrate tracked ultrasound probes. To this end we design a custom phantom consisting of nine cones with different heights. The tips are used as key points to be matched between multiple sweeps. We extract them using a convolutional neural network to segment the cones in every ultrasound frame and then track them across the sweep. The calibration is robustly estimated using RANSAC and later refined employing image based techniques. Our phantom can be 3D-printed and offers many advantages over state-of-the-art methods. The phantom design and algorithm code are freely available online. Since our phantom does not require a tracking target on itself, ease of use is improved over currently used techniques. The fully automatic method generalizes to new probes and different vendors, as shown in our experiments. Our approach produces results comparable to calibrations obtained by a domain expert.

* This preprint was submitted to MICCAI 2022. The Version of Record of this contribution will be published in Springer LNCS

Via

Access Paper or Ask Questions

Global Multi-modal 2D/3D Registration via Local Descriptors Learning

May 06, 2022

Viktoria Markova, Matteo Ronchetti, Wolfgang Wein, Oliver Zettinig, Raphael Prevost

Figure 1 for Global Multi-modal 2D/3D Registration via Local Descriptors Learning

Figure 2 for Global Multi-modal 2D/3D Registration via Local Descriptors Learning

Figure 3 for Global Multi-modal 2D/3D Registration via Local Descriptors Learning

Figure 4 for Global Multi-modal 2D/3D Registration via Local Descriptors Learning

Abstract:Multi-modal registration is a required step for many image-guided procedures, especially ultrasound-guided interventions that require anatomical context. While a number of such registration algorithms are already available, they all require a good initialization to succeed due to the challenging appearance of ultrasound images and the arbitrary coordinate system they are acquired in. In this paper, we present a novel approach to solve the problem of registration of an ultrasound sweep to a pre-operative image. We learn dense keypoint descriptors from which we then estimate the registration. We show that our method overcomes the challenges inherent to registration tasks with freehand ultrasound sweeps, namely, the multi-modality and multidimensionality of the data in addition to lack of precise ground truth and low amounts of training examples. We derive a registration method that is fast, generic, fully automatic, does not require any initialization and can naturally generate visualizations aiding interpretability and explainability. Our approach is evaluated on a clinical dataset of paired MR volumes and ultrasound sequences.

* This preprint was submitted to MICCAI 2022 and has not undergone post-submission improvements or corrections. The Version of Record of this contribution will be published in Springer LNCS

Via

Access Paper or Ask Questions

Generative Tomography Reconstruction

Oct 26, 2020

Matteo Ronchetti, Davide Bacciu

Figure 1 for Generative Tomography Reconstruction

Figure 2 for Generative Tomography Reconstruction

Figure 3 for Generative Tomography Reconstruction

Figure 4 for Generative Tomography Reconstruction

Abstract:We propose an end-to-end differentiable architecture for tomography reconstruction that directly maps a noisy sinogram into a denoised reconstruction. Compared to existing approaches our end-to-end architecture produces more accurate reconstructions while using less parameters and time. We also propose a generative model that, given a noisy sinogram, can sample realistic reconstructions. This generative model can be used as prior inside an iterative process that, by taking into consideration the physical model, can reduce artifacts and errors in the reconstructions.

* Accepted as a poster for the NeurIPS 2020 Workshop on Deep Learning and Inverse Problems

Via

Access Paper or Ask Questions

TorchRadon: Fast Differentiable Routines for Computed Tomography

Sep 29, 2020

Matteo Ronchetti

Figure 1 for TorchRadon: Fast Differentiable Routines for Computed Tomography

Figure 2 for TorchRadon: Fast Differentiable Routines for Computed Tomography

Figure 3 for TorchRadon: Fast Differentiable Routines for Computed Tomography

Figure 4 for TorchRadon: Fast Differentiable Routines for Computed Tomography

Abstract:This work presents TorchRadon -- an open source CUDA library which contains a set of differentiable routines for solving computed tomography (CT) reconstruction problems. The library is designed to help researchers working on CT problems to combine deep learning and model-based approaches. The package is developed as a PyTorch extension and can be seamlessly integrated into existing deep learning training code. Compared to the existing Astra Toolbox, TorchRadon is up to 125 faster. The operators implemented by TorchRadon allow the computation of gradients using PyTorch backward(), and can therefore be easily inserted inside existing neural networks architectures. Because of its speed and GPU support, TorchRadon can also be effectively used as a fast backend for the implementation of iterative algorithms. This paper presents the main functionalities of the library, compares results with existing libraries and provides examples of usage.

Via

Access Paper or Ask Questions

IKA: Independent Kernel Approximator

Sep 05, 2018

Matteo Ronchetti

Abstract:This paper describes a new method for low rank kernel approximation called IKA. The main advantage of IKA is that it produces a function $\psi(x)$ defined as a linear combination of arbitrarily chosen functions. In contrast the approximation produced by Nystr\"om method is a linear combination of kernel evaluations. The proposed method consistently outperformed Nystr\"om method in a comparison on the STL-10 dataset. Numerical results are reproducible using the source code available at https://gitlab.com/matteo-ronchetti/IKA

Via

Access Paper or Ask Questions