Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jay Hineman

Geometric Feature Prompting of Image Segmentation Models

May 27, 2025

Kenneth Ball, Erin Taylor, Nirav Patel, Andrew Bartels, Gary Koplik, James Polly, Jay Hineman

Abstract:Advances in machine learning, especially the introduction of transformer architectures and vision transformers, have led to the development of highly capable computer vision foundation models. The segment anything model (known colloquially as SAM and more recently SAM 2), is a highly capable foundation model for segmentation of natural images and has been further applied to medical and scientific image segmentation tasks. SAM relies on prompts -- points or regions of interest in an image -- to generate associated segmentations. In this manuscript we propose the use of a geometrically motivated prompt generator to produce prompt points that are colocated with particular features of interest. Focused prompting enables the automatic generation of sensitive and specific segmentations in a scientific image analysis task using SAM with relatively few point prompts. The image analysis task examined is the segmentation of plant roots in rhizotron or minirhizotron images, which has historically been a difficult task to automate. Hand annotation of rhizotron images is laborious and often subjective; SAM, initialized with GeomPrompt local ridge prompts has the potential to dramatically improve rhizotron image processing. The authors have concurrently released an open source software suite called geomprompt https://pypi.org/project/geomprompt/ that can produce point prompts in a format that enables direct integration with the segment-anything package.

Via

Access Paper or Ask Questions

Supervised Learning of Labeled Pointcloud Differences via Cover-Tree Entropy Reduction

Jan 19, 2018

Abraham Smith, Paul Bendich, John Harer, Alex Pieloch, Jay Hineman

Figure 1 for Supervised Learning of Labeled Pointcloud Differences via Cover-Tree Entropy Reduction

Figure 2 for Supervised Learning of Labeled Pointcloud Differences via Cover-Tree Entropy Reduction

Figure 3 for Supervised Learning of Labeled Pointcloud Differences via Cover-Tree Entropy Reduction

Figure 4 for Supervised Learning of Labeled Pointcloud Differences via Cover-Tree Entropy Reduction

Abstract:We introduce a new algorithm, called CDER, for supervised machine learning that merges the multi-scale geometric properties of Cover Trees with the information-theoretic properties of entropy. CDER applies to a training set of labeled pointclouds embedded in a common Euclidean space. If typical pointclouds corresponding to distinct labels tend to differ at any scale in any sub-region, CDER can identify these differences in (typically) linear time, creating a set of distributional coordinates which act as a feature extraction mechanism for supervised learning. We describe theoretical properties and implementation details of CDER, and illustrate its benefits on several synthetic examples.

* Distribution Statement A - Approved for public release, distribution is unlimited. Version 2: added link to code, and some minor improvements. Version 3: updated authors and thanks

Via

Access Paper or Ask Questions

Geometric Cross-Modal Comparison of Heterogeneous Sensor Data

Nov 23, 2017

Christopher J. Tralie, Abraham Smith, Nathan Borggren, Jay Hineman, Paul Bendich, Peter Zulch, John Harer

Figure 1 for Geometric Cross-Modal Comparison of Heterogeneous Sensor Data

Figure 2 for Geometric Cross-Modal Comparison of Heterogeneous Sensor Data

Figure 3 for Geometric Cross-Modal Comparison of Heterogeneous Sensor Data

Figure 4 for Geometric Cross-Modal Comparison of Heterogeneous Sensor Data

Abstract:In this work, we address the problem of cross-modal comparison of aerial data streams. A variety of simulated automobile trajectories are sensed using two different modalities: full-motion video, and radio-frequency (RF) signals received by detectors at various locations. The information represented by the two modalities is compared using self-similarity matrices (SSMs) corresponding to time-ordered point clouds in feature spaces of each of these data sources; we note that these feature spaces can be of entirely different scale and dimensionality. Several metrics for comparing SSMs are explored, including a cutting-edge time-warping technique that can simultaneously handle local time warping and partial matches, while also controlling for the change in geometry between feature spaces of the two modalities. We note that this technique is quite general, and does not depend on the choice of modalities. In this particular setting, we demonstrate that the cross-modal distance between SSMs corresponding to the same trajectory type is smaller than the cross-modal distance between SSMs corresponding to distinct trajectory types, and we formalize this observation via precision-recall metrics in experiments. Finally, we comment on promising implications of these ideas for future integration into multiple-hypothesis tracking systems.

* 10 pages, 13 figures, Proceedings of IEEE Aeroconf 2017

Via

Access Paper or Ask Questions