Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Stefan Sandfeld

Towards knowledge-based workflows: a semantic approach to atomistic simulations for mechanical and thermodynamic properties

Feb 01, 2026

Abril Azocar Guzman, Hoang-Thien Luu, Sarath Menon, Tilmann Hickel, Nina Merkert, Stefan Sandfeld

Abstract:Mechanical and thermodynamic properties, including the influence of crystal defects, are critical for evaluating materials in engineering applications. Molecular dynamics simulations provide valuable insight into these mechanisms at the atomic scale. However, current practice often relies on fragmented scripts with inconsistent metadata and limited provenance, which hinders reproducibility, interoperability, and reuse. FAIR data principles and workflow-based approaches offer a path to address these limitations. We present reusable atomistic workflows that incorporate metadata annotation aligned with application ontologies, enabling automatic provenance capture and FAIR-compliant data outputs. The workflows cover key mechanical and thermodynamic quantities, including equation of state, elastic tensors, mechanical loading, thermal properties, defect formation energies, and nanoindentation. We demonstrate validation of structure-property relations such as the Hall-Petch effect and show that the workflows can be reused across different interatomic potentials and materials within a coherent semantic framework. The approach provides AI-ready simulation data, supports emerging agentic AI workflows, and establishes a generalizable blueprint for knowledge-based mechanical and thermodynamic simulations.

Via

Access Paper or Ask Questions

Ontology-aligned structuring and reuse of multimodal materials data and workflows towards automatic reproduction

Jan 18, 2026

Sepideh Baghaee Ravari, Abril Azocar Guzman, Sarath Menon, Stefan Sandfeld, Tilmann Hickel, Markus Stricker

Abstract:Reproducibility of computational results remains a challenge in materials science, as simulation workflows and parameters are often reported only in unstructured text and tables. While literature data are valuable for validation and reuse, the lack of machine-readable workflow descriptions prevents large-scale curation and systematic comparison. Existing text-mining approaches are insufficient to extract complete computational workflows with their associated parameters. An ontology-driven, large language model (LLM)-assisted framework is introduced for the automated extraction and structuring of computational workflows from the literature. The approach focuses on density functional theory-based stacking fault energy (SFE) calculations in hexagonal close-packed magnesium and its binary alloys, and uses a multi-stage filtering strategy together with prompt-engineered LLM extraction applied to method sections and tables. Extracted information is unified into a canonical schema and aligned with established materials ontologies (CMSO, ASMO, and PLDO), enabling the construction of a knowledge graph using atomRDF. The resulting knowledge graph enables systematic comparison of reported SFE values and supports the structured reuse of computational protocols. While full computational reproducibility is still constrained by missing or implicit metadata, the framework provides a foundation for organizing and contextualizing published results in a semantically interoperable form, thereby improving transparency and reusability of computational materials data.

* 39 pages, 7 figures

Via

Access Paper or Ask Questions

Out-of-distribution generalization of deep-learning surrogates for 2D PDE-generated dynamics in the small-data regime

Jan 13, 2026

Binh Duong Nguyen, Stefan Sandfeld

Abstract:Partial differential equations (PDEs) are a central tool for modeling the dynamics of physical, engineering, and materials systems, but high-fidelity simulations are often computationally expensive. At the same time, many scientific applications can be viewed as the evolution of spatially distributed fields, making data-driven forecasting of such fields a core task in scientific machine learning. In this work we study autoregressive deep-learning surrogates for two-dimensional PDE dynamics on periodic domains, focusing on generalization to out-of-distribution initial conditions within a fixed PDE and parameter regime and on strict small-data settings with at most $\mathcal{O}(10^2)$ simulated trajectories per system. We introduce a multi-channel U-Net [...], evaluate it on five qualitatively different PDE families and compare it to ViT, AFNO, PDE-Transformer, and KAN-UNet under a common training setup. Across all datasets, me-UNet matches or outperforms these more complex architectures in terms of field-space error, spectral similarity, and physics-based metrics for in-distribution rollouts, while requiring substantially less training time. It also generalizes qualitatively to unseen initial conditions with as few as $\approx 20$ training simulations. A data-efficiency study and Grad-CAM analysis further suggest that, in small-data periodic 2D PDE settings, convolutional architectures with inductive biases aligned to locality and periodic boundary conditions remain strong contenders for accurate and moderately out-of-distribution-robust surrogate modeling.

Via

Access Paper or Ask Questions

Self-Supervised Learning in Electron Microscopy: Towards a Foundation Model for Advanced Image Analysis

Feb 28, 2024

Bashir Kazimi, Karina Ruzaeva, Stefan Sandfeld

Figure 1 for Self-Supervised Learning in Electron Microscopy: Towards a Foundation Model for Advanced Image Analysis

Figure 2 for Self-Supervised Learning in Electron Microscopy: Towards a Foundation Model for Advanced Image Analysis

Figure 3 for Self-Supervised Learning in Electron Microscopy: Towards a Foundation Model for Advanced Image Analysis

Figure 4 for Self-Supervised Learning in Electron Microscopy: Towards a Foundation Model for Advanced Image Analysis

Abstract:In this work, we explore the potential of self-supervised learning from unlabeled electron microscopy datasets, taking a step toward building a foundation model in this field. We show how self-supervised pretraining facilitates efficient fine-tuning for a spectrum of downstream tasks, including semantic segmentation, denoising, noise & background removal, and super-resolution. Experimentation with varying model complexities and receptive field sizes reveals the remarkable phenomenon that fine-tuned models of lower complexity consistently outperform more complex models with random weight initialization. We demonstrate the versatility of self-supervised pretraining across various downstream tasks in the context of electron microscopy, allowing faster convergence and better performance. We conclude that self-supervised pretraining serves as a powerful catalyst, being especially advantageous when limited annotated data are available and efficient scaling of computational cost are important.

Via

Access Paper or Ask Questions

Combining unsupervised and supervised learning in microscopy enables defect analysis of a full 4H-SiC wafer

Feb 20, 2024

Binh Duong Nguyen, Johannes Steiner, Peter Wellmann, Stefan Sandfeld

Figure 1 for Combining unsupervised and supervised learning in microscopy enables defect analysis of a full 4H-SiC wafer

Figure 2 for Combining unsupervised and supervised learning in microscopy enables defect analysis of a full 4H-SiC wafer

Figure 3 for Combining unsupervised and supervised learning in microscopy enables defect analysis of a full 4H-SiC wafer

Figure 4 for Combining unsupervised and supervised learning in microscopy enables defect analysis of a full 4H-SiC wafer

Abstract:Detecting and analyzing various defect types in semiconductor materials is an important prerequisite for understanding the underlying mechanisms as well as tailoring the production processes. Analysis of microscopy images that reveal defects typically requires image analysis tasks such as segmentation and object detection. With the permanently increasing amount of data that is produced by experiments, handling these tasks manually becomes more and more impossible. In this work, we combine various image analysis and data mining techniques for creating a robust and accurate, automated image analysis pipeline. This allows for extracting the type and position of all defects in a microscopy image of a KOH-etched 4H-SiC wafer that was stitched together from approximately 40,000 individual images.

Via

Access Paper or Ask Questions

DISO: A Domain Ontology for Modeling Dislocations in Crystalline Materials

Jan 04, 2024

Ahmad Zainul Ihsan, Said Fathalla, Stefan Sandfeld

Figure 1 for DISO: A Domain Ontology for Modeling Dislocations in Crystalline Materials

Figure 2 for DISO: A Domain Ontology for Modeling Dislocations in Crystalline Materials

Figure 3 for DISO: A Domain Ontology for Modeling Dislocations in Crystalline Materials

Figure 4 for DISO: A Domain Ontology for Modeling Dislocations in Crystalline Materials

Abstract:Crystalline materials, such as metals and semiconductors, nearly always contain a special defect type called dislocation. This defect decisively determines many important material properties, e.g., strength, fracture toughness, or ductility. Over the past years, significant effort has been put into understanding dislocation behavior across different length scales via experimental characterization techniques and simulations. This paper introduces the dislocation ontology (DISO), which defines the concepts and relationships related to linear defects in crystalline materials. We developed DISO using a top-down approach in which we start defining the most general concepts in the dislocation domain and subsequent specialization of them. DISO is published through a persistent URL following W3C best practices for publishing Linked Data. Two potential use cases for DISO are presented to illustrate its usefulness in the dislocation dynamics domain. The evaluation of the ontology is performed in two directions, evaluating the success of the ontology in modeling a real-world domain and the richness of the ontology.

Via

Access Paper or Ask Questions

Machine learning for structure-guided materials and process design

Dec 22, 2023

Lukas Morand, Tarek Iraki, Johannes Dornheim, Stefan Sandfeld, Norbert Link, Dirk Helm

Abstract:In recent years, there has been a growing interest in accelerated materials innovation in both, research and industry. However, to truly add value to the development of new advanced materials, it is inevitable to take into account manufacturing processes and thereby tailor materials design approaches to support downstream process design approaches. As a major step into this direction, we present a holistic optimization approach that covers the entire materials process-structure-property chain. Our approach specifically employs machine learning techniques to address two critical identification problems. The first is to solve a materials design problem, which involves identifying near-optimal material structures that exhibit desired macroscopic properties. The second is to solve a process design problem that is to find an optimal processing path to manufacture these material structures. Both identification problems are typically ill-posed, which presents a significant challenge for solution approaches. However, the non-unique nature of these problems also offers an important advantage for processing: By having several target structures that perform similarly well, the corresponding processes can be efficiently guided towards manufacturing the best reachable structure. In particular, we apply deep reinforcement learning for process design in combination with a multi-task learning-based optimization approach for materials design. The functionality of the approach will be demonstrated by using it to manufacture crystallographic textures with desired properties in a metal forming process.

Via

Access Paper or Ask Questions

Modeling Dislocation Dynamics Data Using Semantic Web Technologies

Sep 13, 2023

Ahmad Zainul Ihsan, Said Fathalla, Stefan Sandfeld

Abstract:Research in the field of Materials Science and Engineering focuses on the design, synthesis, properties, and performance of materials. An important class of materials that is widely investigated are crystalline materials, including metals and semiconductors. Crystalline material typically contains a distinct type of defect called "dislocation". This defect significantly affects various material properties, including strength, fracture toughness, and ductility. Researchers have devoted a significant effort in recent years to understanding dislocation behavior through experimental characterization techniques and simulations, e.g., dislocation dynamics simulations. This paper presents how data from dislocation dynamics simulations can be modeled using semantic web technologies through annotating data with ontologies. We extend the already existing Dislocation Ontology by adding missing concepts and aligning it with two other domain-related ontologies (i.e., the Elementary Multi-perspective Material Ontology and the Materials Design Ontology) allowing for representing the dislocation simulation data efficiently. Moreover, we show a real-world use case by representing the discrete dislocation dynamics data as a knowledge graph (DisLocKG) that illustrates the relationship between them. We also developed a SPARQL endpoint that brings extensive flexibility to query DisLocKG.

Via

Access Paper or Ask Questions

Unsupervised Learning of Nanoindentation Data to Infer Microstructural Details of Complex Materials

Sep 12, 2023

Chen Zhang, Clémence Bos, Stefan Sandfeld, Ruth Schwaiger

Figure 1 for Unsupervised Learning of Nanoindentation Data to Infer Microstructural Details of Complex Materials

Figure 2 for Unsupervised Learning of Nanoindentation Data to Infer Microstructural Details of Complex Materials

Figure 3 for Unsupervised Learning of Nanoindentation Data to Infer Microstructural Details of Complex Materials

Figure 4 for Unsupervised Learning of Nanoindentation Data to Infer Microstructural Details of Complex Materials

Abstract:In this study, Cu-Cr composites were studied by nanoindentation. Arrays of indents were placed over large areas of the samples resulting in datasets consisting of several hundred measurements of Young's modulus and hardness at varying indentation depths. The unsupervised learning technique, Gaussian mixture model, was employed to analyze the data, which helped to determine the number of "mechanical phases" and the respective mechanical properties. Additionally, a cross-validation approach was introduced to infer whether the data quantity was adequate and to suggest the amount of data required for reliable predictions -- one of the often encountered but difficult to resolve issues in machine learning of materials science problems.

Via

Access Paper or Ask Questions

Instance Segmentation of Dislocations in TEM Images

Sep 07, 2023

Karina Ruzaeva, Kishan Govind, Marc Legros, Stefan Sandfeld

Figure 1 for Instance Segmentation of Dislocations in TEM Images

Figure 2 for Instance Segmentation of Dislocations in TEM Images

Figure 3 for Instance Segmentation of Dislocations in TEM Images

Figure 4 for Instance Segmentation of Dislocations in TEM Images

Abstract:Quantitative Transmission Electron Microscopy (TEM) during in-situ straining experiment is able to reveal the motion of dislocations -- linear defects in the crystal lattice of metals. In the domain of materials science, the knowledge about the location and movement of dislocations is important for creating novel materials with superior properties. A long-standing problem, however, is to identify the position and extract the shape of dislocations, which would ultimately help to create a digital twin of such materials. In this work, we quantitatively compare state-of-the-art instance segmentation methods, including Mask R-CNN and YOLOv8. The dislocation masks as the results of the instance segmentation are converted to mathematical lines, enabling quantitative analysis of dislocation length and geometry -- important information for the domain scientist, which we then propose to include as a novel length-aware quality metric for estimating the network performance. Our segmentation pipeline shows a high accuracy suitable for all domain-specific, further post-processing. Additionally, our physics-based metric turns out to perform much more consistently than typically used pixel-wise metrics.

* IEEE 23rd International Conference on Nanotechnology (2023) 1-6

Via

Access Paper or Ask Questions