The exploration of transition state (TS) geometries is crucial for elucidating chemical reaction mechanisms and modeling their kinetics. In recent years, machine learning (ML) models have shown remarkable performance in TS geometry prediction. However, they require 3D geometries of reactants and products that can be challenging to determine. To tackle this, we introduce TSDiff, a novel ML model based on the stochastic diffusion method, which generates the 3D geometry of the TS from a 2D graph composed of molecular connectivity. Despite of this simple input, TSDiff generated TS geometries with high accuracy, outperforming existing ML models that utilize geometric information. Moreover, the generative model approach enabled the sampling of various valid TS conformations, even though only a single conformation for each reaction was used in training. Consequently, TSDiff also found more favorable reaction pathways with lower barrier heights than those in the reference database. We anticipate that this approach will be useful for exploring complex reactions that require the consideration of multiple TS conformations.
As quantum chemical properties have a significant dependence on their geometries, graph neural networks (GNNs) using 3D geometric information have achieved high prediction accuracy in many tasks. However, they often require 3D geometries obtained from high-level quantum mechanical calculations, which are practically infeasible, limiting their applicability in real-world problems. To tackle this, we propose a method to accurately predict the properties with relatively easy-to-obtain geometries (e.g., optimized geometries from the molecular force field). In this method, the input geometry, regarded as the corrupted geometry of the correct one, gradually approaches the correct one as it passes through the stacked denoising layers. We investigated the performance of the proposed method using 3D message-passing architectures for two prediction tasks: molecular properties and chemical reaction property. The reduction of positional errors through the denoising process contributed to performance improvement by increasing the mutual information between the correct and corrupted geometries. Moreover, our analysis of the correlation between denoising power and predictive accuracy demonstrates the effectiveness of the denoising process.