Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:A Multimodal Data Fusion Generative Adversarial Network for Real Time Underwater Sound Speed Field Construction

Jul 16, 2025

Wei Huang, Yuqiang Huang, Yanan Wu, Tianhe Xu, Junting Wang, Hao Zhang

Figure 1 for A Multimodal Data Fusion Generative Adversarial Network for Real Time Underwater Sound Speed Field Construction

Figure 2 for A Multimodal Data Fusion Generative Adversarial Network for Real Time Underwater Sound Speed Field Construction

Figure 3 for A Multimodal Data Fusion Generative Adversarial Network for Real Time Underwater Sound Speed Field Construction

Figure 4 for A Multimodal Data Fusion Generative Adversarial Network for Real Time Underwater Sound Speed Field Construction

Share this with someone who'll enjoy it:

Abstract:Sound speed profiles (SSPs) are essential parameters underwater that affects the propagation mode of underwater signals and has a critical impact on the energy efficiency of underwater acoustic communication and accuracy of underwater acoustic positioning. Traditionally, SSPs can be obtained by matching field processing (MFP), compressive sensing (CS), and deep learning (DL) methods. However, existing methods mainly rely on on-site underwater sonar observation data, which put forward strict requirements on the deployment of sonar observation systems. To achieve high-precision estimation of sound velocity distribution in a given sea area without on-site underwater data measurement, we propose a multi-modal data-fusion generative adversarial network model with residual attention block (MDF-RAGAN) for SSP construction. To improve the model's ability for capturing global spatial feature correlations, we embedded the attention mechanisms, and use residual modules for deeply capturing small disturbances in the deep ocean sound velocity distribution caused by changes of SST. Experimental results on real open dataset show that the proposed model outperforms other state-of-the-art methods, which achieves an accuracy with an error of less than 0.3m/s. Specifically, MDF-RAGAN not only outperforms convolutional neural network (CNN) and spatial interpolation (SITP) by nearly a factor of two, but also achieves about 65.8\% root mean square error (RMSE) reduction compared to mean profile, which fully reflects the enhancement of overall profile matching by multi-source fusion and cross-modal attention.

View paper on

Share this with someone who'll enjoy it:

Title:A Multimodal Data Fusion Generative Adversarial Network for Real Time Underwater Sound Speed Field Construction

Paper and Code