Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Xiaofeng Gu

MCFNet: A Multimodal Collaborative Fusion Network for Fine-Grained Semantic Classification

May 29, 2025

Yang Qiao, Xiaoyu Zhong, Xiaofeng Gu, Zhiguo Yu

Figure 1 for MCFNet: A Multimodal Collaborative Fusion Network for Fine-Grained Semantic Classification

Figure 2 for MCFNet: A Multimodal Collaborative Fusion Network for Fine-Grained Semantic Classification

Figure 3 for MCFNet: A Multimodal Collaborative Fusion Network for Fine-Grained Semantic Classification

Figure 4 for MCFNet: A Multimodal Collaborative Fusion Network for Fine-Grained Semantic Classification

Abstract:Multimodal information processing has become increasingly important for enhancing image classification performance. However, the intricate and implicit dependencies across different modalities often hinder conventional methods from effectively capturing fine-grained semantic interactions, thereby limiting their applicability in high-precision classification tasks. To address this issue, we propose a novel Multimodal Collaborative Fusion Network (MCFNet) designed for fine-grained classification. The proposed MCFNet architecture incorporates a regularized integrated fusion module that improves intra-modal feature representation through modality-specific regularization strategies, while facilitating precise semantic alignment via a hybrid attention mechanism. Additionally, we introduce a multimodal decision classification module, which jointly exploits inter-modal correlations and unimodal discriminative features by integrating multiple loss functions within a weighted voting paradigm. Extensive experiments and ablation studies on benchmark datasets demonstrate that the proposed MCFNet framework achieves consistent improvements in classification accuracy, confirming its effectiveness in modeling subtle cross-modal semantics.

Via

Access Paper or Ask Questions