Picture for Bindu Verma

Bindu Verma

Tri-FusionNet: Enhancing Image Description Generation with Transformer-based Fusion Network and Dual Attention Mechanism

Add code
Apr 23, 2025
Figure 1 for Tri-FusionNet: Enhancing Image Description Generation with Transformer-based Fusion Network and Dual Attention Mechanism
Figure 2 for Tri-FusionNet: Enhancing Image Description Generation with Transformer-based Fusion Network and Dual Attention Mechanism
Figure 3 for Tri-FusionNet: Enhancing Image Description Generation with Transformer-based Fusion Network and Dual Attention Mechanism
Figure 4 for Tri-FusionNet: Enhancing Image Description Generation with Transformer-based Fusion Network and Dual Attention Mechanism
Viaarxiv icon

Advanced Chest X-Ray Analysis via Transformer-Based Image Descriptors and Cross-Model Attention Mechanism

Add code
Apr 23, 2025
Viaarxiv icon

Towards Explainable AI: Multi-Modal Transformer for Video-based Image Description Generation

Add code
Apr 23, 2025
Figure 1 for Towards Explainable AI: Multi-Modal Transformer for Video-based Image Description Generation
Figure 2 for Towards Explainable AI: Multi-Modal Transformer for Video-based Image Description Generation
Figure 3 for Towards Explainable AI: Multi-Modal Transformer for Video-based Image Description Generation
Figure 4 for Towards Explainable AI: Multi-Modal Transformer for Video-based Image Description Generation
Viaarxiv icon