Picture for Mustapha Abdullahi

Mustapha Abdullahi

Firebolt-VL: Efficient Vision-Language Understanding with Cross-Modality Modulation

Add code
Apr 07, 2026
Viaarxiv icon

Viper-F1: Fast and Fine-Grained Multimodal Understanding with Cross-Modal State-Space Modulation

Add code
Nov 18, 2025
Figure 1 for Viper-F1: Fast and Fine-Grained Multimodal Understanding with Cross-Modal State-Space Modulation
Figure 2 for Viper-F1: Fast and Fine-Grained Multimodal Understanding with Cross-Modal State-Space Modulation
Figure 3 for Viper-F1: Fast and Fine-Grained Multimodal Understanding with Cross-Modal State-Space Modulation
Figure 4 for Viper-F1: Fast and Fine-Grained Multimodal Understanding with Cross-Modal State-Space Modulation
Viaarxiv icon