Picture for Shuo wang

Shuo wang

MF2Summ: Multimodal Fusion for Video Summarization with Temporal Alignment

Add code
Jun 12, 2025
Figure 1 for MF2Summ: Multimodal Fusion for Video Summarization with Temporal Alignment
Figure 2 for MF2Summ: Multimodal Fusion for Video Summarization with Temporal Alignment
Figure 3 for MF2Summ: Multimodal Fusion for Video Summarization with Temporal Alignment
Figure 4 for MF2Summ: Multimodal Fusion for Video Summarization with Temporal Alignment
Viaarxiv icon