Ziyu Ma

DrVideo: Document Retrieval Based Long Video Understanding

Jun 18, 2024
Via arXiv

GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering

Feb 04, 2024
Via arXiv

Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation

Jul 05, 2022
Via arXiv

Hybrid Mutimodal Fusion for Dimensional Emotion Recognition

Oct 16, 2021
Via arXiv