Video Saliency Prediction


SalFormer360: a transformer-based saliency estimation model for 360-degree videos

Add code
Feb 04, 2026
Viaarxiv icon

Conditional Flow Matching for Visually-Guided Acoustic Highlighting

Add code
Feb 03, 2026
Viaarxiv icon

Gaze Prediction in Virtual Reality Without Eye Tracking Using Visual and Head Motion Cues

Add code
Jan 26, 2026
Viaarxiv icon

Not all Blends are Equal: The BLEMORE Dataset of Blended Emotion Expressions with Relative Salience Annotations

Add code
Jan 19, 2026
Viaarxiv icon

Convolutions Need Registers Too: HVS-Inspired Dynamic Attention for Video Quality Assessment

Add code
Jan 16, 2026
Viaarxiv icon

Moment and Highlight Detection via MLLM Frame Segmentation

Add code
Dec 13, 2025
Figure 1 for Moment and Highlight Detection via MLLM Frame Segmentation
Figure 2 for Moment and Highlight Detection via MLLM Frame Segmentation
Figure 3 for Moment and Highlight Detection via MLLM Frame Segmentation
Figure 4 for Moment and Highlight Detection via MLLM Frame Segmentation
Viaarxiv icon

Disentangled Concepts Speak Louder Than Words:Explainable Video Action Recognition

Add code
Nov 05, 2025
Viaarxiv icon

Text-Audio-Visual-conditioned Diffusion Model for Video Saliency Prediction

Add code
Apr 19, 2025
Figure 1 for Text-Audio-Visual-conditioned Diffusion Model for Video Saliency Prediction
Figure 2 for Text-Audio-Visual-conditioned Diffusion Model for Video Saliency Prediction
Figure 3 for Text-Audio-Visual-conditioned Diffusion Model for Video Saliency Prediction
Figure 4 for Text-Audio-Visual-conditioned Diffusion Model for Video Saliency Prediction
Viaarxiv icon

Saliency-guided Emotion Modeling: Predicting Viewer Reactions from Video Stimuli

Add code
May 25, 2025
Viaarxiv icon

Shallow Features Matter: Hierarchical Memory with Heterogeneous Interaction for Unsupervised Video Object Segmentation

Add code
Jul 30, 2025
Figure 1 for Shallow Features Matter: Hierarchical Memory with Heterogeneous Interaction for Unsupervised Video Object Segmentation
Figure 2 for Shallow Features Matter: Hierarchical Memory with Heterogeneous Interaction for Unsupervised Video Object Segmentation
Figure 3 for Shallow Features Matter: Hierarchical Memory with Heterogeneous Interaction for Unsupervised Video Object Segmentation
Figure 4 for Shallow Features Matter: Hierarchical Memory with Heterogeneous Interaction for Unsupervised Video Object Segmentation
Viaarxiv icon