Video Semantic Segmentation


Video semantic segmentation is the process of segmenting objects in videos into different classes or categories.

Dual Semantic-Aware Network for Noise Suppressed Ultrasound Video Segmentation

Add code
Jul 10, 2025
Viaarxiv icon

KeyRe-ID: Keypoint-Guided Person Re-Identification using Part-Aware Representation in Videos

Add code
Jul 10, 2025
Viaarxiv icon

LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion

Add code
Jul 03, 2025
Viaarxiv icon

Show, Tell and Summarize: Dense Video Captioning Using Visual Cue Aided Sentence Summarization

Add code
Jun 25, 2025
Viaarxiv icon

SAM4D: Segment Anything in Camera and LiDAR Streams

Add code
Jun 26, 2025
Viaarxiv icon

Dense Video Captioning using Graph-based Sentence Summarization

Add code
Jun 25, 2025
Viaarxiv icon

A Comprehensive Survey on Video Scene Parsing:Advances, Challenges, and Prospects

Add code
Jun 16, 2025
Viaarxiv icon

StgcDiff: Spatial-Temporal Graph Condition Diffusion for Sign Language Transition Generation

Add code
Jun 16, 2025
Viaarxiv icon

Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment

Add code
Jun 17, 2025
Viaarxiv icon

MS4UI: A Dataset for Multi-modal Summarization of User Interface Instructional Videos

Add code
Jun 14, 2025
Viaarxiv icon