Video Segmentation


Refer-Agent: A Collaborative Multi-Agent System with Reasoning and Reflection for Referring Video Object Segmentation

Add code
Feb 03, 2026
Viaarxiv icon

Finding Optimal Video Moment without Training: Gaussian Boundary Optimization for Weakly Supervised Video Grounding

Add code
Feb 03, 2026
Viaarxiv icon

Multi-Objective Optimization for Synthetic-to-Real Style Transfer

Add code
Feb 03, 2026
Viaarxiv icon

SlowFocus: Enhancing Fine-grained Temporal Understanding in Video LLM

Add code
Feb 03, 2026
Viaarxiv icon

MLV-Edit: Towards Consistent and Highly Efficient Editing for Minute-Level Videos

Add code
Feb 02, 2026
Viaarxiv icon

MTC-VAE: Multi-Level Temporal Compression with Content Awareness

Add code
Feb 01, 2026
Viaarxiv icon

LongVPO: From Anchored Cues to Self-Reasoning for Long-Form Video Preference Optimization

Add code
Feb 02, 2026
Viaarxiv icon

Segment to Focus: Guiding Latent Action Models in the Presence of Distractors

Add code
Feb 02, 2026
Viaarxiv icon

LogicGaze: Benchmarking Causal Consistency in Visual Narratives via Counterfactual Verification

Add code
Jan 30, 2026
Viaarxiv icon

Sem-NaVAE: Semantically-Guided Outdoor Mapless Navigation via Generative Trajectory Priors

Add code
Feb 01, 2026
Viaarxiv icon