Mingqi Gao

Re-Prompting SAM 3 via Object Retrieval: 3rd of the 5th PVUW MOSE Track

Mar 24, 2026
Via arXiv

Learning Trajectory-Aware Multimodal Large Language Models for Video Reasoning Segmentation

Mar 23, 2026
Via arXiv

Show Me When and Where: Towards Referring Video Object Segmentation in the Wild

Mar 15, 2026
Via arXiv

SAM-Body4D: Training-Free 4D Human Body Mesh Recovery from Videos

Dec 09, 2025
Via arXiv

ArtiWorld: LLM-Driven Articulation of 3D Objects in Scenes

Nov 18, 2025
Via arXiv

Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model

Sep 09, 2025
Via arXiv

Unlocking the Potential of Diffusion Priors in Blind Face Restoration

Aug 12, 2025
Via arXiv

LLM-driven Indoor Scene Layout Generation via Scaled Human-aligned Data Synthesis and Multi-Stage Preference Optimization

Jun 09, 2025
Via arXiv

THU-Warwick Submission for EPIC-KITCHEN Challenge 2025: Semi-Supervised Video Object Segmentation

Jun 07, 2025
Via arXiv

ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking

May 13, 2025
Via arXiv