Picture for Mingqi Gao

Mingqi Gao

Uni-OPD: Unifying On-Policy Distillation with a Dual-Perspective Recipe

Add code
May 05, 2026
Viaarxiv icon

Reinforcing 3D Understanding in Point-VLMs via Geometric Reward Credit Assignment

Add code
Apr 23, 2026
Viaarxiv icon

UniSurgSAM: A Unified Promptable Model for Reliable Surgical Video Segmentation

Add code
Apr 04, 2026
Viaarxiv icon

Re-Prompting SAM 3 via Object Retrieval: 3rd of the 5th PVUW MOSE Track

Add code
Mar 24, 2026
Viaarxiv icon

Learning Trajectory-Aware Multimodal Large Language Models for Video Reasoning Segmentation

Add code
Mar 23, 2026
Viaarxiv icon

Show Me When and Where: Towards Referring Video Object Segmentation in the Wild

Add code
Mar 15, 2026
Viaarxiv icon

SAM-Body4D: Training-Free 4D Human Body Mesh Recovery from Videos

Add code
Dec 09, 2025
Viaarxiv icon

ArtiWorld: LLM-Driven Articulation of 3D Objects in Scenes

Add code
Nov 18, 2025
Viaarxiv icon

Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model

Add code
Sep 09, 2025
Figure 1 for Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model
Figure 2 for Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model
Figure 3 for Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model
Figure 4 for Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model
Viaarxiv icon

Unlocking the Potential of Diffusion Priors in Blind Face Restoration

Add code
Aug 12, 2025
Figure 1 for Unlocking the Potential of Diffusion Priors in Blind Face Restoration
Figure 2 for Unlocking the Potential of Diffusion Priors in Blind Face Restoration
Figure 3 for Unlocking the Potential of Diffusion Priors in Blind Face Restoration
Figure 4 for Unlocking the Potential of Diffusion Priors in Blind Face Restoration
Viaarxiv icon