Picture for Shunping Ji

Shunping Ji

Towards One-to-Many Temporal Grounding

Add code
Jun 04, 2026
Viaarxiv icon

SaSaSaSa2VA: 2nd Place of the 5th PVUW MeViS-Text Track

Add code
Mar 28, 2026
Viaarxiv icon

SAMTok: Representing Any Mask with Two Words

Add code
Jan 22, 2026
Viaarxiv icon

Opt3DGS: Optimizing 3D Gaussian Splatting with Adaptive Exploration and Curvature-Aware Exploitation

Add code
Nov 17, 2025
Viaarxiv icon

SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment

Add code
Jul 03, 2025
Viaarxiv icon

Dense360: Dense Understanding from Omnidirectional Panoramas

Add code
Jun 17, 2025
Viaarxiv icon

Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding

Add code
Apr 14, 2025
Figure 1 for Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding
Figure 2 for Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding
Figure 3 for Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding
Figure 4 for Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding
Viaarxiv icon

Are They the Same? Exploring Visual Correspondence Shortcomings of Multimodal LLMs

Add code
Jan 08, 2025
Figure 1 for Are They the Same? Exploring Visual Correspondence Shortcomings of Multimodal LLMs
Figure 2 for Are They the Same? Exploring Visual Correspondence Shortcomings of Multimodal LLMs
Figure 3 for Are They the Same? Exploring Visual Correspondence Shortcomings of Multimodal LLMs
Figure 4 for Are They the Same? Exploring Visual Correspondence Shortcomings of Multimodal LLMs
Viaarxiv icon

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Add code
Jan 07, 2025
Figure 1 for Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Figure 2 for Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Figure 3 for Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Figure 4 for Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Viaarxiv icon

A Novel Shape Guided Transformer Network for Instance Segmentation in Remote Sensing Images

Add code
Dec 31, 2024
Figure 1 for A Novel Shape Guided Transformer Network for Instance Segmentation in Remote Sensing Images
Figure 2 for A Novel Shape Guided Transformer Network for Instance Segmentation in Remote Sensing Images
Figure 3 for A Novel Shape Guided Transformer Network for Instance Segmentation in Remote Sensing Images
Figure 4 for A Novel Shape Guided Transformer Network for Instance Segmentation in Remote Sensing Images
Viaarxiv icon