Picture for Ling Shao

Ling Shao

Terminus Group, Beijing, China

Contrastive Graph Modeling for Cross-Domain Few-Shot Medical Image Segmentation

Add code
Dec 25, 2025
Viaarxiv icon

Adversarial Robustness in Zero-Shot Learning:An Empirical Study on Class and Concept-Level Vulnerabilities

Add code
Dec 21, 2025
Viaarxiv icon

MetaTPT: Meta Test-time Prompt Tuning for Vision-Language Models

Add code
Dec 13, 2025
Viaarxiv icon

A Comprehensive Study on Visual Token Redundancy for Discrete Diffusion-based Multimodal Large Language Models

Add code
Nov 19, 2025
Viaarxiv icon

Spatial Preference Rewarding for MLLMs Spatial Understanding

Add code
Oct 16, 2025
Viaarxiv icon

Aesthetic Image Captioning with Saliency Enhanced MLLMs

Add code
Sep 04, 2025
Viaarxiv icon

Spatial-Temporal Graph Mamba for Music-Guided Dance Video Synthesis

Add code
Jul 09, 2025
Figure 1 for Spatial-Temporal Graph Mamba for Music-Guided Dance Video Synthesis
Figure 2 for Spatial-Temporal Graph Mamba for Music-Guided Dance Video Synthesis
Figure 3 for Spatial-Temporal Graph Mamba for Music-Guided Dance Video Synthesis
Figure 4 for Spatial-Temporal Graph Mamba for Music-Guided Dance Video Synthesis
Viaarxiv icon

Replace in Translation: Boost Concept Alignment in Counterfactual Text-to-Image

Add code
May 20, 2025
Viaarxiv icon

Exploring 3D Activity Reasoning and Planning: From Implicit Human Intentions to Route-Aware Planning

Add code
Mar 17, 2025
Viaarxiv icon

MambaIC: State Space Models for High-Performance Learned Image Compression

Add code
Mar 16, 2025
Viaarxiv icon