Picture for Pingyang Dai

Pingyang Dai

Can Unified Generation and Understanding Models Maintain Semantic Equivalence Across Different Output Modalities?

Add code
Feb 27, 2026
Viaarxiv icon

Unleashing MLLMs on the Edge: A Unified Framework for Cross-Modal ReID via Adaptive SVD Distillation

Add code
Feb 13, 2026
Viaarxiv icon

Evolving, Not Training: Zero-Shot Reasoning Segmentation via Evolutionary Prompting

Add code
Dec 31, 2025
Viaarxiv icon

Understanding What Is Not Said:Referring Remote Sensing Image Segmentation with Scarce Expressions

Add code
Oct 26, 2025
Viaarxiv icon

RIS-LAD: A Benchmark and Model for Referring Low-Altitude Drone Image Segmentation

Add code
Jul 28, 2025
Viaarxiv icon

More Clear, More Flexible, More Precise: A Comprehensive Oriented Object Detection benchmark for UAV

Add code
Apr 28, 2025
Viaarxiv icon

Knowing Where to Focus: Attention-Guided Alignment for Text-based Person Search

Add code
Dec 19, 2024
Figure 1 for Knowing Where to Focus: Attention-Guided Alignment for Text-based Person Search
Figure 2 for Knowing Where to Focus: Attention-Guided Alignment for Text-based Person Search
Figure 3 for Knowing Where to Focus: Attention-Guided Alignment for Text-based Person Search
Figure 4 for Knowing Where to Focus: Attention-Guided Alignment for Text-based Person Search
Viaarxiv icon

RLE: A Unified Perspective of Data Augmentation for Cross-Spectral Re-identification

Add code
Nov 02, 2024
Figure 1 for RLE: A Unified Perspective of Data Augmentation for Cross-Spectral Re-identification
Figure 2 for RLE: A Unified Perspective of Data Augmentation for Cross-Spectral Re-identification
Figure 3 for RLE: A Unified Perspective of Data Augmentation for Cross-Spectral Re-identification
Figure 4 for RLE: A Unified Perspective of Data Augmentation for Cross-Spectral Re-identification
Viaarxiv icon

PartFormer: Awakening Latent Diverse Representation from Vision Transformer for Object Re-Identification

Add code
Aug 29, 2024
Figure 1 for PartFormer: Awakening Latent Diverse Representation from Vision Transformer for Object Re-Identification
Figure 2 for PartFormer: Awakening Latent Diverse Representation from Vision Transformer for Object Re-Identification
Figure 3 for PartFormer: Awakening Latent Diverse Representation from Vision Transformer for Object Re-Identification
Figure 4 for PartFormer: Awakening Latent Diverse Representation from Vision Transformer for Object Re-Identification
Viaarxiv icon

Feature Denoising Diffusion Model for Blind Image Quality Assessment

Add code
Jan 22, 2024
Figure 1 for Feature Denoising Diffusion Model for Blind Image Quality Assessment
Figure 2 for Feature Denoising Diffusion Model for Blind Image Quality Assessment
Figure 3 for Feature Denoising Diffusion Model for Blind Image Quality Assessment
Figure 4 for Feature Denoising Diffusion Model for Blind Image Quality Assessment
Viaarxiv icon