Picture for Yanyun Qu

Yanyun Qu

DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off

Add code
Apr 15, 2026
Viaarxiv icon

Direct Segmentation without Logits Optimization for Training-Free Open-Vocabulary Semantic Segmentation

Add code
Apr 09, 2026
Viaarxiv icon

Beyond Semantics: Uncovering the Physics of Fakes via Universal Physical Descriptors for Cross-Modal Synthetic Detection

Add code
Apr 06, 2026
Viaarxiv icon

PC-CrossDiff: Point-Cluster Dual-Level Cross-Modal Differential Attention for Unified 3D Referring and Segmentation

Add code
Mar 18, 2026
Viaarxiv icon

SeqVLM: Proposal-Guided Multi-View Sequences Reasoning via VLM for Zero-Shot 3D Visual Grounding

Add code
Aug 28, 2025
Figure 1 for SeqVLM: Proposal-Guided Multi-View Sequences Reasoning via VLM for Zero-Shot 3D Visual Grounding
Figure 2 for SeqVLM: Proposal-Guided Multi-View Sequences Reasoning via VLM for Zero-Shot 3D Visual Grounding
Figure 3 for SeqVLM: Proposal-Guided Multi-View Sequences Reasoning via VLM for Zero-Shot 3D Visual Grounding
Figure 4 for SeqVLM: Proposal-Guided Multi-View Sequences Reasoning via VLM for Zero-Shot 3D Visual Grounding
Viaarxiv icon

UniLDiff: Unlocking the Power of Diffusion Priors for All-in-One Image Restoration

Add code
Jul 31, 2025
Figure 1 for UniLDiff: Unlocking the Power of Diffusion Priors for All-in-One Image Restoration
Figure 2 for UniLDiff: Unlocking the Power of Diffusion Priors for All-in-One Image Restoration
Figure 3 for UniLDiff: Unlocking the Power of Diffusion Priors for All-in-One Image Restoration
Figure 4 for UniLDiff: Unlocking the Power of Diffusion Priors for All-in-One Image Restoration
Viaarxiv icon

One-for-More: Continual Diffusion Model for Anomaly Detection

Add code
Feb 27, 2025
Viaarxiv icon

CLIP-FSAC++: Few-Shot Anomaly Classification with Anomaly Descriptor Based on CLIP

Add code
Dec 05, 2024
Figure 1 for CLIP-FSAC++: Few-Shot Anomaly Classification with Anomaly Descriptor Based on CLIP
Figure 2 for CLIP-FSAC++: Few-Shot Anomaly Classification with Anomaly Descriptor Based on CLIP
Figure 3 for CLIP-FSAC++: Few-Shot Anomaly Classification with Anomaly Descriptor Based on CLIP
Figure 4 for CLIP-FSAC++: Few-Shot Anomaly Classification with Anomaly Descriptor Based on CLIP
Viaarxiv icon

Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation

Add code
Oct 25, 2024
Figure 1 for Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation
Figure 2 for Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation
Figure 3 for Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation
Figure 4 for Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation
Viaarxiv icon

LLaCA: Multimodal Large Language Continual Assistant

Add code
Oct 08, 2024
Viaarxiv icon