Picture for Yunke Wang

Yunke Wang

Affordance Field Intervention: Enabling VLAs to Escape Memory Traps in Robotic Manipulation

Add code
Dec 08, 2025
Viaarxiv icon

Action-aware Dynamic Pruning for Efficient Vision-Language-Action Manipulation

Add code
Sep 26, 2025
Viaarxiv icon

Marine Saliency Segmenter: Object-Focused Conditional Diffusion with Region-Level Semantic Knowledge Distillation

Add code
Apr 03, 2025
Viaarxiv icon

VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation

Add code
Feb 04, 2025
Figure 1 for VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation
Figure 2 for VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation
Figure 3 for VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation
Figure 4 for VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation
Viaarxiv icon

Color Correction Meets Cross-Spectral Refinement: A Distribution-Aware Diffusion for Underwater Image Restoration

Add code
Jan 08, 2025
Figure 1 for Color Correction Meets Cross-Spectral Refinement: A Distribution-Aware Diffusion for Underwater Image Restoration
Figure 2 for Color Correction Meets Cross-Spectral Refinement: A Distribution-Aware Diffusion for Underwater Image Restoration
Figure 3 for Color Correction Meets Cross-Spectral Refinement: A Distribution-Aware Diffusion for Underwater Image Restoration
Figure 4 for Color Correction Meets Cross-Spectral Refinement: A Distribution-Aware Diffusion for Underwater Image Restoration
Viaarxiv icon

FusionSAM: Latent Space driven Segment Anything Model for Multimodal Fusion and Segmentation

Add code
Aug 26, 2024
Figure 1 for FusionSAM: Latent Space driven Segment Anything Model for Multimodal Fusion and Segmentation
Figure 2 for FusionSAM: Latent Space driven Segment Anything Model for Multimodal Fusion and Segmentation
Figure 3 for FusionSAM: Latent Space driven Segment Anything Model for Multimodal Fusion and Segmentation
Figure 4 for FusionSAM: Latent Space driven Segment Anything Model for Multimodal Fusion and Segmentation
Viaarxiv icon

Collage Prompting: Budget-Friendly Visual Recognition with GPT-4V

Add code
Mar 18, 2024
Figure 1 for Collage Prompting: Budget-Friendly Visual Recognition with GPT-4V
Figure 2 for Collage Prompting: Budget-Friendly Visual Recognition with GPT-4V
Figure 3 for Collage Prompting: Budget-Friendly Visual Recognition with GPT-4V
Figure 4 for Collage Prompting: Budget-Friendly Visual Recognition with GPT-4V
Viaarxiv icon

Visual Imitation Learning with Calibrated Contrastive Representation

Add code
Jan 21, 2024
Viaarxiv icon

MAEDiff: Masked Autoencoder-enhanced Diffusion Models for Unsupervised Anomaly Detection in Brain Images

Add code
Jan 19, 2024
Viaarxiv icon

Imitation Learning from Purified Demonstration

Add code
Oct 11, 2023
Viaarxiv icon