Picture for Yunke Wang

Yunke Wang

HiCI: Hierarchical Construction-Integration for Long-Context Attention

Add code
Mar 21, 2026
Viaarxiv icon

SmoothTurn: Learning to Turn Smoothly for Agile Navigation with Quadrupedal Robots

Add code
Mar 13, 2026
Viaarxiv icon

Affordance Field Intervention: Enabling VLAs to Escape Memory Traps in Robotic Manipulation

Add code
Dec 08, 2025
Viaarxiv icon

Action-aware Dynamic Pruning for Efficient Vision-Language-Action Manipulation

Add code
Sep 26, 2025
Viaarxiv icon

Marine Saliency Segmenter: Object-Focused Conditional Diffusion with Region-Level Semantic Knowledge Distillation

Add code
Apr 03, 2025
Viaarxiv icon

VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation

Add code
Feb 04, 2025
Figure 1 for VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation
Figure 2 for VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation
Figure 3 for VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation
Figure 4 for VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation
Viaarxiv icon

Color Correction Meets Cross-Spectral Refinement: A Distribution-Aware Diffusion for Underwater Image Restoration

Add code
Jan 08, 2025
Figure 1 for Color Correction Meets Cross-Spectral Refinement: A Distribution-Aware Diffusion for Underwater Image Restoration
Figure 2 for Color Correction Meets Cross-Spectral Refinement: A Distribution-Aware Diffusion for Underwater Image Restoration
Figure 3 for Color Correction Meets Cross-Spectral Refinement: A Distribution-Aware Diffusion for Underwater Image Restoration
Figure 4 for Color Correction Meets Cross-Spectral Refinement: A Distribution-Aware Diffusion for Underwater Image Restoration
Viaarxiv icon

FusionSAM: Latent Space driven Segment Anything Model for Multimodal Fusion and Segmentation

Add code
Aug 26, 2024
Figure 1 for FusionSAM: Latent Space driven Segment Anything Model for Multimodal Fusion and Segmentation
Figure 2 for FusionSAM: Latent Space driven Segment Anything Model for Multimodal Fusion and Segmentation
Figure 3 for FusionSAM: Latent Space driven Segment Anything Model for Multimodal Fusion and Segmentation
Figure 4 for FusionSAM: Latent Space driven Segment Anything Model for Multimodal Fusion and Segmentation
Viaarxiv icon

Collage Prompting: Budget-Friendly Visual Recognition with GPT-4V

Add code
Mar 18, 2024
Figure 1 for Collage Prompting: Budget-Friendly Visual Recognition with GPT-4V
Figure 2 for Collage Prompting: Budget-Friendly Visual Recognition with GPT-4V
Figure 3 for Collage Prompting: Budget-Friendly Visual Recognition with GPT-4V
Figure 4 for Collage Prompting: Budget-Friendly Visual Recognition with GPT-4V
Viaarxiv icon

Visual Imitation Learning with Calibrated Contrastive Representation

Add code
Jan 21, 2024
Viaarxiv icon