Picture for Yilin Yuan

Yilin Yuan

DTP: A Simple yet Effective Distracting Token Pruning Framework for Vision-Language Action Models

Add code
Jan 22, 2026
Viaarxiv icon

Multimodal Conversation Structure Understanding

Add code
May 23, 2025
Figure 1 for Multimodal Conversation Structure Understanding
Figure 2 for Multimodal Conversation Structure Understanding
Figure 3 for Multimodal Conversation Structure Understanding
Figure 4 for Multimodal Conversation Structure Understanding
Viaarxiv icon