Picture for Haiyang Wei

Haiyang Wei

Zero-Permission Manipulation: Can We Trust Large Multimodal Model Powered GUI Agents?

Add code
Jan 18, 2026
Viaarxiv icon

Boost Image Captioning with Knowledge Reasoning

Add code
Nov 02, 2020
Figure 1 for Boost Image Captioning with Knowledge Reasoning
Figure 2 for Boost Image Captioning with Knowledge Reasoning
Figure 3 for Boost Image Captioning with Knowledge Reasoning
Figure 4 for Boost Image Captioning with Knowledge Reasoning
Viaarxiv icon