Picture for Shaoxiong Guo

Shaoxiong Guo

STaR-Attack: A Spatio-Temporal and Narrative Reasoning Attack Framework for Unified Multimodal Understanding and Generation Models

Add code
Sep 30, 2025
Figure 1 for STaR-Attack: A Spatio-Temporal and Narrative Reasoning Attack Framework for Unified Multimodal Understanding and Generation Models
Figure 2 for STaR-Attack: A Spatio-Temporal and Narrative Reasoning Attack Framework for Unified Multimodal Understanding and Generation Models
Figure 3 for STaR-Attack: A Spatio-Temporal and Narrative Reasoning Attack Framework for Unified Multimodal Understanding and Generation Models
Figure 4 for STaR-Attack: A Spatio-Temporal and Narrative Reasoning Attack Framework for Unified Multimodal Understanding and Generation Models
Viaarxiv icon

VGA: Vision GUI Assistant -- Minimizing Hallucinations through Image-Centric Fine-Tuning

Add code
Jun 20, 2024
Figure 1 for VGA: Vision GUI Assistant -- Minimizing Hallucinations through Image-Centric Fine-Tuning
Figure 2 for VGA: Vision GUI Assistant -- Minimizing Hallucinations through Image-Centric Fine-Tuning
Figure 3 for VGA: Vision GUI Assistant -- Minimizing Hallucinations through Image-Centric Fine-Tuning
Figure 4 for VGA: Vision GUI Assistant -- Minimizing Hallucinations through Image-Centric Fine-Tuning
Viaarxiv icon