
Qihui Zhu

HAWK: Head Importance-Aware Visual Token Pruning in Multimodal Models

Apr 09, 2026
Via arXiv

Improving Safety Alignment via Balanced Direct Preference Optimization

Mar 24, 2026
Via arXiv

Mind over Space: Can Multimodal Large Language Models Mentally Navigate?

Mar 23, 2026
Via arXiv

World2Mind: Cognition Toolkit for Allocentric Spatial Reasoning in Foundation Models

Mar 10, 2026
Via arXiv

From reactive to cognitive: brain-inspired spatial intelligence for embodied agents

Aug 24, 2025
Via arXiv

Enhancing Adversarial Robustness of Vision Language Models via Adversarial Mixture Prompt Tuning

May 23, 2025
Via arXiv