Picture for Kaiwen Zhou

Kaiwen Zhou

SafePro: Evaluating the Safety of Professional-Level AI Agents

Add code
Jan 13, 2026
Viaarxiv icon

VPTracker: Global Vision-Language Tracking via Visual Prompt and MLLM

Add code
Dec 28, 2025
Viaarxiv icon

SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation

Add code
Nov 13, 2025
Figure 1 for SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation
Figure 2 for SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation
Figure 3 for SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation
Figure 4 for SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation
Viaarxiv icon

SIRAJ: Diverse and Efficient Red-Teaming for LLM Agents via Distilled Structured Reasoning

Add code
Oct 30, 2025
Viaarxiv icon

MESH -- Understanding Videos Like Human: Measuring Hallucinations in Large Video Models

Add code
Sep 10, 2025
Viaarxiv icon

Uncertainty-Aware GUI Agent: Adaptive Perception through Component Recommendation and Human-in-the-Loop Refinement

Add code
Aug 06, 2025
Viaarxiv icon

"PhyWorldBench": A Comprehensive Evaluation of Physical Realism in Text-to-Video Models

Add code
Jul 17, 2025
Viaarxiv icon

Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills

Add code
Jun 12, 2025
Figure 1 for Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills
Figure 2 for Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills
Figure 3 for Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills
Figure 4 for Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills
Viaarxiv icon

SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning

Add code
May 22, 2025
Figure 1 for SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning
Figure 2 for SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning
Figure 3 for SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning
Figure 4 for SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning
Viaarxiv icon

GUI-explorer: Autonomous Exploration and Mining of Transition-aware Knowledge for GUI Agent

Add code
May 22, 2025
Viaarxiv icon