Picture for Yuwei Wu

Yuwei Wu

GUI Knowledge Bench: Revealing the Knowledge Gap Behind VLM Failures in GUI Tasks

Add code
Oct 30, 2025
Viaarxiv icon

PIP-LLM: Integrating PDDL-Integer Programming with LLMs for Coordinating Multi-Robot Teams Using Natural Language

Add code
Oct 26, 2025
Viaarxiv icon

Beyond the Seen: Bounded Distribution Estimation for Open-Vocabulary Learning

Add code
Oct 06, 2025
Figure 1 for Beyond the Seen: Bounded Distribution Estimation for Open-Vocabulary Learning
Figure 2 for Beyond the Seen: Bounded Distribution Estimation for Open-Vocabulary Learning
Figure 3 for Beyond the Seen: Bounded Distribution Estimation for Open-Vocabulary Learning
Figure 4 for Beyond the Seen: Bounded Distribution Estimation for Open-Vocabulary Learning
Viaarxiv icon

Curvature Learning for Generalization of Hyperbolic Neural Networks

Add code
Aug 24, 2025
Viaarxiv icon

Sekai: A Video Dataset towards World Exploration

Add code
Jun 18, 2025
Viaarxiv icon

Hyperbolic Dual Feature Augmentation for Open-Environment

Add code
Jun 10, 2025
Viaarxiv icon

Large Language Models are Demonstration Pre-Selectors for Themselves

Add code
Jun 06, 2025
Viaarxiv icon

Multi-Sourced Compositional Generalization in Visual Question Answering

Add code
May 29, 2025
Viaarxiv icon

Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL

Add code
May 21, 2025
Figure 1 for Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL
Figure 2 for Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL
Figure 3 for Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL
Figure 4 for Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL
Viaarxiv icon

Diving into the Fusion of Monocular Priors for Generalized Stereo Matching

Add code
May 20, 2025
Figure 1 for Diving into the Fusion of Monocular Priors for Generalized Stereo Matching
Figure 2 for Diving into the Fusion of Monocular Priors for Generalized Stereo Matching
Figure 3 for Diving into the Fusion of Monocular Priors for Generalized Stereo Matching
Figure 4 for Diving into the Fusion of Monocular Priors for Generalized Stereo Matching
Viaarxiv icon