Picture for Weichen Zhang

Weichen Zhang

WorldFly: A World-Model-Based Vision-Language-Action Model for UAV Navigation

Add code
Jun 04, 2026
Viaarxiv icon

Agents' Last Exam

Add code
Jun 03, 2026
Viaarxiv icon

A Case for Agentic Tuning: From Documentation to Action in PostgreSQL

Add code
May 19, 2026
Viaarxiv icon

iWorld-Bench: A Benchmark for Interactive World Models with a Unified Action Generation Framework

Add code
May 06, 2026
Viaarxiv icon

A Benchmark for Interactive World Models with a Unified Action Generation Framework

Add code
May 05, 2026
Viaarxiv icon

How Far Are Large Multimodal Models from Human-Level Spatial Action? A Benchmark for Goal-Oriented Embodied Navigation in Urban Airspace

Add code
Apr 09, 2026
Viaarxiv icon

Neural Latent Arbitrary Lagrangian-Eulerian Grids for Fluid-Solid Interaction

Add code
Feb 28, 2026
Viaarxiv icon

Aerial World Model for Long-horizon Visual Generation and Navigation in 3D Space

Add code
Dec 26, 2025
Viaarxiv icon

AdaptInfer: Adaptive Token Pruning for Vision-Language Model Inference with Dynamical Text Guidance

Add code
Aug 08, 2025
Viaarxiv icon

SynSeg: Feature Synergy for Multi-Category Contrastive Learning in Open-Vocabulary Semantic Segmentation

Add code
Aug 08, 2025
Figure 1 for SynSeg: Feature Synergy for Multi-Category Contrastive Learning in Open-Vocabulary Semantic Segmentation
Figure 2 for SynSeg: Feature Synergy for Multi-Category Contrastive Learning in Open-Vocabulary Semantic Segmentation
Figure 3 for SynSeg: Feature Synergy for Multi-Category Contrastive Learning in Open-Vocabulary Semantic Segmentation
Figure 4 for SynSeg: Feature Synergy for Multi-Category Contrastive Learning in Open-Vocabulary Semantic Segmentation
Viaarxiv icon