Weichen Zhang

Aerial World Model for Long-horizon Visual Generation and Navigation in 3D Space

Dec 26, 2025

AdaptInfer: Adaptive Token Pruning for Vision-Language Model Inference with Dynamical Text Guidance

Aug 08, 2025

SynSeg: Feature Synergy for Multi-Category Contrastive Learning in Open-Vocabulary Semantic Segmentation

Aug 08, 2025

MIRAGE-Bench: LLM Agent is Hallucinating and Where to Find Them

Jul 28, 2025

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report

Jul 22, 2025

Self-Paced Collaborative and Adversarial Network for Unsupervised Domain Adaptation

Jun 24, 2025

Progressive Modality Cooperation for Multi-Modality Domain Adaptation

Jun 24, 2025

CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory

May 08, 2025

The Point, the Vision and the Text: Does Point Cloud Boost Spatial Reasoning of Large Language Models?

Apr 06, 2025

UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban Spaces

Mar 08, 2025