Weichen Zhang

AdaptInfer: Adaptive Token Pruning for Vision-Language Model Inference with Dynamical Text Guidance

Aug 08, 2025, via arXiv

SynSeg: Feature Synergy for Multi-Category Contrastive Learning in Open-Vocabulary Semantic Segmentation

Aug 08, 2025, via arXiv

MIRAGE-Bench: LLM Agent is Hallucinating and Where to Find Them

Jul 28, 2025, via arXiv

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report

Jul 22, 2025, via arXiv

Self-Paced Collaborative and Adversarial Network for Unsupervised Domain Adaptation

Jun 24, 2025, via arXiv

Progressive Modality Cooperation for Multi-Modality Domain Adaptation

Jun 24, 2025, via arXiv

CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory

May 08, 2025, via arXiv

The Point, the Vision and the Text: Does Point Cloud Boost Spatial Reasoning of Large Language Models?

Apr 06, 2025, via arXiv

UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban Spaces

Mar 08, 2025, via arXiv

Understanding and Evaluating Hallucinations in 3D Visual Language Models

Feb 18, 2025, via arXiv