Weihao Xuan

Direction-aware 3D Large Multimodal Models

Feb 22, 2026

Experience-Driven Multi-Agent Systems Are Training-free Context-aware Earth Observers

Jan 30, 2026

Sentipolis: Emotion-Aware Agents for Social Simulations

Jan 25, 2026

The Confidence Dichotomy: Analyzing and Mitigating Miscalibration in Tool-Use Agents

Jan 12, 2026

Toward Global Large Language Models in Medicine

Jan 05, 2026

Geo3DVQA: Evaluating Vision-Language Models for 3D Geospatial Reasoning from Aerial Imagery

Dec 21, 2025

Retrieval-Augmented Generation in Medicine: A Scoping Review of Technical Implementations, Clinical Applications, and Ethical Considerations

Nov 13, 2025

Taming Object Hallucinations with Verified Atomic Confidence Estimation

Nov 12, 2025

DynamicVL: Benchmarking Multimodal Large Language Models for Dynamic City Understanding

May 27, 2025

DisasterM3: A Remote Sensing Vision-Language Dataset for Disaster Damage Assessment and Response

May 27, 2025