Picture for Wenjie Xiao

Wenjie Xiao

DeepTumorVQA: A Hierarchical 3D CT Benchmark for Stage-Wise Evaluation of Medical VLMs and Tool-Augmented Agents

Add code
May 10, 2026
Viaarxiv icon

RouteGuard: Internal-Signal Detection of Skill Poisoning in LLM Agents

Add code
Apr 24, 2026
Viaarxiv icon

TwinOR: Photorealistic Digital Twins of Dynamic Operating Rooms for Embodied AI Research

Add code
Nov 10, 2025
Viaarxiv icon

DNT: a Deeply Normalized Transformer that can be trained by Momentum SGD

Add code
Jul 23, 2025
Viaarxiv icon

Are Vision Language Models Ready for Clinical Diagnosis? A 3D Medical Benchmark for Tumor-centric Visual Question Answering

Add code
May 25, 2025
Viaarxiv icon