Picture for Wei Xu

Wei Xu

Jailbreak Foundry: From Papers to Runnable Attacks for Reproducible Benchmarking

Add code
Mar 05, 2026
Viaarxiv icon

UniVBench: Towards Unified Evaluation for Video Foundation Models

Add code
Feb 25, 2026
Viaarxiv icon

Beyond Static Pipelines: Learning Dynamic Workflows for Text-to-SQL

Add code
Feb 17, 2026
Viaarxiv icon

Do Vision-Language Models Respect Contextual Integrity in Location Disclosure?

Add code
Feb 04, 2026
Viaarxiv icon

A Human-Centered Privacy Approach (HCP) to AI

Add code
Feb 04, 2026
Viaarxiv icon

GeoRC: A Benchmark for Geolocation Reasoning Chains

Add code
Jan 29, 2026
Viaarxiv icon

Reducing False Positives in Static Bug Detection with LLMs: An Empirical Study in Industry

Add code
Jan 26, 2026
Viaarxiv icon

Faithfulness vs. Safety: Evaluating LLM Behavior Under Counterfactual Medical Evidence

Add code
Jan 17, 2026
Viaarxiv icon

CogRail: Benchmarking VLMs in Cognitive Intrusion Perception for Intelligent Railway Transportation Systems

Add code
Jan 14, 2026
Viaarxiv icon

Variable-Length Wideband CSI Feedback via Loewner Interpolation and Deep Learning

Add code
Jan 13, 2026
Viaarxiv icon