Picture for Bingnan Liu

Bingnan Liu

WildRoadBench: A Wild Aerial Road-Damage Grounding Benchmark for Vision-Language Models and Autonomous Agents

Add code
May 19, 2026
Viaarxiv icon

Risky-Bench: Probing Agentic Safety Risks under Real-World Deployment

Add code
Feb 03, 2026
Viaarxiv icon