Picture for Shaoyang Cui

Shaoyang Cui

VidNum-1.4K: A Comprehensive Benchmark for Video-based Numerical Reasoning

Add code
Apr 04, 2026
Viaarxiv icon

ClawTrap: A MITM-Based Red-Teaming Framework for Real-World OpenClaw Security Evaluation

Add code
Mar 19, 2026
Viaarxiv icon