Ruolin Chen

ForesightSafety Bench: A Frontier Risk Evaluation and Governance Framework towards Safe AI

Feb 15, 2026
Via arXiv

CogToM: A Comprehensive Theory of Mind Benchmark inspired by Human Cognition for Large Language Models

Jan 22, 2026
Via arXiv

Towards Reliable Evaluation of Adversarial Robustness for Spiking Neural Networks

Dec 27, 2025
Via arXiv

ScreenAudit: Detecting Screen Reader Accessibility Errors in Mobile Apps Using Large Language Models

Apr 02, 2025
Via arXiv