Picture for Guangyao Su

Guangyao Su

Survive at All Costs: Exploring LLM's Risky Behaviors under Survival Pressure

Add code
Mar 05, 2026
Viaarxiv icon

CoCo-Bench: A Comprehensive Code Benchmark For Multi-task Large Language Model Evaluation

Add code
Apr 29, 2025
Viaarxiv icon