Picture for Dongrui Liu

Dongrui Liu

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Add code
Apr 08, 2026
Viaarxiv icon

ATBench: A Diverse and Realistic Agent Trajectory Benchmark for Safety Evaluation and Diagnosis

Add code
Apr 08, 2026
Viaarxiv icon

ATBench: A Diverse and Realistic Trajectory Benchmark for Long-Horizon Agent Safety

Add code
Apr 02, 2026
Viaarxiv icon

Understanding and Defending VLM Jailbreaks via Jailbreak-Related Representation Shift

Add code
Mar 18, 2026
Viaarxiv icon

HomeGuard: VLM-based Embodied Safeguard for Identifying Contextual Risk in Household Task

Add code
Mar 15, 2026
Viaarxiv icon

Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration?

Add code
Mar 04, 2026
Viaarxiv icon

SafeSci: Safety Evaluation of Large Language Models in Science Domains and Beyond

Add code
Mar 02, 2026
Viaarxiv icon

Toward Personalized LLM-Powered Agents: Foundations, Evaluation, and Future Directions

Add code
Feb 26, 2026
Viaarxiv icon

A Trajectory-Based Safety Audit of Clawdbot (OpenClaw)

Add code
Feb 16, 2026
Viaarxiv icon

DeepSight: An All-in-One LM Safety Toolkit

Add code
Feb 12, 2026
Viaarxiv icon