Dongrui Liu

Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration?

Mar 04, 2026, via arXiv

SafeSci: Safety Evaluation of Large Language Models in Science Domains and Beyond

Mar 02, 2026, via arXiv

Toward Personalized LLM-Powered Agents: Foundations, Evaluation, and Future Directions

Feb 26, 2026, via arXiv

A Trajectory-Based Safety Audit of Clawdbot (OpenClaw)

Feb 16, 2026, via arXiv

DeepSight: An All-in-One LM Safety Toolkit

Feb 12, 2026, via arXiv

InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery

Feb 09, 2026, via arXiv

LPS-Bench: Benchmarking Safety Awareness of Computer-Use Agents in Long-Horizon Planning under Benign and Adversarial Scenarios

Feb 03, 2026, via arXiv

Interpreting Emergent Extreme Events in Multi-Agent Systems

Jan 28, 2026, via arXiv

RvB: Automating AI System Hardening via Iterative Red-Blue Games

Jan 27, 2026, via arXiv

AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security

Jan 26, 2026, via arXiv