Picture for Dongrui Liu

Dongrui Liu

LPS-Bench: Benchmarking Safety Awareness of Computer-Use Agents in Long-Horizon Planning under Benign and Adversarial Scenarios

Add code
Feb 03, 2026
Viaarxiv icon

Interpreting Emergent Extreme Events in Multi-Agent Systems

Add code
Jan 28, 2026
Viaarxiv icon

RvB: Automating AI System Hardening via Iterative Red-Blue Games

Add code
Jan 27, 2026
Viaarxiv icon

AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security

Add code
Jan 26, 2026
Viaarxiv icon

INFA-Guard: Mitigating Malicious Propagation via Infection-Aware Safeguarding in LLM-Based Multi-Agent Systems

Add code
Jan 21, 2026
Viaarxiv icon

The Why Behind the Action: Unveiling Internal Drivers via Agentic Attribution

Add code
Jan 21, 2026
Viaarxiv icon

Loop as a Bridge: Can Looped Transformers Truly Link Representation Space and Natural Language Outputs?

Add code
Jan 15, 2026
Viaarxiv icon

ExGRPO: Learning to Reason from Experience

Add code
Oct 02, 2025
Figure 1 for ExGRPO: Learning to Reason from Experience
Figure 2 for ExGRPO: Learning to Reason from Experience
Figure 3 for ExGRPO: Learning to Reason from Experience
Figure 4 for ExGRPO: Learning to Reason from Experience
Viaarxiv icon

Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Delibration

Add code
Sep 18, 2025
Viaarxiv icon

The LLM Already Knows: Estimating LLM-Perceived Question Difficulty via Hidden Representations

Add code
Sep 16, 2025
Viaarxiv icon