Xia Hu

TrinityGuard: A Unified Framework for Safeguarding Multi-Agent Systems

Mar 16, 2026
Via arXiv

RetroAgent: From Solving to Evolving via Retrospective Dual Intrinsic Feedback

Mar 12, 2026
Via arXiv

SafeSci: Safety Evaluation of Large Language Models in Science Domains and Beyond

Mar 02, 2026
Via arXiv

A Benchmark and Knowledge-Grounded Framework for Advanced Multimodal Personalization Study

Feb 22, 2026
Via arXiv

A Trajectory-Based Safety Audit of Clawdbot (OpenClaw)

Feb 16, 2026
Via arXiv

DeepSight: An All-in-One LM Safety Toolkit

Feb 12, 2026
Via arXiv

Decoupled Reasoning with Implicit Fact Tokens (DRIFT): A Dual-Model Framework for Efficient Long-Context Inference

Feb 10, 2026
Via arXiv

RAPO: Risk-Aware Preference Optimization for Generalizable Safe Reasoning

Feb 04, 2026
Via arXiv

LPS-Bench: Benchmarking Safety Awareness of Computer-Use Agents in Long-Horizon Planning under Benign and Adversarial Scenarios

Feb 03, 2026
Via arXiv

Interpreting Emergent Extreme Events in Multi-Agent Systems

Jan 28, 2026
Via arXiv