Picture for Chaochao Lu

Chaochao Lu

AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

Add code
May 28, 2026
Viaarxiv icon

Harmony in Diversity: Multi-domain Contrastive Policy Optimization for Large Reasoning Models

Add code
May 25, 2026
Viaarxiv icon

Metacognition as Reward: Reinforcing LLM Reasoning via Knowledge and Regulation Signals

Add code
May 22, 2026
Viaarxiv icon

REFLECTOR: Internalizing Step-wise Reflection against Indirect Jailbreak

Add code
May 20, 2026
Viaarxiv icon

TSHA: A Benchmark for Visual Language Models in Trustworthy Safety Hazard Assessment Scenarios

Add code
Mar 31, 2026
Viaarxiv icon

Native Reasoning Models: Training Language Models to Reason on Unverifiable Data

Add code
Feb 12, 2026
Viaarxiv icon

Decoupled Reasoning with Implicit Fact Tokens (DRIFT): A Dual-Model Framework for Efficient Long-Context Inference

Add code
Feb 10, 2026
Viaarxiv icon

CauScale: Neural Causal Discovery at Scale

Add code
Feb 09, 2026
Viaarxiv icon

Can Post-Training Transform LLMs into Causal Reasoners?

Add code
Feb 06, 2026
Viaarxiv icon

Risky-Bench: Probing Agentic Safety Risks under Real-World Deployment

Add code
Feb 03, 2026
Viaarxiv icon