Zhewen Tan

TriPlay-RL: Tri-Role Self-Play Reinforcement Learning for LLM Safety Alignment

Jan 26, 2026
Via arXiv

ARC: Active and Reflection-driven Context Management for Long-Horizon Information Seeking Agents

Jan 17, 2026
Via arXiv

NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents

Dec 14, 2025
Via arXiv