Picture for Yujun Zhou

Yujun Zhou

ASAP: Agent-System Co-Design for Wall-Clock-Centered Auto HPO Research for ML Experiments

Add code
Jun 23, 2026
Viaarxiv icon

Getting Better at Working With You: Compiling User Corrections into Runtime Enforcement for Coding Agents

Add code
Jun 11, 2026
Viaarxiv icon

AIRGuard: Guarding Agent Actions with Runtime Authority Control

Add code
May 27, 2026
Viaarxiv icon

AgentTrap: Measuring Runtime Trust Failures in Third-Party Agent Skills

Add code
May 13, 2026
Viaarxiv icon

Too Correct to Learn: Reinforcement Learning on Saturated Reasoning Data

Add code
Apr 20, 2026
Viaarxiv icon

PolicyLLM: Towards Excellent Comprehension of Public Policy for Large Language Models

Add code
Apr 14, 2026
Viaarxiv icon

Position: General Alignment Has Hit a Ceiling; Edge Alignment Must Be Taken Seriously

Add code
Feb 23, 2026
Viaarxiv icon

ProbeLLM: Automating Principled Diagnosis of LLM Failures

Add code
Feb 13, 2026
Viaarxiv icon

Capability-Oriented Training Induced Alignment Risk

Add code
Feb 12, 2026
Viaarxiv icon

Save the Good Prefix: Precise Error Penalization via Process-Supervised RL to Enhance LLM Reasoning

Add code
Jan 26, 2026
Viaarxiv icon