Yilun Yao

RealClawBench: Live OpenClaw Benchmarks from Real Developer-Agent Sessions

Jun 02, 2026
Via arXiv

ConMoE: Expert-Pool Consolidation via Prototype Reassignment for MoE Compression

May 28, 2026
Via arXiv

Harness-Bench: Measuring Harness Effects across Models in Realistic Agent Workflows

May 27, 2026
Via arXiv

MemAudit: Post-hoc Auditing of Poisoned Agent Memory via Causal Attribution and Structural Anomaly Detection

May 22, 2026
Via arXiv

Formal Skill: Programmable Runtime Skills for Efficient and Accurate LLM Agents

May 19, 2026
Via arXiv

ARC: Active and Reflection-driven Context Management for Long-Horizon Information Seeking Agents

Jan 17, 2026
Via arXiv

MergeMoE: Efficient Compression of MoE Models via Expert Output Merging

Oct 16, 2025
Via arXiv