Lin Sun

External Experience Serving in Production LLM Systems: A Deployment-Oriented Study of Quality-Cost Trade-offs

Jun 10, 2026 (via arXiv)

RealClawBench: Live OpenClaw Benchmarks from Real Developer-Agent Sessions

Jun 02, 2026 (via arXiv)

A Primer in Post-Training Reasoning Data: What We Know About How It Works

Jun 01, 2026 (via arXiv)

Harness-Bench: Measuring Harness Effects across Models in Realistic Agent Workflows

May 27, 2026 (via arXiv)

MemAudit: Post-hoc Auditing of Poisoned Agent Memory via Causal Attribution and Structural Anomaly Detection

May 22, 2026 (via arXiv)

Thinking with Reasoning Skills: Fewer Tokens, More Accuracy

Apr 23, 2026 (via arXiv)

Beyond Parameter Arithmetic: Sparse Complementary Fusion for Distribution-Aware Model Merging

Feb 12, 2026 (via arXiv)

Beyond Static Alignment: Hierarchical Policy Control for LLM Safety via Risk-Aware Chain-of-Thought

Feb 06, 2026 (via arXiv)

FABLE: Forest-Based Adaptive Bi-Path LLM-Enhanced Retrieval for Multi-Document Reasoning

Jan 26, 2026 (via arXiv)

TriPlay-RL: Tri-Role Self-Play Reinforcement Learning for LLM Safety Alignment

Jan 26, 2026 (via arXiv)