Kaifeng Lyu

Beyond Safe Data: Pretraining-Stage Alignment with Regular Safety Reflection

Jun 17, 2026

Fantastic Pretraining Optimizers and Where to Find Them II: Hyperball Optimization

Jun 15, 2026

Beyond Problem Solving: UOJ-Bench for Evaluating Code Generation, Hacking, and Repair in Competitive Programming

Jun 11, 2026

The Power of Power Law: Asymmetry Enables Compositional Reasoning

Apr 24, 2026

SPA: A Simple but Tough-to-Beat Baseline for Knowledge Injection

Mar 23, 2026

Fine-tuning MLLMs Without Forgetting Is Easier Than You Think

Mar 15, 2026

Can Small Training Runs Reliably Guide Data Curation? Rethinking Proxy-Model Practice

Dec 30, 2025

PCMind-2.1-Kaiyuan-2B Technical Report

Dec 08, 2025

Larger Datasets Can Be Repeated More: A Theoretical Analysis of Multi-Epoch Scaling in Linear Regression

Nov 17, 2025

When Bias Pretends to Be Truth: How Spurious Correlations Undermine Hallucination Detection in LLMs

Nov 10, 2025