Kaifeng Lyu

Can Small Training Runs Reliably Guide Data Curation? Rethinking Proxy-Model Practice

Dec 30, 2025
Via arXiv

PCMind-2.1-Kaiyuan-2B Technical Report

Dec 08, 2025
Via arXiv

Larger Datasets Can Be Repeated More: A Theoretical Analysis of Multi-Epoch Scaling in Linear Regression

Nov 17, 2025
Via arXiv

When Bias Pretends to Be Truth: How Spurious Correlations Undermine Hallucination Detection in LLMs

Nov 10, 2025
Via arXiv

How Far Are We from Optimal Reasoning Efficiency?

Jun 08, 2025
Via arXiv

Data Mixing Can Induce Phase Transitions in Knowledge Acquisition

May 23, 2025
Via arXiv

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

Mar 25, 2025
Via arXiv

A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules

Mar 17, 2025
Via arXiv

Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias

Mar 05, 2025
Via arXiv

Weak-to-Strong Generalization Even in Random Feature Networks, Provably

Mar 04, 2025
Via arXiv