Picture for Kaifeng Lyu

Kaifeng Lyu

SPA: A Simple but Tough-to-Beat Baseline for Knowledge Injection

Add code
Mar 23, 2026
Viaarxiv icon

Fine-tuning MLLMs Without Forgetting Is Easier Than You Think

Add code
Mar 15, 2026
Viaarxiv icon

Can Small Training Runs Reliably Guide Data Curation? Rethinking Proxy-Model Practice

Add code
Dec 30, 2025
Viaarxiv icon

PCMind-2.1-Kaiyuan-2B Technical Report

Add code
Dec 08, 2025
Viaarxiv icon

Larger Datasets Can Be Repeated More: A Theoretical Analysis of Multi-Epoch Scaling in Linear Regression

Add code
Nov 17, 2025
Viaarxiv icon

When Bias Pretends to Be Truth: How Spurious Correlations Undermine Hallucination Detection in LLMs

Add code
Nov 10, 2025
Viaarxiv icon

How Far Are We from Optimal Reasoning Efficiency?

Add code
Jun 08, 2025
Viaarxiv icon

Data Mixing Can Induce Phase Transitions in Knowledge Acquisition

Add code
May 23, 2025
Viaarxiv icon

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

Add code
Mar 25, 2025
Figure 1 for LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?
Figure 2 for LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?
Figure 3 for LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?
Figure 4 for LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?
Viaarxiv icon

A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules

Add code
Mar 17, 2025
Viaarxiv icon