Deyang Kong

Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective

May 23, 2025
Via arXiv

SampleMix: A Sample-wise Pre-training Data Mixing Strategy by Coordinating Data Quality and Diversity

Mar 03, 2025
Via arXiv