Picture for Xuemiao Zhang

Xuemiao Zhang

Enhancing LLMs via High-Knowledge Data Selection

Add code
May 20, 2025
Viaarxiv icon

FRAMES: Boosting LLMs with A Four-Quadrant Multi-Stage Pretraining Strategy

Add code
Feb 08, 2025
Figure 1 for FRAMES: Boosting LLMs with A Four-Quadrant Multi-Stage Pretraining Strategy
Figure 2 for FRAMES: Boosting LLMs with A Four-Quadrant Multi-Stage Pretraining Strategy
Figure 3 for FRAMES: Boosting LLMs with A Four-Quadrant Multi-Stage Pretraining Strategy
Figure 4 for FRAMES: Boosting LLMs with A Four-Quadrant Multi-Stage Pretraining Strategy
Viaarxiv icon

FIRE: Flexible Integration of Data Quality Ratings for Effective Pre-Training

Add code
Feb 02, 2025
Figure 1 for FIRE: Flexible Integration of Data Quality Ratings for Effective Pre-Training
Figure 2 for FIRE: Flexible Integration of Data Quality Ratings for Effective Pre-Training
Figure 3 for FIRE: Flexible Integration of Data Quality Ratings for Effective Pre-Training
Figure 4 for FIRE: Flexible Integration of Data Quality Ratings for Effective Pre-Training
Viaarxiv icon