Runyu Peng

Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation

Jun 20, 2024
Via arXiv

WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset

Mar 12, 2024
Via arXiv

Data-free Weight Compress and Denoise for Large Language Models

Feb 26, 2024
Via arXiv