Runyu Peng

Explicit Multi-head Attention for Inter-head Interaction in Large Language Models

Jan 27, 2026

How to Set the Batch Size for Large-Scale Pre-training?

Jan 08, 2026

AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models

Feb 24, 2025

Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation

Jun 20, 2024

WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset

Mar 12, 2024

Data-free Weight Compress and Denoise for Large Language Models

Feb 26, 2024