Picture for Jiawei Hong

Jiawei Hong

Not All Documents Are What You Need for Extracting Instruction Tuning Data

Add code
May 18, 2025
Viaarxiv icon

InternLM2 Technical Report

Add code
Mar 26, 2024
Figure 1 for InternLM2 Technical Report
Figure 2 for InternLM2 Technical Report
Figure 3 for InternLM2 Technical Report
Figure 4 for InternLM2 Technical Report
Viaarxiv icon

WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset

Add code
Mar 12, 2024
Figure 1 for WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
Figure 2 for WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
Figure 3 for WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
Figure 4 for WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
Viaarxiv icon

InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning

Add code
Feb 09, 2024
Figure 1 for InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning
Figure 2 for InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning
Figure 3 for InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning
Figure 4 for InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning
Viaarxiv icon

CoLLiE: Collaborative Training of Large Language Models in an Efficient Way

Add code
Dec 01, 2023
Figure 1 for CoLLiE: Collaborative Training of Large Language Models in an Efficient Way
Figure 2 for CoLLiE: Collaborative Training of Large Language Models in an Efficient Way
Figure 3 for CoLLiE: Collaborative Training of Large Language Models in an Efficient Way
Figure 4 for CoLLiE: Collaborative Training of Large Language Models in an Efficient Way
Viaarxiv icon