Picture for Xipeng Qiu

Xipeng Qiu

Balanced Data Sampling for Language Model Training with Clustering

Add code
Feb 22, 2024
Figure 1 for Balanced Data Sampling for Language Model Training with Clustering
Figure 2 for Balanced Data Sampling for Language Model Training with Clustering
Figure 3 for Balanced Data Sampling for Language Model Training with Clustering
Figure 4 for Balanced Data Sampling for Language Model Training with Clustering
Viaarxiv icon

LongWanjuan: Towards Systematic Measurement for Long Text Quality

Add code
Feb 22, 2024
Viaarxiv icon

Hint-before-Solving Prompting: Guiding LLMs to Effectively Utilize Encoded Knowledge

Add code
Feb 22, 2024
Figure 1 for Hint-before-Solving Prompting: Guiding LLMs to Effectively Utilize Encoded Knowledge
Figure 2 for Hint-before-Solving Prompting: Guiding LLMs to Effectively Utilize Encoded Knowledge
Figure 3 for Hint-before-Solving Prompting: Guiding LLMs to Effectively Utilize Encoded Knowledge
Figure 4 for Hint-before-Solving Prompting: Guiding LLMs to Effectively Utilize Encoded Knowledge
Viaarxiv icon

Turn Waste into Worth: Rectifying Top-$k$ Router of MoE

Add code
Feb 21, 2024
Figure 1 for Turn Waste into Worth: Rectifying Top-$k$ Router of MoE
Figure 2 for Turn Waste into Worth: Rectifying Top-$k$ Router of MoE
Figure 3 for Turn Waste into Worth: Rectifying Top-$k$ Router of MoE
Figure 4 for Turn Waste into Worth: Rectifying Top-$k$ Router of MoE
Viaarxiv icon

Code Needs Comments: Enhancing Code LLMs with Comment Augmentation

Add code
Feb 20, 2024
Figure 1 for Code Needs Comments: Enhancing Code LLMs with Comment Augmentation
Figure 2 for Code Needs Comments: Enhancing Code LLMs with Comment Augmentation
Figure 3 for Code Needs Comments: Enhancing Code LLMs with Comment Augmentation
Figure 4 for Code Needs Comments: Enhancing Code LLMs with Comment Augmentation
Viaarxiv icon

Identifying Semantic Induction Heads to Understand In-Context Learning

Add code
Feb 20, 2024
Figure 1 for Identifying Semantic Induction Heads to Understand In-Context Learning
Figure 2 for Identifying Semantic Induction Heads to Understand In-Context Learning
Figure 3 for Identifying Semantic Induction Heads to Understand In-Context Learning
Figure 4 for Identifying Semantic Induction Heads to Understand In-Context Learning
Viaarxiv icon

Dictionary Learning Improves Patch-Free Circuit Discovery in Mechanistic Interpretability: A Case Study on Othello-GPT

Add code
Feb 19, 2024
Figure 1 for Dictionary Learning Improves Patch-Free Circuit Discovery in Mechanistic Interpretability: A Case Study on Othello-GPT
Figure 2 for Dictionary Learning Improves Patch-Free Circuit Discovery in Mechanistic Interpretability: A Case Study on Othello-GPT
Figure 3 for Dictionary Learning Improves Patch-Free Circuit Discovery in Mechanistic Interpretability: A Case Study on Othello-GPT
Figure 4 for Dictionary Learning Improves Patch-Free Circuit Discovery in Mechanistic Interpretability: A Case Study on Othello-GPT
Viaarxiv icon

LLM can Achieve Self-Regulation via Hyperparameter Aware Generation

Add code
Feb 17, 2024
Viaarxiv icon

InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning

Add code
Feb 09, 2024
Figure 1 for InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning
Figure 2 for InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning
Figure 3 for InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning
Figure 4 for InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning
Viaarxiv icon

MouSi: Poly-Visual-Expert Vision-Language Models

Add code
Jan 30, 2024
Viaarxiv icon