Picture for Shuhao Gu

Shuhao Gu

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Add code
May 12, 2025
Viaarxiv icon

CCI3.0-HQ: a large-scale Chinese dataset of high quality designed for pre-training large language models

Add code
Oct 24, 2024
Viaarxiv icon

Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data

Add code
Oct 24, 2024
Figure 1 for Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data
Figure 2 for Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data
Figure 3 for Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data
Figure 4 for Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data
Viaarxiv icon

ReTok: Replacing Tokenizer to Enhance Representation Efficiency in Large Language Model

Add code
Oct 06, 2024
Viaarxiv icon

Aquila2 Technical Report

Add code
Aug 14, 2024
Viaarxiv icon

AquilaMoE: Efficient Training for MoE Models with Scale-Up and Scale-Out Strategies

Add code
Aug 13, 2024
Viaarxiv icon

Addressing the Length Bias Problem in Document-Level Neural Machine Translation

Add code
Nov 20, 2023
Viaarxiv icon

Enhancing Neural Machine Translation with Semantic Units

Add code
Oct 17, 2023
Figure 1 for Enhancing Neural Machine Translation with Semantic Units
Figure 2 for Enhancing Neural Machine Translation with Semantic Units
Figure 3 for Enhancing Neural Machine Translation with Semantic Units
Figure 4 for Enhancing Neural Machine Translation with Semantic Units
Viaarxiv icon

Continual Learning of Neural Machine Translation within Low Forgetting Risk Regions

Add code
Nov 04, 2022
Figure 1 for Continual Learning of Neural Machine Translation within Low Forgetting Risk Regions
Figure 2 for Continual Learning of Neural Machine Translation within Low Forgetting Risk Regions
Figure 3 for Continual Learning of Neural Machine Translation within Low Forgetting Risk Regions
Figure 4 for Continual Learning of Neural Machine Translation within Low Forgetting Risk Regions
Viaarxiv icon

Improving Zero-Shot Multilingual Translation with Universal Representations and Cross-Mappings

Add code
Oct 28, 2022
Figure 1 for Improving Zero-Shot Multilingual Translation with Universal Representations and Cross-Mappings
Figure 2 for Improving Zero-Shot Multilingual Translation with Universal Representations and Cross-Mappings
Figure 3 for Improving Zero-Shot Multilingual Translation with Universal Representations and Cross-Mappings
Figure 4 for Improving Zero-Shot Multilingual Translation with Universal Representations and Cross-Mappings
Viaarxiv icon