Shunfeng Zhou

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Jan 05, 2024
Via arXiv

LAW: Learning to Auto Weight

May 27, 2019
Via arXiv

Dynamic Multi-path Neural Network

Apr 07, 2019
Via arXiv

Correlation Congruence for Knowledge Distillation

Apr 03, 2019
Via arXiv