Zhenda Xie

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Jan 11, 2024
Via arXiv

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Jan 05, 2024
Via arXiv

DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

Oct 26, 2023
Via arXiv

High speed free-space optical communication using standard fiber communication component without optical amplification

Feb 27, 2023
Via arXiv

On Data Scaling in Masked Image Modeling

Jun 09, 2022
Via arXiv

Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation

May 27, 2022
Via arXiv

Revealing the Dark Secrets of Masked Image Modeling

May 27, 2022
Via arXiv

iCAR: Bridging Image Classification and Image-text Alignment for Visual Recognition

Apr 22, 2022
Via arXiv

SimMIM: A Simple Framework for Masked Image Modeling

Nov 18, 2021
Via arXiv

Swin Transformer V2: Scaling Up Capacity and Resolution

Nov 18, 2021
Via arXiv