Picture for Lingjun Li

Lingjun Li

Yuan 2.0-M32: Mixture of Experts with Attention Router

Add code
May 29, 2024
Figure 1 for Yuan 2.0-M32: Mixture of Experts with Attention Router
Figure 2 for Yuan 2.0-M32: Mixture of Experts with Attention Router
Figure 3 for Yuan 2.0-M32: Mixture of Experts with Attention Router
Figure 4 for Yuan 2.0-M32: Mixture of Experts with Attention Router
Viaarxiv icon

YUAN 2.0: A Large Language Model with Localized Filtering-based Attention

Add code
Dec 04, 2023
Figure 1 for YUAN 2.0: A Large Language Model with Localized Filtering-based Attention
Figure 2 for YUAN 2.0: A Large Language Model with Localized Filtering-based Attention
Figure 3 for YUAN 2.0: A Large Language Model with Localized Filtering-based Attention
Figure 4 for YUAN 2.0: A Large Language Model with Localized Filtering-based Attention
Viaarxiv icon