Picture for Juyong Jiang

Juyong Jiang

A Survey on Mixture of Experts

Add code
Jun 26, 2024
Viaarxiv icon

HyperCLOVA X Technical Report

Add code
Apr 13, 2024
Viaarxiv icon

Shortcut-connected Expert Parallelism for Accelerating Mixture-of-Experts

Add code
Apr 07, 2024
Viaarxiv icon

Feature-Balanced Loss for Long-Tailed Visual Recognition

Add code
May 18, 2023
Figure 1 for Feature-Balanced Loss for Long-Tailed Visual Recognition
Figure 2 for Feature-Balanced Loss for Long-Tailed Visual Recognition
Figure 3 for Feature-Balanced Loss for Long-Tailed Visual Recognition
Figure 4 for Feature-Balanced Loss for Long-Tailed Visual Recognition
Viaarxiv icon

Dynamic Adaptive and Adversarial Graph Convolutional Network for Traffic Forecasting

Add code
Aug 05, 2022
Figure 1 for Dynamic Adaptive and Adversarial Graph Convolutional Network for Traffic Forecasting
Figure 2 for Dynamic Adaptive and Adversarial Graph Convolutional Network for Traffic Forecasting
Figure 3 for Dynamic Adaptive and Adversarial Graph Convolutional Network for Traffic Forecasting
Figure 4 for Dynamic Adaptive and Adversarial Graph Convolutional Network for Traffic Forecasting
Viaarxiv icon

AdaMCT: Adaptive Mixture of CNN-Transformer for Sequential Recommendation

Add code
May 19, 2022
Figure 1 for AdaMCT: Adaptive Mixture of CNN-Transformer for Sequential Recommendation
Figure 2 for AdaMCT: Adaptive Mixture of CNN-Transformer for Sequential Recommendation
Figure 3 for AdaMCT: Adaptive Mixture of CNN-Transformer for Sequential Recommendation
Figure 4 for AdaMCT: Adaptive Mixture of CNN-Transformer for Sequential Recommendation
Viaarxiv icon

Compact Neural Networks via Stacking Designed Basic Units

Add code
May 03, 2022
Figure 1 for Compact Neural Networks via Stacking Designed Basic Units
Figure 2 for Compact Neural Networks via Stacking Designed Basic Units
Figure 3 for Compact Neural Networks via Stacking Designed Basic Units
Figure 4 for Compact Neural Networks via Stacking Designed Basic Units
Viaarxiv icon

Vertical Federated Principal Component Analysis and Its Kernel Extension on Feature-wise Distributed Data

Add code
Mar 03, 2022
Figure 1 for Vertical Federated Principal Component Analysis and Its Kernel Extension on Feature-wise Distributed Data
Figure 2 for Vertical Federated Principal Component Analysis and Its Kernel Extension on Feature-wise Distributed Data
Figure 3 for Vertical Federated Principal Component Analysis and Its Kernel Extension on Feature-wise Distributed Data
Figure 4 for Vertical Federated Principal Component Analysis and Its Kernel Extension on Feature-wise Distributed Data
Viaarxiv icon

Sequential Recommendation with Bidirectional Chronological Augmentation of Transformer

Add code
Dec 13, 2021
Figure 1 for Sequential Recommendation with Bidirectional Chronological Augmentation of Transformer
Figure 2 for Sequential Recommendation with Bidirectional Chronological Augmentation of Transformer
Figure 3 for Sequential Recommendation with Bidirectional Chronological Augmentation of Transformer
Figure 4 for Sequential Recommendation with Bidirectional Chronological Augmentation of Transformer
Viaarxiv icon

Cascaded Semantic and Positional Self-Attention Network for Document Classification

Add code
Sep 19, 2020
Figure 1 for Cascaded Semantic and Positional Self-Attention Network for Document Classification
Figure 2 for Cascaded Semantic and Positional Self-Attention Network for Document Classification
Figure 3 for Cascaded Semantic and Positional Self-Attention Network for Document Classification
Figure 4 for Cascaded Semantic and Positional Self-Attention Network for Document Classification
Viaarxiv icon