Picture for Aston Zhang

Aston Zhang

Jack

SMILE: Scaling Mixture-of-Experts with Efficient Bi-level Routing

Add code
Dec 10, 2022
Figure 1 for SMILE: Scaling Mixture-of-Experts with Efficient Bi-level Routing
Figure 2 for SMILE: Scaling Mixture-of-Experts with Efficient Bi-level Routing
Figure 3 for SMILE: Scaling Mixture-of-Experts with Efficient Bi-level Routing
Figure 4 for SMILE: Scaling Mixture-of-Experts with Efficient Bi-level Routing
Viaarxiv icon

Trap and Replace: Defending Backdoor Attacks by Trapping Them into an Easy-to-Replace Subnetwork

Add code
Oct 12, 2022
Figure 1 for Trap and Replace: Defending Backdoor Attacks by Trapping Them into an Easy-to-Replace Subnetwork
Figure 2 for Trap and Replace: Defending Backdoor Attacks by Trapping Them into an Easy-to-Replace Subnetwork
Figure 3 for Trap and Replace: Defending Backdoor Attacks by Trapping Them into an Easy-to-Replace Subnetwork
Figure 4 for Trap and Replace: Defending Backdoor Attacks by Trapping Them into an Easy-to-Replace Subnetwork
Viaarxiv icon

Automatic Chain of Thought Prompting in Large Language Models

Add code
Oct 07, 2022
Figure 1 for Automatic Chain of Thought Prompting in Large Language Models
Figure 2 for Automatic Chain of Thought Prompting in Large Language Models
Figure 3 for Automatic Chain of Thought Prompting in Large Language Models
Figure 4 for Automatic Chain of Thought Prompting in Large Language Models
Viaarxiv icon

Partial and Asymmetric Contrastive Learning for Out-of-Distribution Detection in Long-Tailed Recognition

Add code
Jul 04, 2022
Figure 1 for Partial and Asymmetric Contrastive Learning for Out-of-Distribution Detection in Long-Tailed Recognition
Figure 2 for Partial and Asymmetric Contrastive Learning for Out-of-Distribution Detection in Long-Tailed Recognition
Figure 3 for Partial and Asymmetric Contrastive Learning for Out-of-Distribution Detection in Long-Tailed Recognition
Figure 4 for Partial and Asymmetric Contrastive Learning for Out-of-Distribution Detection in Long-Tailed Recognition
Viaarxiv icon

Removing Batch Normalization Boosts Adversarial Training

Add code
Jul 04, 2022
Figure 1 for Removing Batch Normalization Boosts Adversarial Training
Figure 2 for Removing Batch Normalization Boosts Adversarial Training
Figure 3 for Removing Batch Normalization Boosts Adversarial Training
Figure 4 for Removing Batch Normalization Boosts Adversarial Training
Viaarxiv icon

MixGen: A New Multi-Modal Data Augmentation

Add code
Jun 16, 2022
Figure 1 for MixGen: A New Multi-Modal Data Augmentation
Figure 2 for MixGen: A New Multi-Modal Data Augmentation
Figure 3 for MixGen: A New Multi-Modal Data Augmentation
Figure 4 for MixGen: A New Multi-Modal Data Augmentation
Viaarxiv icon

Lightweight Convolutional Neural Networks By Hypercomplex Parameterization

Add code
Oct 08, 2021
Figure 1 for Lightweight Convolutional Neural Networks By Hypercomplex Parameterization
Figure 2 for Lightweight Convolutional Neural Networks By Hypercomplex Parameterization
Figure 3 for Lightweight Convolutional Neural Networks By Hypercomplex Parameterization
Figure 4 for Lightweight Convolutional Neural Networks By Hypercomplex Parameterization
Viaarxiv icon

Dive into Deep Learning

Add code
Jun 21, 2021
Figure 1 for Dive into Deep Learning
Viaarxiv icon

Controllable and Diverse Text Generation in E-commerce

Add code
Feb 23, 2021
Figure 1 for Controllable and Diverse Text Generation in E-commerce
Figure 2 for Controllable and Diverse Text Generation in E-commerce
Figure 3 for Controllable and Diverse Text Generation in E-commerce
Figure 4 for Controllable and Diverse Text Generation in E-commerce
Viaarxiv icon

Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications with $1/n$ Parameters

Add code
Feb 17, 2021
Figure 1 for Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications with $1/n$ Parameters
Figure 2 for Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications with $1/n$ Parameters
Figure 3 for Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications with $1/n$ Parameters
Figure 4 for Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications with $1/n$ Parameters
Viaarxiv icon