Yuhui Xu

Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models

Feb 22, 2024
Xudong Lu, Qi Liu, Yuhui Xu, Aojun Zhou, Siyuan Huang, Bo Zhang, Junchi Yan, Hongsheng Li

QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models

Sep 26, 2023
Yuhui Xu, Lingxi Xie, Xiaotao Gu, Xin Chen, Heng Chang, Hengheng Zhang, Zhensu Chen, Xiaopeng Zhang, Qi Tian

Batch Normalization with Enhanced Linear Transformation

Nov 28, 2020
Yuhui Xu, Lingxi Xie, Cihang Xie, Jieru Mei, Siyuan Qiao, Wei Shen, Hongkai Xiong, Alan Yuille

Weight-Sharing Neural Architecture Search: A Battle to Shrink the Optimization Gap

Aug 05, 2020
Lingxi Xie, Xin Chen, Kaifeng Bi, Longhui Wei, Yuhui Xu, Zhengsu Chen, Lanfei Wang, An Xiao, Jianlong Chang, Xiaopeng Zhang, Qi Tian

TRP: Trained Rank Pruning for Efficient Deep Neural Networks

Apr 30, 2020
Yuhui Xu, Yuxi Li, Shuai Zhang, Wei Wen, Botao Wang, Yingyong Qi, Yiran Chen, Weiyao Lin, Hongkai Xiong

Fitting the Search Space of Weight-sharing NAS with Graph Convolutional Networks

Apr 17, 2020
Xin Chen, Lingxi Xie, Jun Wu, Longhui Wei, Yuhui Xu, Qi Tian

Latency-Aware Differentiable Neural Architecture Search

Jan 17, 2020
Yuhui Xu, Lingxi Xie, Xiaopeng Zhang, Xin Chen, Bowen Shi, Qi Tian, Hongkai Xiong

Trained Rank Pruning for Efficient Deep Neural Networks

Oct 11, 2019
Yuhui Xu, Yuxi Li, Shuai Zhang, Wei Wen, Botao Wang, Wenrui Dai, Yingyong Qi, Yiran Chen, Weiyao Lin, Hongkai Xiong

Trained Rank Pruning for Efficient Deep Neural Networks

Oct 09, 2019
Yuhui Xu, Yuxi Li, Shuai Zhang, Wei Wen, Botao Wang, Wenrui Dai, Yingyong Qi, Yiran Chen, Weiyao Lin, Hongkai Xiong

PC-DARTS: Partial Channel Connections for Memory-Efficient Differentiable Architecture Search

Jul 12, 2019
Yuhui Xu, Lingxi Xie, Xiaopeng Zhang, Xin Chen, Guo-Jun Qi, Qi Tian, Hongkai Xiong
