Title: LBS: Loss-aware Bit Sharing for Automatic Model Compression

Feb 15, 2021

Jing Liu, Bohan Zhuang, Peng Chen, Yong Guo, Chunhua Shen, Jianfei Cai, Mingkui Tan

Figures 1 to 4 for LBS: Loss-aware Bit Sharing for Automatic Model Compression

Abstract: Low-bitwidth model compression is an effective method to reduce model size and computational overhead. Existing compression methods rely on compression configurations (such as pruning rates and/or bitwidths), which are often determined manually and are not optimal. Some attempts have been made to search for them automatically, but the optimization process is often very expensive. To alleviate this, we devise a simple yet effective method named Loss-aware Bit Sharing (LBS) to automatically search for optimal model compression configurations. To this end, we propose a novel single-path model to encode all candidate compression configurations, where a high-bitwidth quantized value can be decomposed into the sum of the lowest-bitwidth quantized value and a series of re-assignment offsets. We then introduce learnable binary gates to encode the choice of bitwidth, including filter-wise 0-bit for filter pruning. By training the binary gates jointly with the network parameters, the compression configuration of each layer can be determined automatically. Extensive experiments on both CIFAR-100 and ImageNet show that LBS is able to significantly reduce computational cost while preserving promising performance.
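The decomposition described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the uniform quantizer on [0, 1], the definition of each offset as the difference between two consecutive quantized values, and the names `quantize` and `bit_sharing_value` are all assumptions made for illustration. The gates here are fixed 0/1 inputs, whereas the paper learns them jointly with the network parameters, and it applies the 0-bit gate per filter rather than per scalar.

```python
def quantize(x, bits):
    """Uniform quantizer on [0, 1] with 2**bits levels (assumed, not from the paper)."""
    levels = 2 ** bits - 1
    x = min(max(x, 0.0), 1.0)  # clip to the quantization range
    return round(x * levels) / levels


def bit_sharing_value(x, bitwidths, gates):
    """Build a quantized value from the lowest-bitwidth value plus gated offsets.

    bitwidths: increasing candidate bitwidths, e.g. [2, 4, 8].
    gates:     one binary gate per candidate. gates[0] == 0 stands for 0-bit
               (the value is pruned); gates[j] == 1 adds the offset that moves
               the value from bitwidths[j - 1] to bitwidths[j].
    Gates are treated as nested: a closed gate disables all higher bitwidths.
    """
    if gates[0] == 0:
        return 0.0  # 0-bit: pruned
    value = quantize(x, bitwidths[0])
    for j in range(1, len(bitwidths)):
        if gates[j] == 0:
            break
        # Hypothetical re-assignment offset between consecutive bitwidths.
        value += quantize(x, bitwidths[j]) - quantize(x, bitwidths[j - 1])
    return value


if __name__ == "__main__":
    candidates = [2, 4, 8]
    for gates in ([0, 0, 0], [1, 0, 0], [1, 1, 0], [1, 1, 1]):
        print(gates, bit_sharing_value(0.3, candidates, gates))
```

Because the offsets telescope, opening every gate recovers the highest-bitwidth quantized value (up to floating-point rounding), and closing gates from the top falls back to lower bitwidths. All candidates therefore share one set of underlying values, which is the sense in which a single-path model can encode every configuration.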

* 22 pages
