Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hsin-Pai Cheng

AutoShrink: A Topology-aware NAS for Discovering Efficient Neural Architecture

Nov 21, 2019

Tunhou Zhang, Hsin-Pai Cheng, Zhenwen Li, Feng Yan, Chengyu Huang, Hai Li, Yiran Chen

Figure 1 for AutoShrink: A Topology-aware NAS for Discovering Efficient Neural Architecture

Figure 2 for AutoShrink: A Topology-aware NAS for Discovering Efficient Neural Architecture

Figure 3 for AutoShrink: A Topology-aware NAS for Discovering Efficient Neural Architecture

Figure 4 for AutoShrink: A Topology-aware NAS for Discovering Efficient Neural Architecture

Abstract:Resource is an important constraint when deploying Deep Neural Networks (DNNs) on mobile and edge devices. Existing works commonly adopt the cell-based search approach, which limits the flexibility of network patterns in learned cell structures. Moreover, due to the topology-agnostic nature of existing works, including both cell-based and node-based approaches, the search process is time consuming and the performance of found architecture may be sub-optimal. To address these problems, we propose AutoShrink, a topology-aware Neural Architecture Search(NAS) for searching efficient building blocks of neural architectures. Our method is node-based and thus can learn flexible network patterns in cell structures within a topological search space. Directed Acyclic Graphs (DAGs) are used to abstract DNN architectures and progressively optimize the cell structure through edge shrinking. As the search space intrinsically reduces as the edges are progressively shrunk, AutoShrink explores more flexible search space with even less search time. We evaluate AutoShrink on image classification and language tasks by crafting ShrinkCNN and ShrinkRNN models. ShrinkCNN is able to achieve up to 48% parameter reduction and save 34% Multiply-Accumulates (MACs) on ImageNet-1K with comparable accuracy of state-of-the-art (SOTA) models. Specifically, both ShrinkCNN and ShrinkRNN are crafted within 1.5 GPU hours, which is 7.2x and 6.7x faster than the crafting time of SOTA CNN and RNN models, respectively.

Via

Access Paper or Ask Questions

SwiftNet: Using Graph Propagation as Meta-knowledge to Search Highly Representative Neural Architectures

Jun 25, 2019

Hsin-Pai Cheng, Tunhou Zhang, Yukun Yang, Feng Yan, Shiyu Li, Harris Teague, Hai Li, Yiran Chen

Figure 1 for SwiftNet: Using Graph Propagation as Meta-knowledge to Search Highly Representative Neural Architectures

Figure 2 for SwiftNet: Using Graph Propagation as Meta-knowledge to Search Highly Representative Neural Architectures

Figure 3 for SwiftNet: Using Graph Propagation as Meta-knowledge to Search Highly Representative Neural Architectures

Figure 4 for SwiftNet: Using Graph Propagation as Meta-knowledge to Search Highly Representative Neural Architectures

Abstract:Designing neural architectures for edge devices is subject to constraints of accuracy, inference latency, and computational cost. Traditionally, researchers manually craft deep neural networks to meet the needs of mobile devices. Neural Architecture Search (NAS) was proposed to automate the neural architecture design without requiring extensive domain expertise and significant manual efforts. Recent works utilized NAS to design mobile models by taking into account hardware constraints and achieved state-of-the-art accuracy with fewer parameters and less computational cost measured in Multiply-accumulates (MACs). To find highly compact neural architectures, existing works relies on predefined cells and directly applying width multiplier, which may potentially limit the model flexibility, reduce the useful feature map information, and cause accuracy drop. To conquer this issue, we propose GRAM(GRAph propagation as Meta-knowledge) that adopts fine-grained (node-wise) search method and accumulates the knowledge learned in updates into a meta-graph. As a result, GRAM can enable more flexible search space and achieve higher search efficiency. Without the constraints of predefined cell or blocks, we propose a new structure-level pruning method to remove redundant operations in neural architectures. SwiftNet, which is a set of models discovered by GRAM, outperforms MobileNet-V2 by 2.15x higher accuracy density and 2.42x faster with similar accuracy. Compared with FBNet, SwiftNet reduces the search cost by 26x and achieves 2.35x higher accuracy density and 1.47x speedup while preserving similar accuracy. SwiftNetcan obtain 63.28% top-1 accuracy on ImageNet-1K with only 53M MACs and 2.07M parameters. The corresponding inference latency is only 19.09 ms on Google Pixel 1.

Via

Access Paper or Ask Questions

Low-Power Computer Vision: Status, Challenges, Opportunities

Apr 15, 2019

Sergei Alyamkin, Matthew Ardi, Alexander C. Berg, Achille Brighton, Bo Chen, Yiran Chen, Hsin-Pai Cheng, Zichen Fan, Chen Feng, Bo Fu(+34 more)

Figure 1 for Low-Power Computer Vision: Status, Challenges, Opportunities

Figure 2 for Low-Power Computer Vision: Status, Challenges, Opportunities

Figure 3 for Low-Power Computer Vision: Status, Challenges, Opportunities

Figure 4 for Low-Power Computer Vision: Status, Challenges, Opportunities

Abstract:Computer vision has achieved impressive progress in recent years. Meanwhile, mobile phones have become the primary computing platforms for millions of people. In addition to mobile phones, many autonomous systems rely on visual data for making decisions and some of these systems have limited energy (such as unmanned aerial vehicles also called drones and mobile robots). These systems rely on batteries and energy efficiency is critical. This article serves two main purposes: (1) Examine the state-of-the-art for low-power solutions to detect objects in images. Since 2015, the IEEE Annual International Low-Power Image Recognition Challenge (LPIRC) has been held to identify the most energy-efficient computer vision solutions. This article summarizes 2018 winners' solutions. (2) Suggest directions for research as well as opportunities for low-power computer vision.

* Preprint, Accepted by IEEE Journal on Emerging and Selected Topics in Circuits and Systems. arXiv admin note: substantial text overlap with arXiv:1810.01732

Via

Access Paper or Ask Questions

Towards Leveraging the Information of Gradients in Optimization-based Adversarial Attack

Dec 06, 2018

Jingyang Zhang, Hsin-Pai Cheng, Chunpeng Wu, Hai Li, Yiran Chen

Figure 1 for Towards Leveraging the Information of Gradients in Optimization-based Adversarial Attack

Figure 2 for Towards Leveraging the Information of Gradients in Optimization-based Adversarial Attack

Figure 3 for Towards Leveraging the Information of Gradients in Optimization-based Adversarial Attack

Figure 4 for Towards Leveraging the Information of Gradients in Optimization-based Adversarial Attack

Abstract:In recent years, deep neural networks demonstrated state-of-the-art performance in a large variety of tasks and therefore have been adopted in many applications. On the other hand, the latest studies revealed that neural networks are vulnerable to adversarial examples obtained by carefully adding small perturbation to legitimate samples. Based upon the observation, many attack methods were proposed. Among them, the optimization-based CW attack is the most powerful as the produced adversarial samples present much less distortion compared to other methods. The better attacking effect, however, comes at the cost of running more iterations and thus longer computation time to reach desirable results. In this work, we propose to leverage the information of gradients as a guidance during the search of adversaries. More specifically, directly incorporating the gradients into the perturbation can be regarded as a constraint added to the optimization process. We intuitively and empirically prove the rationality of our method in reducing the search space. Our experiments show that compared to the original CW attack, the proposed method requires fewer iterations towards adversarial samples, obtaining a higher success rate and resulting in smaller $\ell_2$ distortion.

Via

Access Paper or Ask Questions

LEASGD: an Efficient and Privacy-Preserving Decentralized Algorithm for Distributed Learning

Nov 27, 2018

Hsin-Pai Cheng, Patrick Yu, Haojing Hu, Feng Yan, Shiyu Li, Hai Li, Yiran Chen

Figure 1 for LEASGD: an Efficient and Privacy-Preserving Decentralized Algorithm for Distributed Learning

Figure 2 for LEASGD: an Efficient and Privacy-Preserving Decentralized Algorithm for Distributed Learning

Figure 3 for LEASGD: an Efficient and Privacy-Preserving Decentralized Algorithm for Distributed Learning

Abstract:Distributed learning systems have enabled training large-scale models over large amount of data in significantly shorter time. In this paper, we focus on decentralized distributed deep learning systems and aim to achieve differential privacy with good convergence rate and low communication cost. To achieve this goal, we propose a new learning algorithm LEASGD (Leader-Follower Elastic Averaging Stochastic Gradient Descent), which is driven by a novel Leader-Follower topology and a differential privacy model.We provide a theoretical analysis of the convergence rate and the trade-off between the performance and privacy in the private setting.The experimental results show that LEASGD outperforms state-of-the-art decentralized learning algorithm DPSGD by achieving steadily lower loss within the same iterations and by reducing the communication cost by 30%. In addition, LEASGD spends less differential privacy budget and has higher final accuracy result than DPSGD under private setting.

Via

Access Paper or Ask Questions

Differentiable Fine-grained Quantization for Deep Neural Network Compression

Oct 20, 2018

Hsin-Pai Cheng, Yuanjun Huang, Xuyang Guo, Yifei Huang, Feng Yan, Hai Li, Yiran Chen

Figure 1 for Differentiable Fine-grained Quantization for Deep Neural Network Compression

Figure 2 for Differentiable Fine-grained Quantization for Deep Neural Network Compression

Figure 3 for Differentiable Fine-grained Quantization for Deep Neural Network Compression

Abstract:Neural networks have shown great performance in cognitive tasks. When deploying network models on mobile devices with limited resources, weight quantization has been widely adopted. Binary quantization obtains the highest compression but usually results in big accuracy drop. In practice, 8-bit or 16-bit quantization is often used aiming at maintaining the same accuracy as the original 32-bit precision. We observe different layers have different accuracy sensitivity of quantization. Thus judiciously selecting different precision for different layers/structures can potentially produce more efficient models compared to traditional quantization methods by striking a better balance between accuracy and compression rate. In this work, we propose a fine-grained quantization approach for deep neural network compression by relaxing the search space of quantization bitwidth from discrete to a continuous domain. The proposed approach applies gradient descend based optimization to generate a mixed-precision quantization scheme that outperforms the accuracy of traditional quantization methods under the same compression rate.

Via

Access Paper or Ask Questions

2018 Low-Power Image Recognition Challenge

Oct 03, 2018

Sergei Alyamkin, Matthew Ardi, Achille Brighton, Alexander C. Berg, Yiran Chen, Hsin-Pai Cheng, Bo Chen, Zichen Fan, Chen Feng, Bo Fu(+31 more)

Abstract:The Low-Power Image Recognition Challenge (LPIRC, https://rebootingcomputing.ieee.org/lpirc) is an annual competition started in 2015. The competition identifies the best technologies that can classify and detect objects in images efficiently (short execution time and low energy consumption) and accurately (high precision). Over the four years, the winners' scores have improved more than 24 times. As computer vision is widely used in many battery-powered systems (such as drones and mobile phones), the need for low-power computer vision will become increasingly important. This paper summarizes LPIRC 2018 by describing the three different tracks and the winners' solutions.

* 13 pages, workshop in 2018 CVPR, competition, low-power, image recognition

Via

Access Paper or Ask Questions

MAT: A Multi-strength Adversarial Training Method to Mitigate Adversarial Attacks

May 11, 2018

Chang Song, Hsin-Pai Cheng, Huanrui Yang, Sicheng Li, Chunpeng Wu, Qing Wu, Hai Li, Yiran Chen

Figure 1 for MAT: A Multi-strength Adversarial Training Method to Mitigate Adversarial Attacks

Figure 2 for MAT: A Multi-strength Adversarial Training Method to Mitigate Adversarial Attacks

Figure 3 for MAT: A Multi-strength Adversarial Training Method to Mitigate Adversarial Attacks

Figure 4 for MAT: A Multi-strength Adversarial Training Method to Mitigate Adversarial Attacks

Abstract:Some recent works revealed that deep neural networks (DNNs) are vulnerable to so-called adversarial attacks where input examples are intentionally perturbed to fool DNNs. In this work, we revisit the DNN training process that includes adversarial examples into the training dataset so as to improve DNN's resilience to adversarial attacks, namely, adversarial training. Our experiments show that different adversarial strengths, i.e., perturbation levels of adversarial examples, have different working zones to resist the attack. Based on the observation, we propose a multi-strength adversarial training method (MAT) that combines the adversarial training examples with different adversarial strengths to defend adversarial attacks. Two training structures - mixed MAT and parallel MAT - are developed to facilitate the tradeoffs between training time and memory occupation. Our results show that MAT can substantially minimize the accuracy degradation of deep learning systems to adversarial attacks on MNIST, CIFAR-10, CIFAR-100, and SVHN.

* 6 pages, 4 figures, 2 tables

Via

Access Paper or Ask Questions