In vision-enabled autonomous systems such as robots and autonomous cars, video object detection plays a crucial role, and both its speed and accuracy are important for reliable operation. The key insight we show in this paper is that speed and accuracy are not necessarily a trade-off when it comes to image scaling: our results show that re-scaling an image to a lower resolution can sometimes yield higher accuracy. Based on this observation, we propose a novel approach, dubbed AdaScale, which adaptively selects the input image scale that improves both accuracy and speed for video object detection. Our results on the ImageNet VID and mini YouTube-BoundingBoxes datasets demonstrate mAP improvements of 1.3 points and 2.7 points with 1.6x and 1.8x speedups, respectively. Additionally, we improve upon state-of-the-art video acceleration work with an extra 1.25x speedup and slightly better mAP on the ImageNet VID dataset.
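To make the adaptation mechanism concrete, here is a minimal sketch of the frame-to-frame scale-selection loop. The detector stub, the crude nearest-neighbor resize, the feature summary, and the `predict_next_scale` head are all hypothetical placeholders, not the paper's implementation; in AdaScale this role is played by a learned scale regressor attached to the detector.

```python
import numpy as np

# Candidate scales (shorter-side resolutions) for a hypothetical multi-scale setup.
SCALES = [600, 480, 360, 240]

def detect(frame: np.ndarray, scale: int):
    """Hypothetical detector stub: returns (boxes, features) at a given scale.
    A real system would resize `frame` so its shorter side equals `scale`
    and run a CNN detector on the resized image."""
    h, w = frame.shape[:2]
    factor = scale / min(h, w)
    step = max(1, int(1 / factor))          # crude nearest-neighbor downsample
    resized = frame[::step, ::step]
    boxes = np.empty((0, 4))                # placeholder: no real detections
    features = resized.mean(axis=(0, 1))    # placeholder per-frame feature
    return boxes, features

def predict_next_scale(features: np.ndarray) -> int:
    """Hypothetical scale regressor: maps current-frame features to the scale
    expected to work best on the next frame. In the paper this is a small
    learned regression head, not the stand-in arithmetic used here."""
    idx = int(abs(features.sum())) % len(SCALES)
    return SCALES[idx]

def run_video(frames):
    scale = SCALES[0]                       # start at the largest scale
    all_boxes = []
    for frame in frames:
        boxes, feats = detect(frame, scale)
        all_boxes.append(boxes)
        scale = predict_next_scale(feats)   # adapt the scale frame-to-frame
    return all_boxes

if __name__ == "__main__":
    video = [np.random.rand(720, 1280, 3).astype(np.float32) for _ in range(5)]
    run_video(video)
```

The design point is that the scale predictor's cost is negligible next to the detector itself, so shrinking frames that do not need full resolution saves time without hurting, and sometimes improving, accuracy.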
In recent years, supervised learning using Convolutional Neural Networks (CNNs) has achieved great success in image classification, and large-scale labeled datasets have contributed significantly to this achievement. However, the definition of a label is often application-dependent: an image of a cat can be labeled "cat" or, more specifically, "Persian cat." We refer to this as label granularity. In this paper, we conduct extensive experiments on various datasets to demonstrate and analyze how and why training with fine-grain labels, such as "Persian cat," improves CNN accuracy on coarse-grain classes, in this case "cat." The experimental results show that training CNNs with fine-grain labels improves both the network's optimization and its generalization: intuitively, it encourages the network to learn more features, and hence increases classification accuracy on coarse-grain classes across all datasets considered. Moreover, fine-grain labels enhance data efficiency in CNN training. For example, a CNN trained with fine-grain labels and only 40% of the training data can achieve higher accuracy than a CNN trained with the full dataset and coarse-grain labels. These results point to two possible applications of this work: (i) with sufficient human resources, one can improve CNN performance by re-labeling the dataset with fine-grain labels, and (ii) with limited human resources, one may improve CNN performance by using fine-grain labels rather than collecting more training data. We further propose a metric called Average Confusion Ratio to characterize the effectiveness of fine-grain labeling, and demonstrate its use through extensive experiments. Code is available at https://github.com/cmu-enyac/Label-Granularity.
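As an illustration of the training setup the abstract describes, here is a minimal PyTorch sketch; the linear model, the synthetic data, and the four-to-two label hierarchy are all hypothetical stand-ins. The network is trained against fine-grain labels, and accuracy is then measured after mapping predictions back to coarse-grain classes.

```python
import torch
import torch.nn as nn

# Hypothetical label hierarchy: each fine-grain class maps to a coarse class,
# e.g., {"Persian cat", "Siamese cat"} -> "cat"; {"beagle", "poodle"} -> "dog".
FINE_TO_COARSE = torch.tensor([0, 0, 1, 1])   # 4 fine classes -> 2 coarse classes
NUM_FINE = 4

model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, NUM_FINE))
opt = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.CrossEntropyLoss()

# Train against FINE labels, even though we only care about coarse accuracy.
x = torch.randn(64, 3, 32, 32)                # stand-in for real images
fine_y = torch.randint(0, NUM_FINE, (64,))
for _ in range(10):
    opt.zero_grad()
    loss = loss_fn(model(x), fine_y)
    loss.backward()
    opt.step()

# Evaluate on COARSE classes: predict a fine class, then map it upward.
with torch.no_grad():
    fine_pred = model(x).argmax(dim=1)
    coarse_pred = FINE_TO_COARSE[fine_pred]
    coarse_y = FINE_TO_COARSE[fine_y]
    acc = (coarse_pred == coarse_y).float().mean()
    print(f"coarse-grain accuracy: {acc:.3f}")
```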
Application-specific integrated circuit (ASIC) implementations of Deep Neural Networks (DNNs) have been adopted in many systems because of their higher classification speed. However, while larger DNNs can offer better accuracy, they require significant energy and area, limiting their wide adoption. The energy consumption of DNNs is driven by both memory accesses and computation. Binarized Neural Networks (BNNs) trade accuracy for energy: they achieve large energy reductions and retain good accuracy for large DNNs thanks to their regularization effect, but show poor accuracy when a smaller DNN configuration is adopted. In this paper, we propose a new DNN model, LightNN, which replaces each multiplication with one shift, or a constrained number of shifts and adds. For a fixed DNN configuration, LightNNs achieve better accuracy than BNNs at a slight increase in energy, yet are more energy efficient than conventional DNNs with only slightly lower accuracy. LightNNs therefore give hardware designers more options for trading off accuracy against energy. Moreover, for large DNN configurations, LightNNs have a regularization effect that makes them more accurate than conventional DNNs. These conclusions are verified by experiments on the MNIST and CIFAR-10 datasets across different DNN configurations.
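A minimal sketch of the weight constraint behind LightNN, assuming a simple round-in-log-domain quantizer (the paper trains with the constraint in the loop; this snippet only illustrates the representation): each weight is approximated by k signed powers of two, so in hardware each multiplication becomes k shifts and k-1 adds.

```python
import torch

def quantize_to_shifts(w: torch.Tensor, k: int = 1) -> torch.Tensor:
    """Approximate each weight by a sum of k signed powers of two.
    k=1 means one shift per multiplication; larger k trades extra
    shift-and-add energy for a closer approximation. Illustrative
    sketch only, not the paper's training procedure."""
    residual, approx = w.clone(), torch.zeros_like(w)
    for _ in range(k):
        sign = residual.sign()                                  # 0 where residual is 0
        exp = residual.abs().clamp(min=1e-12).log2().round()    # nearest power in log domain
        term = sign * torch.pow(2.0, exp)
        approx = approx + term
        residual = residual - term
    return approx

w = torch.randn(4, 4)
print(quantize_to_shifts(w, k=1))  # each entry is +/-2^e: one shift per multiply
print(quantize_to_shifts(w, k=2))  # sum of two powers of two: two shifts + one add
```

Because every surviving multiplicand is a power of two, a multiply-accumulate unit can be replaced by barrel shifters and adders, which is where the energy savings over conventional DNNs come from.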