Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Phuoc Pham

CppSATD: A Reusable Self-Admitted Technical Debt Dataset in C++

May 02, 2025

Phuoc Pham, Murali Sridharan, Matteo Esposito, Valentina Lenarduzzi

Abstract:In software development, technical debt (TD) refers to suboptimal implementation choices made by the developers to meet urgent deadlines and limited resources, posing challenges for future maintenance. Self-Admitted Technical Debt (SATD) is a sub-type of TD, representing specific TD instances ``openly admitted'' by the developers and often expressed through source code comments. Previous research on SATD has focused predominantly on the Java programming language, revealing a significant gap in cross-language SATD. Such a narrow focus limits the generalizability of existing findings as well as SATD detection techniques across multiple programming languages. Our work addresses such limitation by introducing CppSATD, a dedicated C++ SATD dataset, comprising over 531,000 annotated comments and their source code contexts. Our dataset can serve as a foundation for future studies that aim to develop SATD detection methods in C++, generalize the existing findings to other languages, or contribute novel insights to cross-language SATD research.

Via

Access Paper or Ask Questions

Training Multi-bit Quantized and Binarized Networks with A Learnable Symmetric Quantizer

Apr 01, 2021

Phuoc Pham, Jacob Abraham, Jaeyong Chung

Figure 1 for Training Multi-bit Quantized and Binarized Networks with A Learnable Symmetric Quantizer

Figure 2 for Training Multi-bit Quantized and Binarized Networks with A Learnable Symmetric Quantizer

Figure 3 for Training Multi-bit Quantized and Binarized Networks with A Learnable Symmetric Quantizer

Figure 4 for Training Multi-bit Quantized and Binarized Networks with A Learnable Symmetric Quantizer

Abstract:Quantizing weights and activations of deep neural networks is essential for deploying them in resource-constrained devices, or cloud platforms for at-scale services. While binarization is a special case of quantization, this extreme case often leads to several training difficulties, and necessitates specialized models and training methods. As a result, recent quantization methods do not provide binarization, thus losing the most resource-efficient option, and quantized and binarized networks have been distinct research areas. We examine binarization difficulties in a quantization framework and find that all we need to enable the binary training are a symmetric quantizer, good initialization, and careful hyperparameter selection. These techniques also lead to substantial improvements in multi-bit quantization. We demonstrate our unified quantization framework, denoted as UniQ, on the ImageNet dataset with various architectures such as ResNet-18,-34 and MobileNetV2. For multi-bit quantization, UniQ outperforms existing methods to achieve the state-of-the-art accuracy. In binarization, the achieved accuracy is comparable to existing state-of-the-art methods even without modifying the original architectures.

Via

Access Paper or Ask Questions