Gagandeep Singh

Specification Generation for Neural Networks in Systems

Dec 04, 2024

Isha Chaudhary, Shuyi Lin, Cheng Tan, Gagandeep Singh

Abstract:Specifications - precise mathematical representations of correct domain-specific behaviors - are crucial for guaranteeing the trustworthiness of computer systems. With the increasing development of neural networks as computer system components, specifications gain more importance, as they can be used to regulate the behaviors of these black-box models. Traditionally, specifications are designed by domain experts based on their intuition of correct behavior. However, this is labor-intensive and hence not a scalable approach as computer system applications diversify. We hypothesize that the traditional (aka reference) algorithms that neural networks replace for higher performance can act as effective proxies for correct behaviors of the models, when available. This is because they have been used and tested for long enough to encode several aspects of the trustworthy/correct behaviors in the underlying domain. Driven by our hypothesis, we develop a novel automated framework, SpecTRA, to generate specifications for neural networks using references. We formulate specification generation as an optimization problem and solve it with observations of reference behaviors. SpecTRA clusters similar observations into compact specifications. We present specifications generated by SpecTRA for neural networks in adaptive bit rate and congestion control algorithms. Our specifications show evidence of being correct and matching intuition. Moreover, we use our specifications to expose several previously unknown vulnerabilities of the SOTA models for computer systems.

TopoTxR: A topology-guided deep convolutional network for breast parenchyma learning on DCE-MRIs

Nov 05, 2024

Fan Wang, Zhilin Zou, Nicole Sakla, Luke Partyka, Nil Rawal, Gagandeep Singh, Wei Zhao, Haibin Ling, Chuan Huang, Prateek Prasanna(+1 more)

Abstract:Characterization of breast parenchyma in dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) is a challenging task owing to the complexity of underlying tissue structures. Existing quantitative approaches, like radiomics and deep learning models, lack explicit quantification of intricate and subtle parenchymal structures, including fibroglandular tissue. To address this, we propose a novel topological approach that explicitly extracts multi-scale topological structures to better approximate breast parenchymal structures, and then incorporates these structures into a deep-learning-based prediction model via an attention mechanism. Our topology-informed deep learning model, TopoTxR, leverages topology to provide enhanced insights into tissues critical for disease pathophysiology and treatment response. We empirically validate TopoTxR using the VICTRE phantom breast dataset, showing that the topological structures extracted by our model effectively approximate the breast parenchymal structures. We further demonstrate TopoTxR's efficacy in predicting response to neoadjuvant chemotherapy. Our qualitative and quantitative analyses of treatment-naïve imaging suggest that the topological behavior of breast tissue differs between patients who respond favorably to therapy, achieving pathological complete response (pCR), and those who do not. In a comparative analysis with several baselines on the publicly available I-SPY 1 dataset (N=161, including 47 patients with pCR and 114 without) and the Rutgers proprietary dataset (N=120, with 69 patients achieving pCR and 51 not), TopoTxR demonstrates a notable improvement, achieving a 2.6% increase in accuracy and a 4.6% enhancement in AUC compared to the state-of-the-art method.

* Volume 99, 2025, 103373
* 22 pages, 8 figures, 8 tables, accepted by Medical Image Analysis ( https://www.sciencedirect.com/science/article/abs/pii/S1361841524002986 )

Stochastic Monkeys at Play: Random Augmentations Cheaply Break LLM Safety Alignment

Nov 05, 2024

Jason Vega, Junsheng Huang, Gaokai Zhang, Hangoo Kang, Minjia Zhang, Gagandeep Singh

Abstract:Safety alignment of Large Language Models (LLMs) has recently become a critical objective of model developers. In response, a growing body of work has been investigating how safety alignment can be bypassed through various jailbreaking methods, such as adversarial attacks. However, these jailbreak methods can be rather costly or involve a non-trivial amount of creativity and effort, introducing the assumption that malicious users are high-resource or sophisticated. In this paper, we study how simple random augmentations to the input prompt affect safety alignment effectiveness in state-of-the-art LLMs, such as Llama 3 and Qwen 2. We perform an in-depth evaluation of 17 different models and investigate the intersection of safety under random augmentations with multiple dimensions: augmentation type, model size, quantization, fine-tuning-based defenses, and decoding strategies (e.g., sampling temperature). We show that low-resource and unsophisticated attackers, i.e., "stochastic monkeys", can significantly improve their chances of bypassing alignment with just 25 random augmentations per prompt.

* Under peer review

ARQ: A Mixed-Precision Quantization Framework for Accurate and Certifiably Robust DNNs

Oct 31, 2024

Yuchen Yang, Shubham Ugare, Yifan Zhao, Gagandeep Singh, Sasa Misailovic

Abstract:Mixed-precision quantization has become an important technique for enabling the execution of deep neural networks (DNNs) on resource-limited computing platforms. Traditional quantization methods have primarily concentrated on maintaining neural network accuracy, either ignoring the impact of quantization on the robustness of the network or using only empirical techniques for improving robustness. In contrast, techniques for robustness certification, which can provide strong guarantees about the robustness of DNNs, have not been used during quantization due to their high computation cost. This paper introduces ARQ, an innovative mixed-precision quantization method that not only preserves the clean accuracy of the smoothed classifiers but also maintains their certified robustness. ARQ uses reinforcement learning to find accurate and robust DNN quantization, while efficiently leveraging randomized smoothing, a popular class of statistical DNN verification algorithms, to guide the search process. We compare ARQ with multiple state-of-the-art quantization techniques on several DNN architectures commonly used in quantization studies: ResNet-20 on CIFAR-10, ResNet-50 on ImageNet, and MobileNetV2 on ImageNet. We demonstrate that ARQ consistently performs better than these baselines across all the benchmarks and the input perturbation levels. In many cases, the performance of ARQ quantized networks can reach that of the original DNN with floating-point weights, but with only 1.5% of the instructions.
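
The abstract mentions randomized smoothing as the verification method that guides the search. As a rough illustration of what such a certificate looks like, the sketch below computes the widely used Gaussian-smoothing l2 radius, sigma times the inverse normal CDF of a lower bound on the top-class probability. The function name and parameters are hypothetical, and this is not ARQ's quantization search or its exact certification routine.

```python
from statistics import NormalDist


def certified_l2_radius(p_a_lower: float, sigma: float) -> float:
    """Return the l2 radius certified by Gaussian randomized smoothing.

    p_a_lower: lower confidence bound on the probability that the base
               classifier predicts the top class under Gaussian noise.
    sigma:     standard deviation of the smoothing noise.
    """
    if not 0.0 < p_a_lower < 1.0:
        raise ValueError("p_a_lower must lie strictly between 0 and 1")
    if sigma <= 0.0:
        raise ValueError("sigma must be positive")
    if p_a_lower <= 0.5:
        # No certificate is possible; a smoothed classifier would abstain.
        return 0.0
    return sigma * NormalDist().inv_cdf(p_a_lower)


if __name__ == "__main__":
    for p in (0.6, 0.9, 0.99):
        print(p, round(certified_l2_radius(p, sigma=0.5), 4))
```

A larger lower bound or a larger noise level yields a larger certified radius, which is why a quantization that changes the classifier's behavior under noise can shrink the certificate even when clean accuracy is preserved.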

Robust Thompson Sampling Algorithms Against Reward Poisoning Attacks

Oct 25, 2024

Yinglun Xu, Zhiwei Wang, Gagandeep Singh

Abstract:Thompson sampling is one of the most popular learning algorithms for online sequential decision-making problems and has rich real-world applications. However, current Thompson sampling algorithms are limited by the assumption that the rewards received are uncorrupted, which may not be true in real-world applications where adversarial reward poisoning exists. To make Thompson sampling more reliable, we want to make it robust against adversarial reward poisoning. The main challenge is that one can no longer compute the actual posteriors for the true reward, as the agent can only observe the rewards after corruption. In this work, we solve this problem by computing pseudo-posteriors that are less likely to be manipulated by the attack. We propose robust algorithms based on Thompson sampling for the popular stochastic and contextual linear bandit settings in both cases where the agent is aware or unaware of the budget of the attacker. We theoretically show that our algorithms guarantee near-optimal regret under any attack strategy.
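
For readers unfamiliar with the baseline, here is a minimal sketch of classical Thompson sampling for a Bernoulli bandit with Beta posteriors. It assumes uncorrupted rewards, which is exactly the assumption the abstract says breaks down under poisoning. The robust pseudo-posterior construction from the paper is not reproduced here, and all names are illustrative.

```python
import random


def thompson_sampling(true_means, horizon, seed=0):
    """Run Beta-Bernoulli Thompson sampling and return pull counts per arm."""
    rng = random.Random(seed)
    k = len(true_means)
    successes = [0] * k  # observed rewards of 1 per arm
    failures = [0] * k   # observed rewards of 0 per arm
    pulls = [0] * k
    for _ in range(horizon):
        # Draw one sample from each arm's Beta(successes + 1, failures + 1) posterior.
        samples = [rng.betavariate(successes[i] + 1, failures[i] + 1) for i in range(k)]
        arm = max(range(k), key=samples.__getitem__)
        # Simulated environment: Bernoulli reward with the arm's true mean.
        reward = 1 if rng.random() < true_means[arm] else 0
        successes[arm] += reward
        failures[arm] += 1 - reward
        pulls[arm] += 1
    return pulls


if __name__ == "__main__":
    print(thompson_sampling([0.2, 0.8], horizon=2000))
```

Because the posterior update trusts every observed reward, an attacker who can alter rewards can steer these counts, which motivates replacing the exact posterior with one that is harder to manipulate.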

IterGen: Iterative Structured LLM Generation

Oct 09, 2024

Shubham Ugare, Rohan Gumaste, Tarun Suresh, Gagandeep Singh, Sasa Misailovic

Abstract:Large Language Models (LLMs) are widely used for tasks such as natural language and code generation. Still, their outputs often suffer from issues such as privacy violations and semantically inaccurate code generation. Current libraries for LLM generation rely on left-to-right decoding without systematic support for backtracking, limiting the ability to correct or refine outputs mid-generation. To address this issue, we introduce IterGen, an intuitive framework for iterative, grammar-guided LLM generation that enables users to move both forward and backward within the generated output based on grammar symbols. By leveraging a symbol-to-position mapping, IterGen ensures efficient and structured generation while allowing for corrections during the process. We demonstrate IterGen's effectiveness in two important applications: reducing privacy leakage in LLM outputs and improving the accuracy of LLM-generated SQL queries. Our code is available at https://github.com/uiuc-arc/itergen
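
To make the idea of a symbol-to-position mapping concrete, the toy sketch below records where each grammar symbol starts in the token sequence and truncates the output when moving backward. The class and method names are hypothetical and do not reflect the IterGen API; no grammar parsing or model decoding is performed, and symbols are assumed to be flat and non-empty.

```python
from collections import defaultdict


class GenerationTrace:
    """Toy record of generated tokens plus the start position of each symbol."""

    def __init__(self):
        self.tokens = []
        self.symbol_starts = defaultdict(list)  # symbol name -> start indices

    def append(self, symbol, new_tokens):
        """Move forward: record where `symbol` starts, then add its tokens."""
        self.symbol_starts[symbol].append(len(self.tokens))
        self.tokens.extend(new_tokens)

    def backward(self, symbol, count=1):
        """Move backward: drop everything from the count-th most recent `symbol`."""
        starts = self.symbol_starts[symbol]
        if count < 1 or count > len(starts):
            raise ValueError("not enough occurrences of symbol to backtrack")
        cut = starts[-count]
        self.tokens = self.tokens[:cut]
        # Forget every symbol occurrence that began at or after the cut.
        for name in list(self.symbol_starts):
            self.symbol_starts[name] = [p for p in self.symbol_starts[name] if p < cut]
        return cut


if __name__ == "__main__":
    trace = GenerationTrace()
    trace.append("select_clause", ["SELECT", "name"])
    trace.append("from_clause", ["FROM", "users"])
    trace.append("where_clause", ["WHERE", "age", ">", "30"])
    trace.backward("where_clause")  # discard the WHERE clause
    trace.append("where_clause", ["WHERE", "id", "=", "1"])
    print(" ".join(trace.tokens))
```

Looking up the cut point in the mapping avoids re-parsing the output on each backward step; a real system would also need to roll back decoder state, which this sketch omits.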

Towards Universal Certified Robustness with Multi-Norm Training

Oct 03, 2024

Enyi Jiang, Gagandeep Singh

Abstract:Existing certified training methods can only train models to be robust against a certain perturbation type (e.g. $l_\infty$ or $l_2$). However, an $l_\infty$ certifiably robust model may not be certifiably robust against $l_2$ perturbation (and vice versa) and also has low robustness against other perturbations (e.g. geometric transformation). To this end, we propose the first multi-norm certified training framework CURE, consisting of a new $l_2$ deterministic certified training defense and several multi-norm certified training methods, to attain better union robustness when training from scratch or fine-tuning a pre-trained certified model. Further, we devise bound alignment and connect natural training with certified training for better union robustness. Compared with SOTA certified training, CURE improves union robustness up to 22.8% on MNIST, 23.9% on CIFAR-10, and 8.0% on TinyImagenet. Further, it leads to better generalization on a diverse set of challenging unseen geometric perturbations, up to 6.8% on CIFAR-10. Overall, our contributions pave a path towards universal certified robustness.
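
The notion of union robustness can be illustrated with a small calculation: a test input counts as union-robust only if it is certified under every perturbation type considered. The sketch below assumes per-input certification flags are already available; the function name and input layout are hypothetical, and the paper's training methods are not shown.

```python
def union_robust_accuracy(certified: dict[str, list[bool]]) -> float:
    """Fraction of inputs certified under all perturbation types at once.

    certified maps a perturbation name (e.g. "linf", "l2") to one flag per
    test input, True when the input is correctly classified and certified.
    """
    if not certified:
        raise ValueError("at least one perturbation type is required")
    columns = list(certified.values())
    n = len(columns[0])
    if n == 0 or any(len(col) != n for col in columns):
        raise ValueError("all perturbation types need the same non-zero number of flags")
    # An input contributes only if every perturbation type certifies it.
    robust_everywhere = sum(all(flags) for flags in zip(*columns))
    return robust_everywhere / n


if __name__ == "__main__":
    flags = {
        "linf": [True, True, False, True],
        "l2": [True, False, False, True],
    }
    print(union_robust_accuracy(flags))
```

In this toy input the model is certified on three of four inputs for one norm and two of four for the other, yet only two inputs are certified for both, so union robustness can never exceed the weakest single-norm figure.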

Optimal Reward Labeling: Bridging Offline Preference and Reward-Based Reinforcement Learning

Jun 14, 2024

Yinglun Xu, David Zhu, Rohan Gumaste, Gagandeep Singh

Abstract:Offline reinforcement learning has become one of the most practical RL settings. A recent success story has been RLHF, offline preference-based RL (PBRL) with preferences from humans. However, most existing works on offline RL focus on the standard setting with scalar reward feedback. It remains unknown how to universally transfer the existing rich understanding of offline RL from the reward-based to the preference-based setting. In this work, we propose a general framework to bridge this gap. Our key insight is to transform preference feedback into scalar rewards via optimal reward labeling (ORL), after which any reward-based offline RL algorithm can be applied to the dataset with the reward labels. We theoretically show the connection between several recent PBRL techniques and our framework combined with specific offline RL algorithms in terms of how they utilize the preference signals. By combining reward labeling with different algorithms, our framework can lead to new and potentially more efficient offline PBRL algorithms. We empirically test our framework on preference datasets based on the standard D4RL benchmark. When combined with a variety of efficient reward-based offline RL algorithms, the learning result achieved under our framework is comparable to training the same algorithm on the dataset with actual rewards in many cases and better than the recent PBRL baselines in most cases.

Enhancing Sign Language Detection through Mediapipe and Convolutional Neural Networks (CNN)

Jun 06, 2024

Aditya Raj Verma, Gagandeep Singh, Karnim Meghwal, Banawath Ramji, Praveen Kumar Dadheech

Abstract:This research combines MediaPipe and CNNs for efficient and accurate real-time sign language detection on an ASL dataset. The system presented here captures and processes hand gestures in real time. The intended purpose was to create an easy, accurate, and fast way of entering commands without having to touch anything. MediaPipe provides a powerful framework for real-time hand tracking, which is used to capture and preprocess hand movements and increases the accuracy of the gesture recognition system. Integrating a CNN with MediaPipe makes the model more efficient for real-time processing. The model was tested on American Sign Language (ASL) datasets, where it achieved an accuracy of 99.12%. The results were compared with those of existing methods using established evaluation techniques, and the model achieved better accuracy than previous models. The system will have applications in the communication, education, and accessibility domains. Improving systems such as the one described in this paper will assist people with hearing impairment and make things more accessible to them. The aim of the research is to identify American Sign Language characters from hand images taken with a web camera, using MediaPipe and CNNs.

Quantitative Certification of Bias in Large Language Models

May 29, 2024

Isha Chaudhary, Qian Hu, Manoj Kumar, Morteza Ziyadi, Rahul Gupta, Gagandeep Singh

Abstract:Large Language Models (LLMs) can produce responses that exhibit social biases and support stereotypes. However, conventional benchmarking is insufficient to thoroughly evaluate LLM bias, as it cannot scale to large sets of prompts and provides no guarantees. Therefore, we propose a novel certification framework, QuaCer-B (Quantitative Certification of Bias), that provides formal guarantees on obtaining unbiased responses from target LLMs under large sets of prompts. A certificate consists of high-confidence bounds on the probability of obtaining biased responses from the LLM for any set of prompts containing sensitive attributes, sampled from a distribution. We illustrate the bias certification in LLMs for prompts with various prefixes drawn from given distributions. We consider distributions of random token sequences, mixtures of manual jailbreaks, and jailbreaks in the LLM's embedding space to certify its bias. We certify popular LLMs with QuaCer-B and present novel insights into their biases.
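
As a rough picture of what "high-confidence bounds on the probability of obtaining biased responses" can mean, the sketch below turns a count of biased responses among independently sampled prompts into a two-sided interval using Hoeffding's inequality. The abstract does not name the interval construction, so Hoeffding is an assumption chosen for simplicity, and the function name and parameters are hypothetical.

```python
import math


def hoeffding_bounds(num_biased: int, num_samples: int, delta: float = 0.05):
    """Two-sided bounds on the true bias probability, valid with prob. >= 1 - delta.

    Assumes the sampled prompts are i.i.d. draws from the prompt distribution
    and that each response has been judged biased (1) or unbiased (0).
    """
    if num_samples <= 0 or not 0 <= num_biased <= num_samples:
        raise ValueError("need 0 <= num_biased <= num_samples and num_samples > 0")
    if not 0.0 < delta < 1.0:
        raise ValueError("delta must lie strictly between 0 and 1")
    p_hat = num_biased / num_samples
    # Hoeffding: P(|p_hat - p| >= eps) <= 2 * exp(-2 * n * eps^2) = delta.
    eps = math.sqrt(math.log(2.0 / delta) / (2.0 * num_samples))
    return max(0.0, p_hat - eps), min(1.0, p_hat + eps)


if __name__ == "__main__":
    low, high = hoeffding_bounds(num_biased=5, num_samples=100, delta=0.05)
    print(round(low, 4), round(high, 4))
```

The interval narrows at a rate of one over the square root of the sample count, so tighter certificates require more sampled prompts; tighter interval constructions exist and may be preferable in practice.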
