Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Xiuming Liu

SoK: Robustness in Large Language Models against Jailbreak Attacks

May 06, 2026

Feiyue Xu, Hongsheng Hu, Chaoxiang He, Sheng Hang, Hanqing Hu, Xiuming Liu, Yubo Zhao, Zhengyan Zhou, Bin Benjamin Zhu, Shi-Feng Sun(+2 more)

Abstract:Large Language Models (LLMs) have achieved remarkable success but remain highly susceptible to jailbreak attacks, in which adversarial prompts coerce models into generating harmful, unethical, or policy-violating outputs. Such attacks pose real-world risks, eroding safety, trust, and regulatory compliance in high-stakes applications. Although a variety of attack and defense methods have been proposed, existing evaluation practices are inadequate, often relying on narrow metrics like attack success rate that fail to capture the multidimensional nature of LLM security. In this paper, we present a systematic taxonomy of jailbreak attacks and defenses and introduce Security Cube, a unified, multi-dimensional framework for comprehensive evaluation of these techniques. We provide detailed comparison tables of existing attacks and defenses, highlighting key insights and open challenges across the literature. Leveraging Security Cube, we conduct benchmark studies on 13 representative attacks and 5 defenses, establishing a clear view of the current landscape encompassing jailbreak attacks, defenses, automated judges, and LLM vulnerabilities. Based on these evaluations, we distill critical findings, identify unresolved problems, and outline promising research directions for enhancing LLM robustness against jailbreak attacks. Our analysis aims to pave the way towards more robust, interpretable, and trustworthy LLM systems. Our code is available at Code.

* To Appear in the 47th IEEE Symposium on Security and Privacy, May 18-20, 2026

Via

Access Paper or Ask Questions

Robust Prediction when Features are Missing

Dec 24, 2019

Xiuming Liu, Dave Zachariah, Petre Stoica

Figure 1 for Robust Prediction when Features are Missing

Figure 2 for Robust Prediction when Features are Missing

Figure 3 for Robust Prediction when Features are Missing

Figure 4 for Robust Prediction when Features are Missing

Abstract:Predictors are learned using past training data containing features which may be unavailable at the time of prediction. We develop an prediction approach that is robust against unobserved outliers of the missing features, based on the optimality properties of a predictor which has access to these features. The robustness properties of the approach are demonstrated in real and synthetic data.

Via

Access Paper or Ask Questions

Robust Semi-Supervised Learning when Labels are Missing at Random

Nov 28, 2018

Xiuming Liu, Dave Zachariah, Johan Wågberg

Figure 1 for Robust Semi-Supervised Learning when Labels are Missing at Random

Figure 2 for Robust Semi-Supervised Learning when Labels are Missing at Random

Figure 3 for Robust Semi-Supervised Learning when Labels are Missing at Random

Figure 4 for Robust Semi-Supervised Learning when Labels are Missing at Random

Abstract:Semi-supervised learning methods are motivated by the relative paucity of labeled data and aim to utilize large sources of unlabeled data to improve predictive tasks. It has been noted, however, such improvements are not guaranteed in general in some cases the unlabeled data impairs the performance. A fundamental source of error comes from restrictive assumptions about the unlabeled features. In this paper, we develop a semi-supervised learning approach that relaxes such assumptions and is robust with respect to labels missing at random. The approach ensures that uncertainty about the classes is propagated to the unlabeled features in a robust manner. It is applicable using any generative model with associated learning algorithm. We illustrate the approach using both standard synthetic data examples and the MNIST data with unlabeled adversarial examples.

Via

Access Paper or Ask Questions

Composite Gaussian Processes: Scalable Computation and Performance Analysis

Jan 31, 2018

Xiuming Liu, Dave Zachariah, Edith C. H. Ngai

Figure 1 for Composite Gaussian Processes: Scalable Computation and Performance Analysis

Figure 2 for Composite Gaussian Processes: Scalable Computation and Performance Analysis

Figure 3 for Composite Gaussian Processes: Scalable Computation and Performance Analysis

Figure 4 for Composite Gaussian Processes: Scalable Computation and Performance Analysis

Abstract:Gaussian process (GP) models provide a powerful tool for prediction but are computationally prohibitive using large data sets. In such scenarios, one has to resort to approximate methods. We derive an approximation based on a composite likelihood approach using a general belief updating framework, which leads to a recursive computation of the predictor as well as of learning the hyper-parameters. We then provide an analysis of the derived composite GP model in predictive and information-theoretic terms. Finally, we evaluate the approximation with both synthetic data and a real-world application.

Via

Access Paper or Ask Questions