Edgar Dobriban

Uncertainty in Language Models: Assessment through Rank-Calibration

Apr 04, 2024
Xinmeng Huang, Shuo Li, Mengxin Yu, Matteo Sesia, Hamed Hassani, Insup Lee, Osbert Bastani, Edgar Dobriban

Inference in Randomized Least Squares and PCA via Normality of Quadratic Forms

Apr 01, 2024
Leda Wang, Zhixiang Zhang, Edgar Dobriban

JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models

Mar 28, 2024
Patrick Chao, Edoardo Debenedetti, Alexander Robey, Maksym Andriushchenko, Francesco Croce, Vikash Sehwag, Edgar Dobriban, Nicolas Flammarion, George J. Pappas, Florian Tramer, Hamed Hassani, Eric Wong

Minimax Optimal Fair Classification with Bounded Demographic Disparity

Mar 27, 2024
Xianli Zeng, Guang Cheng, Edgar Dobriban

Bayes-Optimal Fair Classification with Linear Disparity Constraints via Pre-, In-, and Post-processing

Feb 06, 2024
Xianli Zeng, Guang Cheng, Edgar Dobriban

SymmPI: Predictive Inference for Data with Group Symmetries

Dec 29, 2023
Edgar Dobriban, Mengxin Yu

PAC Prediction Sets Under Label Shift

Oct 19, 2023
Wenwen Si, Sangdon Park, Insup Lee, Edgar Dobriban, Osbert Bastani

Jailbreaking Black Box Large Language Models in Twenty Queries

Oct 13, 2023
Patrick Chao, Alexander Robey, Edgar Dobriban, Hamed Hassani, George J. Pappas, Eric Wong
