Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Polysemantic Dropout: Conformal OOD Detection for Specialized LLMs

Sep 04, 2025

Ayush Gupta, Ramneet Kaur, Anirban Roy, Adam D. Cobb, Rama Chellappa, Susmit Jha

Figure 1 for Polysemantic Dropout: Conformal OOD Detection for Specialized LLMs

Figure 2 for Polysemantic Dropout: Conformal OOD Detection for Specialized LLMs

Figure 3 for Polysemantic Dropout: Conformal OOD Detection for Specialized LLMs

Figure 4 for Polysemantic Dropout: Conformal OOD Detection for Specialized LLMs

Share this with someone who'll enjoy it:

Abstract:We propose a novel inference-time out-of-domain (OOD) detection algorithm for specialized large language models (LLMs). Despite achieving state-of-the-art performance on in-domain tasks through fine-tuning, specialized LLMs remain vulnerable to incorrect or unreliable outputs when presented with OOD inputs, posing risks in critical applications. Our method leverages the Inductive Conformal Anomaly Detection (ICAD) framework, using a new non-conformity measure based on the model's dropout tolerance. Motivated by recent findings on polysemanticity and redundancy in LLMs, we hypothesize that in-domain inputs exhibit higher dropout tolerance than OOD inputs. We aggregate dropout tolerance across multiple layers via a valid ensemble approach, improving detection while maintaining theoretical false alarm bounds from ICAD. Experiments with medical-specialized LLMs show that our approach detects OOD inputs better than baseline methods, with AUROC improvements of $2\%$ to $37\%$ when treating OOD datapoints as positives and in-domain test datapoints as negatives.

* Accepted to EMNLP 2025 main conference

View paper on

Share this with someone who'll enjoy it:

Title:Polysemantic Dropout: Conformal OOD Detection for Specialized LLMs

Paper and Code