RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content

Mar 19, 2024
Zhuowen Yuan, Zidi Xiong, Yi Zeng, Ning Yu, Ruoxi Jia, Dawn Song, Bo Li

Figures 1-4 for RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content

View paper on arXiv