Picture for Chaowei Xiao

Chaowei Xiao

RePD: Defending Jailbreak Attack through a Retrieval-based Prompt Decomposition Process

Add code
Oct 11, 2024
Figure 1 for RePD: Defending Jailbreak Attack through a Retrieval-based Prompt Decomposition Process
Figure 2 for RePD: Defending Jailbreak Attack through a Retrieval-based Prompt Decomposition Process
Figure 3 for RePD: Defending Jailbreak Attack through a Retrieval-based Prompt Decomposition Process
Figure 4 for RePD: Defending Jailbreak Attack through a Retrieval-based Prompt Decomposition Process
Viaarxiv icon

LeanAgent: Lifelong Learning for Formal Theorem Proving

Add code
Oct 08, 2024
Figure 1 for LeanAgent: Lifelong Learning for Formal Theorem Proving
Figure 2 for LeanAgent: Lifelong Learning for Formal Theorem Proving
Figure 3 for LeanAgent: Lifelong Learning for Formal Theorem Proving
Figure 4 for LeanAgent: Lifelong Learning for Formal Theorem Proving
Viaarxiv icon

Mitigating Backdoor Threats to Large Language Models: Advancement and Challenges

Add code
Sep 30, 2024
Figure 1 for Mitigating Backdoor Threats to Large Language Models: Advancement and Challenges
Figure 2 for Mitigating Backdoor Threats to Large Language Models: Advancement and Challenges
Viaarxiv icon

HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection

Add code
Sep 26, 2024
Viaarxiv icon

EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage

Add code
Sep 17, 2024
Figure 1 for EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage
Figure 2 for EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage
Figure 3 for EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage
Figure 4 for EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage
Viaarxiv icon

IDNet: A Novel Dataset for Identity Document Analysis and Fraud Detection

Add code
Aug 03, 2024
Figure 1 for IDNet: A Novel Dataset for Identity Document Analysis and Fraud Detection
Figure 2 for IDNet: A Novel Dataset for Identity Document Analysis and Fraud Detection
Figure 3 for IDNet: A Novel Dataset for Identity Document Analysis and Fraud Detection
Figure 4 for IDNet: A Novel Dataset for Identity Document Analysis and Fraud Detection
Viaarxiv icon

Can Editing LLMs Inject Harm?

Add code
Jul 29, 2024
Figure 1 for Can Editing LLMs Inject Harm?
Figure 2 for Can Editing LLMs Inject Harm?
Figure 3 for Can Editing LLMs Inject Harm?
Figure 4 for Can Editing LLMs Inject Harm?
Viaarxiv icon

AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases

Add code
Jul 17, 2024
Viaarxiv icon

Consistency Purification: Effective and Efficient Diffusion Purification towards Certified Robustness

Add code
Jun 30, 2024
Viaarxiv icon

UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models

Add code
Jun 27, 2024
Figure 1 for UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models
Figure 2 for UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models
Figure 3 for UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models
Figure 4 for UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models
Viaarxiv icon