Picture for David Wagner

David Wagner

PAL: Proxy-Guided Black-Box Attack on Large Language Models

Add code
Feb 15, 2024
Figure 1 for PAL: Proxy-Guided Black-Box Attack on Large Language Models
Figure 2 for PAL: Proxy-Guided Black-Box Attack on Large Language Models
Figure 3 for PAL: Proxy-Guided Black-Box Attack on Large Language Models
Figure 4 for PAL: Proxy-Guided Black-Box Attack on Large Language Models
Viaarxiv icon

Jatmo: Prompt Injection Defense by Task-Specific Finetuning

Add code
Jan 08, 2024
Figure 1 for Jatmo: Prompt Injection Defense by Task-Specific Finetuning
Figure 2 for Jatmo: Prompt Injection Defense by Task-Specific Finetuning
Figure 3 for Jatmo: Prompt Injection Defense by Task-Specific Finetuning
Figure 4 for Jatmo: Prompt Injection Defense by Task-Specific Finetuning
Viaarxiv icon

Mark My Words: Analyzing and Evaluating Language Model Watermarks

Add code
Dec 07, 2023
Figure 1 for Mark My Words: Analyzing and Evaluating Language Model Watermarks
Figure 2 for Mark My Words: Analyzing and Evaluating Language Model Watermarks
Figure 3 for Mark My Words: Analyzing and Evaluating Language Model Watermarks
Figure 4 for Mark My Words: Analyzing and Evaluating Language Model Watermarks
Viaarxiv icon

Can LLMs Follow Simple Rules?

Add code
Nov 06, 2023
Figure 1 for Can LLMs Follow Simple Rules?
Figure 2 for Can LLMs Follow Simple Rules?
Figure 3 for Can LLMs Follow Simple Rules?
Figure 4 for Can LLMs Follow Simple Rules?
Viaarxiv icon

Defending Against Transfer Attacks From Public Models

Add code
Oct 26, 2023
Viaarxiv icon

DiverseVul: A New Vulnerable Source Code Dataset for Deep Learning Based Vulnerability Detection

Add code
Apr 01, 2023
Figure 1 for DiverseVul: A New Vulnerable Source Code Dataset for Deep Learning Based Vulnerability Detection
Figure 2 for DiverseVul: A New Vulnerable Source Code Dataset for Deep Learning Based Vulnerability Detection
Figure 3 for DiverseVul: A New Vulnerable Source Code Dataset for Deep Learning Based Vulnerability Detection
Figure 4 for DiverseVul: A New Vulnerable Source Code Dataset for Deep Learning Based Vulnerability Detection
Viaarxiv icon

Continuous Learning for Android Malware Detection

Add code
Feb 08, 2023
Figure 1 for Continuous Learning for Android Malware Detection
Figure 2 for Continuous Learning for Android Malware Detection
Figure 3 for Continuous Learning for Android Malware Detection
Figure 4 for Continuous Learning for Android Malware Detection
Viaarxiv icon

REAP: A Large-Scale Realistic Adversarial Patch Benchmark

Add code
Dec 12, 2022
Figure 1 for REAP: A Large-Scale Realistic Adversarial Patch Benchmark
Figure 2 for REAP: A Large-Scale Realistic Adversarial Patch Benchmark
Figure 3 for REAP: A Large-Scale Realistic Adversarial Patch Benchmark
Figure 4 for REAP: A Large-Scale Realistic Adversarial Patch Benchmark
Viaarxiv icon

Part-Based Models Improve Adversarial Robustness

Add code
Sep 15, 2022
Figure 1 for Part-Based Models Improve Adversarial Robustness
Figure 2 for Part-Based Models Improve Adversarial Robustness
Figure 3 for Part-Based Models Improve Adversarial Robustness
Figure 4 for Part-Based Models Improve Adversarial Robustness
Viaarxiv icon

SLIP: Self-supervision meets Language-Image Pre-training

Add code
Dec 23, 2021
Figure 1 for SLIP: Self-supervision meets Language-Image Pre-training
Figure 2 for SLIP: Self-supervision meets Language-Image Pre-training
Figure 3 for SLIP: Self-supervision meets Language-Image Pre-training
Figure 4 for SLIP: Self-supervision meets Language-Image Pre-training
Viaarxiv icon