Picture for Jonas Geiping

Jonas Geiping

Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs

Add code
Jun 14, 2024
Figure 1 for Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs
Figure 2 for Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs
Figure 3 for Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs
Figure 4 for Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs
Viaarxiv icon

AI Risk Management Should Incorporate Both Safety and Security

Add code
May 29, 2024
Figure 1 for AI Risk Management Should Incorporate Both Safety and Security
Viaarxiv icon

Transformers Can Do Arithmetic with the Right Embeddings

Add code
May 27, 2024
Figure 1 for Transformers Can Do Arithmetic with the Right Embeddings
Figure 2 for Transformers Can Do Arithmetic with the Right Embeddings
Figure 3 for Transformers Can Do Arithmetic with the Right Embeddings
Figure 4 for Transformers Can Do Arithmetic with the Right Embeddings
Viaarxiv icon

LMD3: Language Model Data Density Dependence

Add code
May 10, 2024
Viaarxiv icon

Privacy Backdoors: Enhancing Membership Inference through Poisoning Pre-trained Models

Add code
Apr 01, 2024
Figure 1 for Privacy Backdoors: Enhancing Membership Inference through Poisoning Pre-trained Models
Figure 2 for Privacy Backdoors: Enhancing Membership Inference through Poisoning Pre-trained Models
Figure 3 for Privacy Backdoors: Enhancing Membership Inference through Poisoning Pre-trained Models
Figure 4 for Privacy Backdoors: Enhancing Membership Inference through Poisoning Pre-trained Models
Viaarxiv icon

Measuring Style Similarity in Diffusion Models

Add code
Apr 01, 2024
Figure 1 for Measuring Style Similarity in Diffusion Models
Figure 2 for Measuring Style Similarity in Diffusion Models
Figure 3 for Measuring Style Similarity in Diffusion Models
Figure 4 for Measuring Style Similarity in Diffusion Models
Viaarxiv icon

Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion

Add code
Mar 25, 2024
Figure 1 for Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion
Figure 2 for Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion
Figure 3 for Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion
Figure 4 for Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion
Viaarxiv icon

What do we learn from inverting CLIP models?

Add code
Mar 05, 2024
Figure 1 for What do we learn from inverting CLIP models?
Figure 2 for What do we learn from inverting CLIP models?
Figure 3 for What do we learn from inverting CLIP models?
Figure 4 for What do we learn from inverting CLIP models?
Viaarxiv icon

Coercing LLMs to do and reveal anything

Add code
Feb 21, 2024
Viaarxiv icon

Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text

Add code
Jan 22, 2024
Viaarxiv icon