Patrick Schramowski

Soft Begging: Modular and Efficient Shielding of LLMs against Prompt Injection and Jailbreaking based on Prompt Tuning

Jul 03, 2024

T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings

Jun 27, 2024

LLavaGuard: VLM-based Safeguards for Vision Dataset Curation and Safety Assessment

Jun 07, 2024

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Apr 18, 2024

ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red Teaming

Apr 06, 2024

DeiSAM: Segment Anything with Deictic Prompting

Feb 21, 2024

Multilingual Text-to-Image Generation Magnifies Gender Stereotypes and Prompt Engineering May Not Help You

Jan 31, 2024

LEDITS++: Limitless Image Editing using Text-to-Image Models

Nov 28, 2023

Divergent Token Metrics: Measuring degradation to prune away LLM components -- and optimize quantization

Nov 13, 2023

Distilling Adversarial Prompts from Safety Benchmarks: Report for the Adversarial Nibbler Challenge

Sep 20, 2023