Patrick Schramowski

Judging Quality Across Languages: A Multilingual Approach to Pretraining Data Filtering with Language Models

May 28, 2025
Via arXiv

MSTS: A Multimodal Safety Test Suite for Vision-Language Models

Jan 17, 2025
Via arXiv

LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps

Dec 19, 2024
Via arXiv

SCAR: Sparse Conditioned Autoencoders for Concept Detection and Steering in LLMs

Nov 11, 2024
Via arXiv

Core Tokensets for Data-efficient Sequential Training of Transformers

Oct 08, 2024
Via arXiv

Soft Begging: Modular and Efficient Shielding of LLMs against Prompt Injection and Jailbreaking based on Prompt Tuning

Jul 03, 2024
Via arXiv

T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings

Jun 27, 2024
Via arXiv

LLavaGuard: VLM-based Safeguards for Vision Dataset Curation and Safety Assessment

Jun 07, 2024
Via arXiv

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Apr 18, 2024
Via arXiv

ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red Teaming

Apr 06, 2024
Via arXiv