Patrick Schramowski

Measuring and Guiding Monosemanticity

Jun 24, 2025
Via arXiv

Judging Quality Across Languages: A Multilingual Approach to Pretraining Data Filtering with Language Models

May 28, 2025
Via arXiv

MSTS: A Multimodal Safety Test Suite for Vision-Language Models

Jan 17, 2025
Via arXiv

LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps

Dec 19, 2024
Via arXiv

SCAR: Sparse Conditioned Autoencoders for Concept Detection and Steering in LLMs

Nov 11, 2024
Via arXiv

Core Tokensets for Data-efficient Sequential Training of Transformers

Oct 08, 2024
Via arXiv

Soft Begging: Modular and Efficient Shielding of LLMs against Prompt Injection and Jailbreaking based on Prompt Tuning

Jul 03, 2024
Via arXiv

T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings

Jun 27, 2024
Via arXiv

LLavaGuard: VLM-based Safeguards for Vision Dataset Curation and Safety Assessment

Jun 07, 2024
Via arXiv

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Apr 18, 2024
Via arXiv