Picture for Felix Friedrich

Felix Friedrich

LIME: Making LLM Data More Efficient with Linguistic Metadata Embeddings

Add code
Dec 08, 2025
Viaarxiv icon

Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations

Add code
Nov 06, 2025
Figure 1 for Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations
Figure 2 for Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations
Figure 3 for Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations
Figure 4 for Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations
Viaarxiv icon

Measuring and Guiding Monosemanticity

Add code
Jun 24, 2025
Viaarxiv icon

EmoNet-Voice: A Fine-Grained, Expert-Verified Benchmark for Speech Emotion Detection

Add code
Jun 11, 2025
Viaarxiv icon

EmoNet-Face: An Expert-Annotated Benchmark for Synthetic Emotion Recognition

Add code
May 26, 2025
Viaarxiv icon

The Cake that is Intelligence and Who Gets to Bake it: An AI Analogy and its Implications for Participation

Add code
Feb 06, 2025
Viaarxiv icon

MSTS: A Multimodal Safety Test Suite for Vision-Language Models

Add code
Jan 17, 2025
Viaarxiv icon

LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps

Add code
Dec 19, 2024
Figure 1 for LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps
Figure 2 for LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps
Figure 3 for LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps
Figure 4 for LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps
Viaarxiv icon

Navigating Shortcuts, Spurious Correlations, and Confounders: From Origins via Detection to Mitigation

Add code
Dec 06, 2024
Figure 1 for Navigating Shortcuts, Spurious Correlations, and Confounders: From Origins via Detection to Mitigation
Figure 2 for Navigating Shortcuts, Spurious Correlations, and Confounders: From Origins via Detection to Mitigation
Figure 3 for Navigating Shortcuts, Spurious Correlations, and Confounders: From Origins via Detection to Mitigation
Figure 4 for Navigating Shortcuts, Spurious Correlations, and Confounders: From Origins via Detection to Mitigation
Viaarxiv icon

SCAR: Sparse Conditioned Autoencoders for Concept Detection and Steering in LLMs

Add code
Nov 11, 2024
Viaarxiv icon