Picture for Stella Biderman

Stella Biderman

Adversarial Samples Are Not Created Equal

Add code
Jan 02, 2026
Viaarxiv icon

Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations

Add code
Nov 06, 2025
Figure 1 for Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations
Figure 2 for Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations
Figure 3 for Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations
Figure 4 for Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations
Viaarxiv icon

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Add code
Jun 05, 2025
Viaarxiv icon

When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research

Add code
May 17, 2025
Viaarxiv icon

PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs

Add code
Mar 12, 2025
Figure 1 for PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs
Figure 2 for PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs
Figure 3 for PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs
Figure 4 for PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs
Viaarxiv icon

Open Problems in Mechanistic Interpretability

Add code
Jan 27, 2025
Figure 1 for Open Problems in Mechanistic Interpretability
Figure 2 for Open Problems in Mechanistic Interpretability
Figure 3 for Open Problems in Mechanistic Interpretability
Figure 4 for Open Problems in Mechanistic Interpretability
Viaarxiv icon

Towards Best Practices for Open Datasets for LLM Training

Add code
Jan 14, 2025
Viaarxiv icon

Bridging the Data Provenance Gap Across Text, Speech and Video

Add code
Dec 19, 2024
Figure 1 for Bridging the Data Provenance Gap Across Text, Speech and Video
Figure 2 for Bridging the Data Provenance Gap Across Text, Speech and Video
Figure 3 for Bridging the Data Provenance Gap Across Text, Speech and Video
Figure 4 for Bridging the Data Provenance Gap Across Text, Speech and Video
Viaarxiv icon

A Walsh Hadamard Derived Linear Vector Symbolic Architecture

Add code
Oct 30, 2024
Figure 1 for A Walsh Hadamard Derived Linear Vector Symbolic Architecture
Figure 2 for A Walsh Hadamard Derived Linear Vector Symbolic Architecture
Figure 3 for A Walsh Hadamard Derived Linear Vector Symbolic Architecture
Figure 4 for A Walsh Hadamard Derived Linear Vector Symbolic Architecture
Viaarxiv icon

Consent in Crisis: The Rapid Decline of the AI Data Commons

Add code
Jul 24, 2024
Figure 1 for Consent in Crisis: The Rapid Decline of the AI Data Commons
Figure 2 for Consent in Crisis: The Rapid Decline of the AI Data Commons
Figure 3 for Consent in Crisis: The Rapid Decline of the AI Data Commons
Figure 4 for Consent in Crisis: The Rapid Decline of the AI Data Commons
Viaarxiv icon