Picture for Tom Goldstein

Tom Goldstein

A Performance-Driven Benchmark for Feature Selection in Tabular Deep Learning

Add code
Nov 10, 2023
Figure 1 for A Performance-Driven Benchmark for Feature Selection in Tabular Deep Learning
Figure 2 for A Performance-Driven Benchmark for Feature Selection in Tabular Deep Learning
Figure 3 for A Performance-Driven Benchmark for Feature Selection in Tabular Deep Learning
Figure 4 for A Performance-Driven Benchmark for Feature Selection in Tabular Deep Learning
Viaarxiv icon

A Simple and Efficient Baseline for Data Attribution on Images

Add code
Nov 03, 2023
Viaarxiv icon

NEFTune: Noisy Embeddings Improve Instruction Finetuning

Add code
Oct 10, 2023
Viaarxiv icon

Baseline Defenses for Adversarial Attacks Against Aligned Language Models

Add code
Sep 04, 2023
Figure 1 for Baseline Defenses for Adversarial Attacks Against Aligned Language Models
Figure 2 for Baseline Defenses for Adversarial Attacks Against Aligned Language Models
Figure 3 for Baseline Defenses for Adversarial Attacks Against Aligned Language Models
Figure 4 for Baseline Defenses for Adversarial Attacks Against Aligned Language Models
Viaarxiv icon

On the Reliability of Watermarks for Large Language Models

Add code
Jun 30, 2023
Figure 1 for On the Reliability of Watermarks for Large Language Models
Figure 2 for On the Reliability of Watermarks for Large Language Models
Figure 3 for On the Reliability of Watermarks for Large Language Models
Figure 4 for On the Reliability of Watermarks for Large Language Models
Viaarxiv icon

Seeing in Words: Learning to Classify through Language Bottlenecks

Add code
Jun 29, 2023
Figure 1 for Seeing in Words: Learning to Classify through Language Bottlenecks
Figure 2 for Seeing in Words: Learning to Classify through Language Bottlenecks
Viaarxiv icon

Bring Your Own Data! Self-Supervised Evaluation for Large Language Models

Add code
Jun 29, 2023
Viaarxiv icon

On the Exploitability of Instruction Tuning

Add code
Jun 28, 2023
Figure 1 for On the Exploitability of Instruction Tuning
Figure 2 for On the Exploitability of Instruction Tuning
Figure 3 for On the Exploitability of Instruction Tuning
Figure 4 for On the Exploitability of Instruction Tuning
Viaarxiv icon

InstructZero: Efficient Instruction Optimization for Black-Box Large Language Models

Add code
Jun 05, 2023
Viaarxiv icon

Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust

Add code
Jun 01, 2023
Viaarxiv icon