Picture for Avi Schwarzschild

Avi Schwarzschild

The CLRS-Text Algorithmic Reasoning Language Benchmark

Add code
Jun 06, 2024
Viaarxiv icon

Transformers Can Do Arithmetic with the Right Embeddings

Add code
May 27, 2024
Figure 1 for Transformers Can Do Arithmetic with the Right Embeddings
Figure 2 for Transformers Can Do Arithmetic with the Right Embeddings
Figure 3 for Transformers Can Do Arithmetic with the Right Embeddings
Figure 4 for Transformers Can Do Arithmetic with the Right Embeddings
Viaarxiv icon

Rethinking LLM Memorization through the Lens of Adversarial Compression

Add code
Apr 23, 2024
Viaarxiv icon

Forcing Diffuse Distributions out of Language Models

Add code
Apr 16, 2024
Figure 1 for Forcing Diffuse Distributions out of Language Models
Figure 2 for Forcing Diffuse Distributions out of Language Models
Figure 3 for Forcing Diffuse Distributions out of Language Models
Figure 4 for Forcing Diffuse Distributions out of Language Models
Viaarxiv icon

Benchmarking ChatGPT on Algorithmic Reasoning

Add code
Apr 04, 2024
Viaarxiv icon

Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text

Add code
Jan 22, 2024
Viaarxiv icon

TOFU: A Task of Fictitious Unlearning for LLMs

Add code
Jan 11, 2024
Viaarxiv icon

Effective Backdoor Mitigation Depends on the Pre-training Objective

Add code
Dec 05, 2023
Figure 1 for Effective Backdoor Mitigation Depends on the Pre-training Objective
Figure 2 for Effective Backdoor Mitigation Depends on the Pre-training Objective
Figure 3 for Effective Backdoor Mitigation Depends on the Pre-training Objective
Figure 4 for Effective Backdoor Mitigation Depends on the Pre-training Objective
Viaarxiv icon

NEFTune: Noisy Embeddings Improve Instruction Finetuning

Add code
Oct 10, 2023
Figure 1 for NEFTune: Noisy Embeddings Improve Instruction Finetuning
Figure 2 for NEFTune: Noisy Embeddings Improve Instruction Finetuning
Figure 3 for NEFTune: Noisy Embeddings Improve Instruction Finetuning
Figure 4 for NEFTune: Noisy Embeddings Improve Instruction Finetuning
Viaarxiv icon

Baseline Defenses for Adversarial Attacks Against Aligned Language Models

Add code
Sep 04, 2023
Figure 1 for Baseline Defenses for Adversarial Attacks Against Aligned Language Models
Figure 2 for Baseline Defenses for Adversarial Attacks Against Aligned Language Models
Figure 3 for Baseline Defenses for Adversarial Attacks Against Aligned Language Models
Figure 4 for Baseline Defenses for Adversarial Attacks Against Aligned Language Models
Viaarxiv icon