Picture for Tom Goldstein

Tom Goldstein

OPTune: Efficient Online Preference Tuning

Add code
Jun 11, 2024
Figure 1 for OPTune: Efficient Online Preference Tuning
Figure 2 for OPTune: Efficient Online Preference Tuning
Figure 3 for OPTune: Efficient Online Preference Tuning
Figure 4 for OPTune: Efficient Online Preference Tuning
Viaarxiv icon

The CLRS-Text Algorithmic Reasoning Language Benchmark

Add code
Jun 06, 2024
Figure 1 for The CLRS-Text Algorithmic Reasoning Language Benchmark
Figure 2 for The CLRS-Text Algorithmic Reasoning Language Benchmark
Figure 3 for The CLRS-Text Algorithmic Reasoning Language Benchmark
Figure 4 for The CLRS-Text Algorithmic Reasoning Language Benchmark
Viaarxiv icon

Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement

Add code
May 29, 2024
Figure 1 for Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement
Figure 2 for Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement
Figure 3 for Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement
Figure 4 for Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement
Viaarxiv icon

Transformers Can Do Arithmetic with the Right Embeddings

Add code
May 27, 2024
Figure 1 for Transformers Can Do Arithmetic with the Right Embeddings
Figure 2 for Transformers Can Do Arithmetic with the Right Embeddings
Figure 3 for Transformers Can Do Arithmetic with the Right Embeddings
Figure 4 for Transformers Can Do Arithmetic with the Right Embeddings
Viaarxiv icon

CinePile: A Long Video Question Answering Dataset and Benchmark

Add code
May 14, 2024
Figure 1 for CinePile: A Long Video Question Answering Dataset and Benchmark
Figure 2 for CinePile: A Long Video Question Answering Dataset and Benchmark
Figure 3 for CinePile: A Long Video Question Answering Dataset and Benchmark
Figure 4 for CinePile: A Long Video Question Answering Dataset and Benchmark
Viaarxiv icon

LMD3: Language Model Data Density Dependence

Add code
May 10, 2024
Figure 1 for LMD3: Language Model Data Density Dependence
Figure 2 for LMD3: Language Model Data Density Dependence
Figure 3 for LMD3: Language Model Data Density Dependence
Figure 4 for LMD3: Language Model Data Density Dependence
Viaarxiv icon

Benchmarking ChatGPT on Algorithmic Reasoning

Add code
Apr 04, 2024
Figure 1 for Benchmarking ChatGPT on Algorithmic Reasoning
Figure 2 for Benchmarking ChatGPT on Algorithmic Reasoning
Figure 3 for Benchmarking ChatGPT on Algorithmic Reasoning
Figure 4 for Benchmarking ChatGPT on Algorithmic Reasoning
Viaarxiv icon

Measuring Style Similarity in Diffusion Models

Add code
Apr 01, 2024
Figure 1 for Measuring Style Similarity in Diffusion Models
Figure 2 for Measuring Style Similarity in Diffusion Models
Figure 3 for Measuring Style Similarity in Diffusion Models
Figure 4 for Measuring Style Similarity in Diffusion Models
Viaarxiv icon

Privacy Backdoors: Enhancing Membership Inference through Poisoning Pre-trained Models

Add code
Apr 01, 2024
Viaarxiv icon

Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion

Add code
Mar 25, 2024
Figure 1 for Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion
Figure 2 for Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion
Figure 3 for Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion
Figure 4 for Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion
Viaarxiv icon