Picture for Saeed Mahloujifar

Saeed Mahloujifar

How much do language models memorize?

Add code
May 30, 2025
Viaarxiv icon

Detecting Benchmark Contamination Through Watermarking

Add code
Feb 24, 2025
Viaarxiv icon

Unlocking Visual Secrets: Inverting Features with Diffusion Priors for Image Reconstruction

Add code
Dec 11, 2024
Viaarxiv icon

Auditing $f$-Differential Privacy in One Run

Add code
Oct 29, 2024
Figure 1 for Auditing $f$-Differential Privacy in One Run
Figure 2 for Auditing $f$-Differential Privacy in One Run
Figure 3 for Auditing $f$-Differential Privacy in One Run
Figure 4 for Auditing $f$-Differential Privacy in One Run
Viaarxiv icon

Aligning LLMs to Be Robust Against Prompt Injection

Add code
Oct 07, 2024
Figure 1 for Aligning LLMs to Be Robust Against Prompt Injection
Figure 2 for Aligning LLMs to Be Robust Against Prompt Injection
Figure 3 for Aligning LLMs to Be Robust Against Prompt Injection
Figure 4 for Aligning LLMs to Be Robust Against Prompt Injection
Viaarxiv icon

Guarantees of confidentiality via Hammersley-Chapman-Robbins bounds

Add code
Apr 06, 2024
Figure 1 for Guarantees of confidentiality via Hammersley-Chapman-Robbins bounds
Figure 2 for Guarantees of confidentiality via Hammersley-Chapman-Robbins bounds
Figure 3 for Guarantees of confidentiality via Hammersley-Chapman-Robbins bounds
Figure 4 for Guarantees of confidentiality via Hammersley-Chapman-Robbins bounds
Viaarxiv icon

Privacy Amplification for the Gaussian Mechanism via Bounded Support

Add code
Mar 07, 2024
Figure 1 for Privacy Amplification for the Gaussian Mechanism via Bounded Support
Figure 2 for Privacy Amplification for the Gaussian Mechanism via Bounded Support
Figure 3 for Privacy Amplification for the Gaussian Mechanism via Bounded Support
Figure 4 for Privacy Amplification for the Gaussian Mechanism via Bounded Support
Viaarxiv icon

Private Fine-tuning of Large Language Models with Zeroth-order Optimization

Add code
Jan 09, 2024
Figure 1 for Private Fine-tuning of Large Language Models with Zeroth-order Optimization
Figure 2 for Private Fine-tuning of Large Language Models with Zeroth-order Optimization
Figure 3 for Private Fine-tuning of Large Language Models with Zeroth-order Optimization
Figure 4 for Private Fine-tuning of Large Language Models with Zeroth-order Optimization
Viaarxiv icon

Publicly Detectable Watermarking for Language Models

Add code
Oct 27, 2023
Viaarxiv icon

A Randomized Approach for Tight Privacy Accounting

Add code
Apr 17, 2023
Viaarxiv icon