Picture for Benjamin I. P. Rubinstein

Benjamin I. P. Rubinstein

Multi-level Certified Defense Against Poisoning Attacks in Offline Reinforcement Learning

Add code
May 27, 2025
Viaarxiv icon

Fox in the Henhouse: Supply-Chain Backdoor Attacks Against Reinforcement Learning

Add code
May 26, 2025
Viaarxiv icon

DDAD: A Two-pronged Adversarial Defense Based on Distributional Discrepancy

Add code
Mar 04, 2025
Viaarxiv icon

CERT-ED: Certifiably Robust Text Classification for Edit Distance

Add code
Aug 01, 2024
Viaarxiv icon

Adaptive Data Analysis for Growing Data

Add code
May 22, 2024
Viaarxiv icon

SEEP: Training Dynamics Grounds Latent Representation Search for Mitigating Backdoor Poisoning Attacks

Add code
May 19, 2024
Viaarxiv icon

RS-Reg: Probabilistic and Robust Certified Regression Through Randomized Smoothing

Add code
May 14, 2024
Figure 1 for RS-Reg: Probabilistic and Robust Certified Regression Through Randomized Smoothing
Figure 2 for RS-Reg: Probabilistic and Robust Certified Regression Through Randomized Smoothing
Figure 3 for RS-Reg: Probabilistic and Robust Certified Regression Through Randomized Smoothing
Figure 4 for RS-Reg: Probabilistic and Robust Certified Regression Through Randomized Smoothing
Viaarxiv icon

Transferring Troubles: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning

Add code
Apr 30, 2024
Figure 1 for Transferring Troubles: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning
Figure 2 for Transferring Troubles: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning
Figure 3 for Transferring Troubles: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning
Figure 4 for Transferring Troubles: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning
Viaarxiv icon

Backdoor Attack on Multilingual Machine Translation

Add code
Apr 03, 2024
Viaarxiv icon

It's Simplex! Disaggregating Measures to Improve Certified Robustness

Add code
Sep 20, 2023
Figure 1 for It's Simplex! Disaggregating Measures to Improve Certified Robustness
Figure 2 for It's Simplex! Disaggregating Measures to Improve Certified Robustness
Figure 3 for It's Simplex! Disaggregating Measures to Improve Certified Robustness
Figure 4 for It's Simplex! Disaggregating Measures to Improve Certified Robustness
Viaarxiv icon