Picture for Mani Malek

Mani Malek

Hiding in Plain Text: Detecting Concealed Jailbreaks via Activation Disentanglement

Add code
Feb 23, 2026
Viaarxiv icon

Interpreting and Controlling Model Behavior via Constitutions for Atomic Concept Edits

Add code
Jan 23, 2026
Viaarxiv icon

ShieldGemma 2: Robust and Tractable Image Content Moderation

Add code
Apr 01, 2025
Viaarxiv icon

FEL: High Capacity Learning for Recommendation and Ranking via Federated Ensemble Learning

Add code
Jun 07, 2022
Figure 1 for FEL: High Capacity Learning for Recommendation and Ranking via Federated Ensemble Learning
Figure 2 for FEL: High Capacity Learning for Recommendation and Ranking via Federated Ensemble Learning
Figure 3 for FEL: High Capacity Learning for Recommendation and Ranking via Federated Ensemble Learning
Figure 4 for FEL: High Capacity Learning for Recommendation and Ranking via Federated Ensemble Learning
Viaarxiv icon

Papaya: Practical, Private, and Scalable Federated Learning

Add code
Nov 08, 2021
Figure 1 for Papaya: Practical, Private, and Scalable Federated Learning
Figure 2 for Papaya: Practical, Private, and Scalable Federated Learning
Figure 3 for Papaya: Practical, Private, and Scalable Federated Learning
Figure 4 for Papaya: Practical, Private, and Scalable Federated Learning
Viaarxiv icon

Opacus: User-Friendly Differential Privacy Library in PyTorch

Add code
Oct 05, 2021
Figure 1 for Opacus: User-Friendly Differential Privacy Library in PyTorch
Figure 2 for Opacus: User-Friendly Differential Privacy Library in PyTorch
Viaarxiv icon

Antipodes of Label Differential Privacy: PATE and ALIBI

Add code
Jun 07, 2021
Figure 1 for Antipodes of Label Differential Privacy: PATE and ALIBI
Figure 2 for Antipodes of Label Differential Privacy: PATE and ALIBI
Figure 3 for Antipodes of Label Differential Privacy: PATE and ALIBI
Figure 4 for Antipodes of Label Differential Privacy: PATE and ALIBI
Viaarxiv icon