Picture for Samuele Poppi

Samuele Poppi

Robust Safety Monitoring of Language Models via Activation Watermarking

Add code
Mar 24, 2026
Viaarxiv icon

Robust and Calibrated Detection of Authentic Multimedia Content

Add code
Dec 17, 2025
Figure 1 for Robust and Calibrated Detection of Authentic Multimedia Content
Figure 2 for Robust and Calibrated Detection of Authentic Multimedia Content
Figure 3 for Robust and Calibrated Detection of Authentic Multimedia Content
Figure 4 for Robust and Calibrated Detection of Authentic Multimedia Content
Viaarxiv icon

Mitigating Watermark Stealing Attacks in Generative Models via Multi-Key Watermarking

Add code
Jul 10, 2025
Viaarxiv icon

Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Prefilling Attack

Add code
May 21, 2025
Viaarxiv icon

Towards Understanding the Fragility of Multilingual LLMs against Fine-Tuning Attacks

Add code
Oct 23, 2024
Viaarxiv icon

Removing NSFW Concepts from Vision-and-Language Models for Text-to-Image Retrieval and Generation

Add code
Nov 27, 2023
Viaarxiv icon

Multi-Class Explainable Unlearning for Image Classification via Weight Filtering

Add code
Apr 04, 2023
Viaarxiv icon

Revisiting The Evaluation of Class Activation Mapping for Explainability: A Novel Metric and Experimental Analysis

Add code
Apr 20, 2021
Figure 1 for Revisiting The Evaluation of Class Activation Mapping for Explainability: A Novel Metric and Experimental Analysis
Figure 2 for Revisiting The Evaluation of Class Activation Mapping for Explainability: A Novel Metric and Experimental Analysis
Figure 3 for Revisiting The Evaluation of Class Activation Mapping for Explainability: A Novel Metric and Experimental Analysis
Figure 4 for Revisiting The Evaluation of Class Activation Mapping for Explainability: A Novel Metric and Experimental Analysis
Viaarxiv icon