Edoardo Mosca

Simpler becomes Harder: Do LLMs Exhibit a Coherent Behavior on Simplified Corpora?

Apr 10, 2024
Miriam Anschütz, Edoardo Mosca, Georg Groh

IFAN: An Explainability-Focused Interaction Framework for Humans and NLP Models

Mar 06, 2023
Edoardo Mosca, Daryna Dementieva, Tohid Ebrahim Ajdari, Maximilian Kummeth, Kirill Gringauz, Georg Groh

"That Is a Suspicious Reaction!": Interpreting Logits Variation to Detect NLP Adversarial Attacks

Apr 10, 2022
Edoardo Mosca, Shreyash Agarwal, Javier Rando-Ramirez, Georg Groh
