Picture for Amir Houmansadr

Amir Houmansadr

Can Large Language Models Really Recognize Your Name?

Add code
May 20, 2025
Viaarxiv icon

R1dacted: Investigating Local Censorship in DeepSeek's R1 Language Model

Add code
May 19, 2025
Viaarxiv icon

VIDSTAMP: A Temporally-Aware Watermark for Ownership and Integrity in Video Diffusion Models

Add code
May 02, 2025
Viaarxiv icon

Multilingual and Multi-Accent Jailbreaking of Audio LLMs

Add code
Apr 01, 2025
Viaarxiv icon

OverThink: Slowdown Attacks on Reasoning LLMs

Add code
Feb 05, 2025
Viaarxiv icon

OVERTHINKING: Slowdown Attacks on Reasoning LLMs

Add code
Feb 04, 2025
Viaarxiv icon

ULTra: Unveiling Latent Token Interpretability in Transformer Based Understanding

Add code
Nov 15, 2024
Figure 1 for ULTra: Unveiling Latent Token Interpretability in Transformer Based Understanding
Figure 2 for ULTra: Unveiling Latent Token Interpretability in Transformer Based Understanding
Figure 3 for ULTra: Unveiling Latent Token Interpretability in Transformer Based Understanding
Figure 4 for ULTra: Unveiling Latent Token Interpretability in Transformer Based Understanding
Viaarxiv icon

Data Extraction Attacks in Retrieval-Augmented Generation via Backdoors

Add code
Nov 03, 2024
Figure 1 for Data Extraction Attacks in Retrieval-Augmented Generation via Backdoors
Figure 2 for Data Extraction Attacks in Retrieval-Augmented Generation via Backdoors
Figure 3 for Data Extraction Attacks in Retrieval-Augmented Generation via Backdoors
Figure 4 for Data Extraction Attacks in Retrieval-Augmented Generation via Backdoors
Viaarxiv icon

Bias Similarity Across Large Language Models

Add code
Oct 15, 2024
Viaarxiv icon

Injecting Bias in Text-To-Image Models via Composite-Trigger Backdoors

Add code
Jun 21, 2024
Figure 1 for Injecting Bias in Text-To-Image Models via Composite-Trigger Backdoors
Figure 2 for Injecting Bias in Text-To-Image Models via Composite-Trigger Backdoors
Figure 3 for Injecting Bias in Text-To-Image Models via Composite-Trigger Backdoors
Figure 4 for Injecting Bias in Text-To-Image Models via Composite-Trigger Backdoors
Viaarxiv icon