Natalia Pérez-Campanero Antolín

Deceptive Automated Interpretability: Language Models Coordinating to Fool Oversight Systems

Apr 10, 2025
Via arXiv

Identifying Cooperative Personalities in Multi-agent Contexts through Personality Steering with Representation Engineering

Mar 17, 2025
Via arXiv

CryptoFormalEval: Integrating LLMs and Formal Verification for Automated Cryptographic Protocol Vulnerability Detection

Nov 20, 2024
Via arXiv