Picture for Mikel Rodriguez

Mikel Rodriguez

STAR: SocioTechnical Approach to Red Teaming Language Models

Add code
Jun 17, 2024
Figure 1 for STAR: SocioTechnical Approach to Red Teaming Language Models
Figure 2 for STAR: SocioTechnical Approach to Red Teaming Language Models
Figure 3 for STAR: SocioTechnical Approach to Red Teaming Language Models
Figure 4 for STAR: SocioTechnical Approach to Red Teaming Language Models
Viaarxiv icon

Holistic Safety and Responsibility Evaluations of Advanced AI Models

Add code
Apr 22, 2024
Viaarxiv icon

Responsible Reporting for Frontier AI Development

Add code
Apr 03, 2024
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Adversarial Machine Learning and Cybersecurity: Risks, Challenges, and Legal Implications

Add code
May 23, 2023
Viaarxiv icon

Adversarial Attack Attribution: Discovering Attributable Signals in Adversarial ML Attacks

Add code
Jan 08, 2021
Figure 1 for Adversarial Attack Attribution: Discovering Attributable Signals in Adversarial ML Attacks
Figure 2 for Adversarial Attack Attribution: Discovering Attributable Signals in Adversarial ML Attacks
Figure 3 for Adversarial Attack Attribution: Discovering Attributable Signals in Adversarial ML Attacks
Figure 4 for Adversarial Attack Attribution: Discovering Attributable Signals in Adversarial ML Attacks
Viaarxiv icon

Learning a Predictable and Generative Vector Representation for Objects

Add code
Aug 31, 2016
Figure 1 for Learning a Predictable and Generative Vector Representation for Objects
Figure 2 for Learning a Predictable and Generative Vector Representation for Objects
Figure 3 for Learning a Predictable and Generative Vector Representation for Objects
Figure 4 for Learning a Predictable and Generative Vector Representation for Objects
Viaarxiv icon