Picture for Pablo Valle

Pablo Valle

Evaluating Uncertainty and Quality of Visual Language Action-enabled Robots

Add code
Jul 22, 2025
Viaarxiv icon

Red Teaming Contemporary AI Models: Insights from Spanish and Basque Perspectives

Add code
Mar 13, 2025
Viaarxiv icon

o3-mini vs DeepSeek-R1: Which One is Safer?

Add code
Jan 31, 2025
Viaarxiv icon

Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation

Add code
Jan 29, 2025
Viaarxiv icon

ASTRAL: Automated Safety Testing of Large Language Models

Add code
Jan 28, 2025
Viaarxiv icon