Picture for Haritz Puerto

Haritz Puerto

Models That Know How Evaluations Are Designed Score Safer

Add code
May 27, 2026
Viaarxiv icon

Controllable Reasoning Models Are Private Thinkers

Add code
Feb 27, 2026
Viaarxiv icon

Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers

Add code
Jun 18, 2025
Viaarxiv icon

Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models

Add code
Oct 31, 2024
Figure 1 for Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models
Figure 2 for Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models
Figure 3 for Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models
Figure 4 for Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models
Viaarxiv icon

Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models

Add code
Jul 03, 2024
Figure 1 for Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models
Figure 2 for Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models
Figure 3 for Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models
Figure 4 for Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models
Viaarxiv icon

Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs

Add code
Jan 18, 2024
Figure 1 for Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs
Figure 2 for Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs
Figure 3 for Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs
Figure 4 for Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs
Viaarxiv icon

Surveying (Dis)Parities and Concerns of Compute Hungry NLP Research

Add code
Jun 29, 2023
Figure 1 for Surveying (Dis)Parities and Concerns of Compute Hungry NLP Research
Figure 2 for Surveying (Dis)Parities and Concerns of Compute Hungry NLP Research
Figure 3 for Surveying (Dis)Parities and Concerns of Compute Hungry NLP Research
Figure 4 for Surveying (Dis)Parities and Concerns of Compute Hungry NLP Research
Viaarxiv icon

UKP-SQuARE: An Interactive Tool for Teaching Question Answering

Add code
Jun 02, 2023
Figure 1 for UKP-SQuARE: An Interactive Tool for Teaching Question Answering
Figure 2 for UKP-SQuARE: An Interactive Tool for Teaching Question Answering
Figure 3 for UKP-SQuARE: An Interactive Tool for Teaching Question Answering
Figure 4 for UKP-SQuARE: An Interactive Tool for Teaching Question Answering
Viaarxiv icon

UKP-SQuARE v3: A Platform for Multi-Agent QA Research

Add code
Mar 31, 2023
Figure 1 for UKP-SQuARE v3: A Platform for Multi-Agent QA Research
Figure 2 for UKP-SQuARE v3: A Platform for Multi-Agent QA Research
Figure 3 for UKP-SQuARE v3: A Platform for Multi-Agent QA Research
Figure 4 for UKP-SQuARE v3: A Platform for Multi-Agent QA Research
Viaarxiv icon

UKP-SQuARE v2 Explainability and Adversarial Attacks for Trustworthy QA

Add code
Aug 23, 2022
Figure 1 for UKP-SQuARE v2 Explainability and Adversarial Attacks for Trustworthy QA
Figure 2 for UKP-SQuARE v2 Explainability and Adversarial Attacks for Trustworthy QA
Figure 3 for UKP-SQuARE v2 Explainability and Adversarial Attacks for Trustworthy QA
Figure 4 for UKP-SQuARE v2 Explainability and Adversarial Attacks for Trustworthy QA
Viaarxiv icon