Gonzalo Martínez

To Words and Beyond: Probing Large Language Models for Sentence-Level Psycholinguistic Norms of Memorability and Reading Times

Mar 12, 2026
Via arXiv

Adding LLMs to the psycholinguistic norming toolbox: A practical guide to getting the most out of human ratings

Sep 17, 2025
Via arXiv

The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations

Jul 17, 2025
Via arXiv

La Leaderboard: A Large Language Model Leaderboard for Spanish Varieties and Languages of Spain and Latin America

Jul 01, 2025
Via arXiv

Understanding the Impact of Artificial Intelligence in Academic Writing: Metadata to the Rescue

Feb 23, 2025
Via arXiv

Can ChatGPT Learn to Count Letters?

Feb 23, 2025
Via arXiv

Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong

Jan 16, 2025
Via arXiv

Open Source Conversational LLMs do not know most Spanish words

Mar 21, 2024
Via arXiv

Beware of Words: Evaluating the Lexical Richness of Conversational Large Language Models

Feb 11, 2024
Via arXiv

The continued usefulness of vocabulary tests for evaluating large language models

Oct 23, 2023
Via arXiv