Empirically evaluating commonsense intelligence in large language models with large-scale human judgments

Add code
May 15, 2025
Figure 1 for Empirically evaluating commonsense intelligence in large language models with large-scale human judgments
Figure 2 for Empirically evaluating commonsense intelligence in large language models with large-scale human judgments
Figure 3 for Empirically evaluating commonsense intelligence in large language models with large-scale human judgments
Figure 4 for Empirically evaluating commonsense intelligence in large language models with large-scale human judgments

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: