Picture for John Harrison

John Harrison

Eds.

s2n-bignum-bench: A practical benchmark for evaluating low-level code reasoning of LLMs

Add code
Mar 15, 2026
Viaarxiv icon

Grandes Modelos de Linguagem Multimodais (MLLMs): Da Teoria à Prática

Add code
Feb 11, 2026
Viaarxiv icon

Do Multimodal LLMs See Sentiment?

Add code
Aug 23, 2025
Viaarxiv icon

LPAR-05 Workshop: Empirically Successfull Automated Reasoning in Higher-Order Logic (ESHOL)

Add code
Jan 10, 2006
Figure 1 for LPAR-05 Workshop: Empirically Successfull Automated Reasoning in Higher-Order Logic (ESHOL)
Figure 2 for LPAR-05 Workshop: Empirically Successfull Automated Reasoning in Higher-Order Logic (ESHOL)
Figure 3 for LPAR-05 Workshop: Empirically Successfull Automated Reasoning in Higher-Order Logic (ESHOL)
Figure 4 for LPAR-05 Workshop: Empirically Successfull Automated Reasoning in Higher-Order Logic (ESHOL)
Viaarxiv icon