Picture for Spencer Mateega

Spencer Mateega

Market-Bench: Evaluating Large Language Models on Introductory Quantitative Trading and Market Dynamics

Add code
Dec 13, 2025
Figure 1 for Market-Bench: Evaluating Large Language Models on Introductory Quantitative Trading and Market Dynamics
Figure 2 for Market-Bench: Evaluating Large Language Models on Introductory Quantitative Trading and Market Dynamics
Figure 3 for Market-Bench: Evaluating Large Language Models on Introductory Quantitative Trading and Market Dynamics
Figure 4 for Market-Bench: Evaluating Large Language Models on Introductory Quantitative Trading and Market Dynamics
Viaarxiv icon

VADER: A Human-Evaluated Benchmark for Vulnerability Assessment, Detection, Explanation, and Remediation

Add code
May 26, 2025
Viaarxiv icon

FinanceQA: A Benchmark for Evaluating Financial Analysis Capabilities of Large Language Models

Add code
Jan 30, 2025
Viaarxiv icon