Dario Satriani

Constraint Decay: The Fragility of LLM Agents in Backend Code Generation

May 07, 2026
Via arXiv

RelationalFactQA: A Benchmark for Evaluating Tabular Fact Retrieval from Large Language Models

May 27, 2025
Via arXiv