Hotpotqa


PersonalAI 2.0: Enhancing knowledge graph traversal/retrieval with planning mechanism for Personalized LLM Agents

Add code
May 13, 2026
Viaarxiv icon

Retrieval is Cheap, Show Me the Code: Executable Multi-Hop Reasoning for Retrieval-Augmented Generation

Add code
May 13, 2026
Viaarxiv icon

CANTANTE: Optimizing Agentic Systems via Contrastive Credit Attribution

Add code
May 13, 2026
Viaarxiv icon

More Is Not Always Better: Cross-Component Interference in LLM Agent Scaffolding

Add code
May 07, 2026
Viaarxiv icon

SURE-RAG: Sufficiency and Uncertainty-Aware Evidence Verification for Selective Retrieval-Augmented Generation

Add code
May 05, 2026
Viaarxiv icon

How Fast Should a Model Commit to Supervision? Training Reasoning Models on the Tsallis Loss Continuum

Add code
Apr 28, 2026
Viaarxiv icon

S2G-RAG: Structured Sufficiency and Gap Judging for Iterative Retrieval-Augmented QA

Add code
Apr 26, 2026
Viaarxiv icon

Evaluating Multi-Hop Reasoning in RAG Systems: A Comparison of LLM-Based Retriever Evaluation Strategies

Add code
Apr 20, 2026
Viaarxiv icon

ContraPrompt: Contrastive Prompt Optimization via Dyadic Reasoning Trace Analysis

Add code
Apr 20, 2026
Viaarxiv icon

Answer Only as Precisely as Justified: Calibrated Claim-Level Specificity Control for Agentic Systems

Add code
Apr 19, 2026
Viaarxiv icon