HotpotQA


CompactRAG: Reducing LLM Calls and Token Overhead in Multi-Hop Question Answering

Feb 05, 2026
Via arXiv

Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory

Feb 05, 2026
Via arXiv

EMA Policy Gradient: Taming Reinforcement Learning for LLMs with EMA Anchor and Top-k KL

Feb 04, 2026
Via arXiv

Atomic Information Flow: A Network Flow Model for Tool Attributions in RAG Systems

Feb 04, 2026
Via arXiv

ATACompressor: Adaptive Task-Aware Compression for Efficient Long-Context Processing in LLMs

Feb 03, 2026
Via arXiv

Fat-Cat: Document-Driven Metacognitive Multi-Agent System for Complex Reasoning

Feb 02, 2026
Via arXiv

MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents

Feb 02, 2026
Via arXiv

COMI: Coarse-to-fine Context Compression via Marginal Information Gain

Feb 02, 2026
Via arXiv

"I May Not Have Articulated Myself Clearly": Diagnosing Dynamic Instability in LLM Reasoning at Inference Time

Feb 02, 2026
Via arXiv

ShardMemo: Masked MoE Routing for Sharded Agentic LLM Memory

Jan 29, 2026
Via arXiv