Topic


Overcoming the Retrieval Barrier: Indirect Prompt Injection in the Wild for LLM Systems

Add code
Jan 11, 2026
Viaarxiv icon

HiMem: Hierarchical Long-Term Memory for LLM Long-Horizon Agents

Add code
Jan 10, 2026
Viaarxiv icon

NC-Bench: An LLM Benchmark for Evaluating Conversational Competence

Add code
Jan 10, 2026
Viaarxiv icon

CLewR: Curriculum Learning with Restarts for Machine Translation Preference Learning

Add code
Jan 09, 2026
Viaarxiv icon

HAG: Hierarchical Demographic Tree-based Agent Generation for Topic-Adaptive Simulation

Add code
Jan 09, 2026
Viaarxiv icon

EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis

Add code
Jan 09, 2026
Viaarxiv icon

From National Curricula to Cultural Awareness: Constructing Open-Ended Culture-Specific Question Answering Dataset

Add code
Jan 08, 2026
Viaarxiv icon

Multi-Disciplinary Dataset Discovery from Citation-Verified Literature Contexts

Add code
Jan 08, 2026
Viaarxiv icon

PsychEval: A Multi-Session and Multi-Therapy Benchmark for High-Realism AI Psychological Counselor

Add code
Jan 08, 2026
Viaarxiv icon

In Search of Grandmother Cells: Tracing Interpretable Neurons in Tabular Representations

Add code
Jan 07, 2026
Viaarxiv icon