LLM Agent


An LLM agent (Large Language Model agent) is an AI system that uses a large language model to reason through a problem, create a plan to solve it, and execute that plan with the help of a set of tools. In other words, it combines reasoning, memory, and the ability to carry out tasks.
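
The plan-then-execute loop described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not any particular framework's API: the `Agent` class, the `add_tool` and `upper_tool` functions, and the semicolon-separated task format are all hypothetical, and a simple rule-based `plan` method stands in for the language model's reasoning step so that the example runs without a model.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# A tool is any function that takes a string argument and returns a string.
Tool = Callable[[str], str]


def add_tool(arg: str) -> str:
    """Hypothetical tool: add two comma-separated integers."""
    a, b = arg.split(",")
    return str(int(a) + int(b))


def upper_tool(arg: str) -> str:
    """Hypothetical tool: upper-case the input text."""
    return arg.upper()


@dataclass
class Agent:
    tools: Dict[str, Tool]
    # Memory records every (tool name, argument, observation) triple.
    memory: List[Tuple[str, str, str]] = field(default_factory=list)

    def plan(self, task: str) -> List[Tuple[str, str]]:
        # Stand-in for the LLM's reasoning step (an assumption of this sketch):
        # split the task into "tool argument" steps separated by semicolons.
        steps = []
        for part in task.split(";"):
            name, _, arg = part.strip().partition(" ")
            steps.append((name, arg))
        return steps

    def run(self, task: str) -> List[str]:
        # Execute each planned step with the matching tool and remember the result.
        results = []
        for name, arg in self.plan(task):
            tool = self.tools.get(name)
            observation = tool(arg) if tool else f"unknown tool: {name}"
            self.memory.append((name, arg, observation))
            results.append(observation)
        return results


agent = Agent(tools={"add": add_tool, "upper": upper_tool})
outputs = agent.run("add 2,3; upper hello")
print(outputs)
```

In a real agent, `plan` would be replaced by a call to a language model, and the contents of `memory` would typically be fed back into later prompts so that earlier observations can inform later steps.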

AgentCollab: A Self-Evaluation-Driven Collaboration Paradigm for Efficient LLM Agents

Mar 27, 2026 (via arXiv)

Ask or Assume? Uncertainty-Aware Clarification-Seeking in Coding Agents

Mar 27, 2026 (via arXiv)

Can AI Scientist Agents Learn from Lab-in-the-Loop Feedback? Evidence from Iterative Perturbation Discovery

Mar 27, 2026 (via arXiv)

FinMCP-Bench: Benchmarking LLM Agents for Real-World Financial Tool Use under the Model Context Protocol

Mar 26, 2026 (via arXiv)

MemoryCD: Benchmarking Long-Context User Memory of LLM Agents for Lifelong Cross-Domain Personalization

Mar 26, 2026 (via arXiv)

The System Prompt Is the Attack Surface: How LLM Agent Configuration Shapes Security and Creates Exploitable Vulnerabilities

Mar 26, 2026 (via arXiv)

AD-CARE: A Guideline-grounded, Modality-agnostic LLM Agent for Real-world Alzheimer's Disease Diagnosis with Multi-cohort Assessment, Fairness Analysis, and Reader Study

Mar 26, 2026 (via arXiv)

Learning to Commit: Generating Organic Pull Requests via Online Repository Memory

Mar 27, 2026 (via arXiv)

Clawed and Dangerous: Can We Trust Open Agentic Systems?

Mar 27, 2026 (via arXiv)

The Kitchen Loop: User-Spec-Driven Development for a Self-Evolving Codebase

Mar 26, 2026 (via arXiv)