LLM Agents


LLM agents, or Large Language Model agents, are advanced AI systems that use large language models to reason through a problem, create a plan to solve it, and execute the plan with the help of a set of tools. In other words, they have complex reasoning capabilities, memory, and the ability to execute tasks.

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Add code
Sep 10, 2025
Viaarxiv icon

Architecting Resilient LLM Agents: A Guide to Secure Plan-then-Execute Implementations

Add code
Sep 10, 2025
Viaarxiv icon

AutoODD: Agentic Audits via Bayesian Red Teaming in Black-Box Models

Add code
Sep 10, 2025
Viaarxiv icon

ChemBOMAS: Accelerated BO in Chemistry with LLM-Enhanced Multi-Agent System

Add code
Sep 10, 2025
Viaarxiv icon

Talking with Oompa Loompas: A novel framework for evaluating linguistic acquisition of LLM agents

Add code
Sep 09, 2025
Viaarxiv icon

Guided Reasoning in LLM-Driven Penetration Testing Using Structured Attack Trees

Add code
Sep 09, 2025
Viaarxiv icon

Astra: A Multi-Agent System for GPU Kernel Performance Optimization

Add code
Sep 09, 2025
Viaarxiv icon

Autonomous Code Evolution Meets NP-Completeness

Add code
Sep 09, 2025
Viaarxiv icon

Evaluating LLMs Without Oracle Feedback: Agentic Annotation Evaluation Through Unsupervised Consistency Signals

Add code
Sep 10, 2025
Viaarxiv icon

Getting In Contract with Large Language Models -- An Agency Theory Perspective On Large Language Model Alignment

Add code
Sep 09, 2025
Viaarxiv icon