LLM Agents


LLM agents, or Large Language Model agents, are advanced AI systems that use large language models to reason through a problem, create a plan to solve it, and execute the plan with the help of a set of tools. In other words, they have complex reasoning capabilities, memory, and the ability to execute tasks.

IntenTest: Stress Testing for Intent Integrity in API-Calling LLM Agents

Add code
Jun 09, 2025
Viaarxiv icon

Shapley-Coop: Credit Assignment for Emergent Cooperation in Self-Interested LLM Agents

Add code
Jun 09, 2025
Viaarxiv icon

MedChat: A Multi-Agent Framework for Multimodal Diagnosis with Large Language Models

Add code
Jun 09, 2025
Viaarxiv icon

G-Memory: Tracing Hierarchical Memory for Multi-Agent Systems

Add code
Jun 09, 2025
Viaarxiv icon

HeuriGym: An Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial Optimization

Add code
Jun 09, 2025
Viaarxiv icon

MCPWorld: A Unified Benchmarking Testbed for API, GUI, and Hybrid Computer Use Agents

Add code
Jun 09, 2025
Viaarxiv icon

ChemAgent: Enhancing LLMs for Chemistry and Materials Science through Tree-Search Based Tool Learning

Add code
Jun 09, 2025
Viaarxiv icon

Learn as Individuals, Evolve as a Team: Multi-agent LLMs Adaptation in Embodied Environments

Add code
Jun 08, 2025
Viaarxiv icon

LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement

Add code
Jun 09, 2025
Viaarxiv icon

SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems

Add code
Jun 09, 2025
Viaarxiv icon