LLM


A large language model (LLM) is a computational model notable for its ability to achieve general-purpose language generation and other natural language processing tasks such as classification. Based on language models, LLMs acquire these abilities by learning statistical relationships from vast amounts of text during a computationally intensive self-supervised and semi-supervised training process.

A Survey of Reinforcement Learning for Large Reasoning Models

Add code
Sep 10, 2025
Viaarxiv icon

Large Language Model Hacking: Quantifying the Hidden Risks of Using LLMs for Text Annotation

Add code
Sep 10, 2025
Viaarxiv icon

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Add code
Sep 10, 2025
Viaarxiv icon

ChemBOMAS: Accelerated BO in Chemistry with LLM-Enhanced Multi-Agent System

Add code
Sep 10, 2025
Viaarxiv icon

X-Teaming Evolutionary M2S: Automated Discovery of Multi-turn to Single-turn Jailbreak Templates

Add code
Sep 10, 2025
Viaarxiv icon

Architecting Resilient LLM Agents: A Guide to Secure Plan-then-Execute Implementations

Add code
Sep 10, 2025
Viaarxiv icon

DischargeSim: A Simulation Benchmark for Educational Doctor-Patient Communication at Discharge

Add code
Sep 10, 2025
Viaarxiv icon

That's So FETCH: Fashioning Ensemble Techniques for LLM Classification in Civil Legal Intake and Referral

Add code
Sep 10, 2025
Viaarxiv icon

Exploratory Retrieval-Augmented Planning For Continual Embodied Instruction Following

Add code
Sep 10, 2025
Viaarxiv icon

Evaluating LLMs Without Oracle Feedback: Agentic Annotation Evaluation Through Unsupervised Consistency Signals

Add code
Sep 10, 2025
Viaarxiv icon