Akshara Prabhakar

APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay

Apr 08, 2025, via arXiv

ActionStudio: A Lightweight Framework for Data and Training of Large Action Models

Mar 31, 2025, via arXiv

SpecTool: A Benchmark for Characterizing Errors in Tool-Use LLMs

Nov 20, 2024, via arXiv

Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

Nov 06, 2024, via arXiv

CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments

Nov 04, 2024, via arXiv

LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks

Oct 16, 2024, via arXiv

xLAM: A Family of Large Action Models to Empower AI Agent Systems

Sep 05, 2024, via arXiv

Deciphering the Factors Influencing the Efficacy of Chain-of-Thought: Probability, Memorization, and Noisy Reasoning

Jul 01, 2024, via arXiv

Language Models as Science Tutors

Feb 16, 2024, via arXiv

InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback

Jun 27, 2023, via arXiv