Maxwell Crouse

NESTFUL: A Benchmark for Evaluating LLMs on Nested Sequences of API Calls

Sep 04, 2024

Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks

Jun 27, 2024

Granite Code Models: A Family of Open Foundation Models for Code Intelligence

May 07, 2024

API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs

Feb 23, 2024

Formally Specifying the High-Level Behavior of LLM-Based Agents

Oct 12, 2023

Compositional Program Generation for Systematic Generalization

Sep 28, 2023

MISMATCH: Fine-grained Evaluation of Machine-generated Text with Mismatch Error Types

Jun 18, 2023

Scalable Learning of Latent Language Structure With Logical Offline Cycle Consistency

May 31, 2023

An Ensemble Approach for Automated Theorem Proving Based on Efficient Name Invariant Graph Neural Representations

May 15, 2023

Laziness Is a Virtue When It Comes to Compositionality in Neural Semantic Parsing

May 07, 2023