Picture for Chris Parnin

Chris Parnin

Evaluating the Evaluator: Measuring LLMs' Adherence to Task Evaluation Instructions

Add code
Aug 16, 2024
Figure 1 for Evaluating the Evaluator: Measuring LLMs' Adherence to Task Evaluation Instructions
Figure 2 for Evaluating the Evaluator: Measuring LLMs' Adherence to Task Evaluation Instructions
Figure 3 for Evaluating the Evaluator: Measuring LLMs' Adherence to Task Evaluation Instructions
Figure 4 for Evaluating the Evaluator: Measuring LLMs' Adherence to Task Evaluation Instructions
Viaarxiv icon

Exploring Interaction Patterns for Debugging: Enhancing Conversational Capabilities of AI-assistants

Add code
Feb 09, 2024
Viaarxiv icon

Tabular Representation, Noisy Operators, and Impacts on Table Structure Understanding Tasks in LLMs

Add code
Oct 16, 2023
Figure 1 for Tabular Representation, Noisy Operators, and Impacts on Table Structure Understanding Tasks in LLMs
Figure 2 for Tabular Representation, Noisy Operators, and Impacts on Table Structure Understanding Tasks in LLMs
Figure 3 for Tabular Representation, Noisy Operators, and Impacts on Table Structure Understanding Tasks in LLMs
Figure 4 for Tabular Representation, Noisy Operators, and Impacts on Table Structure Understanding Tasks in LLMs
Viaarxiv icon