Picture for Chad DeLuca

Chad DeLuca

A Systematic Approach for Large Language Models Debugging

Add code
Apr 24, 2026
Viaarxiv icon

STaD: Scaffolded Task Design for Identifying Compositional Skill Gaps in LLMs

Add code
Apr 21, 2026
Viaarxiv icon

LogitScope: A Framework for Analyzing LLM Uncertainty Through Information Metrics

Add code
Mar 26, 2026
Viaarxiv icon

LLMON: An LLM-native Markup Language to Leverage Structure and Semantics at the LLM Interface

Add code
Mar 23, 2026
Viaarxiv icon

MermaidSeqBench: An Evaluation Benchmark for LLM-to-Mermaid Sequence Diagram Generation

Add code
Nov 18, 2025
Viaarxiv icon