Aaron Chan

Design and Evaluation of Cost-Aware PoQ for Decentralized LLM Inference

Dec 18, 2025

SOP-Bench: Complex Industrial SOPs for Evaluating LLM Agents

Jun 09, 2025

Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming

Feb 22, 2024

Tailoring Self-Rationalizers with Multi-Reward Distillation

Nov 06, 2023

Resprompt: Residual Connection Prompting Advances Multi-Step Reasoning in Large Language Models

Oct 07, 2023

Are Machine Rationales (Not) Useful to Humans? Measuring and Improving Human Utility of Free-Text Rationales

May 11, 2023

KNIFE: Knowledge Distillation with Free-Text Rationales

Dec 19, 2022

PINTO: Faithful Language Reasoning Using Prompt-Generated Rationales

Nov 03, 2022

XMD: An End-to-End Framework for Interactive Explanation-Based Debugging of NLP Models

Oct 30, 2022

FRAME: Evaluating Simulatability Metrics for Free-Text Rationales

Jul 02, 2022