Picture for Nathaniel Daw

Nathaniel Daw

How LLMs Detect and Correct Their Own Errors: The Role of Internal Confidence Signals

Add code
Apr 24, 2026
Viaarxiv icon

Causal Evidence that Language Models use Confidence to Drive Behavior

Add code
Mar 23, 2026
Viaarxiv icon

Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis

Add code
Jun 29, 2023
Figure 1 for Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis
Figure 2 for Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis
Figure 3 for Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis
Figure 4 for Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis
Viaarxiv icon