Aaron Sandoval

Direct Confidence Alignment: Aligning Verbalized Confidence with Internal Confidence In Large Language Models

Dec 12, 2025
Via arXiv

Factor(U,T): Controlling Untrusted AI by Monitoring their Plans

Dec 12, 2025
Via arXiv