Nischay Singh

Misaligning Reasoning with Answers -- A Framework for Assessing LLM CoT Robustness

May 23, 2025

Enyi Jiang, Changming Xu, Nischay Singh, Gagandeep Singh

Abstract: LLMs' decision-making process is opaque, prompting the need for explanation techniques like Chain-of-Thought. To investigate the relationship between answer and reasoning, we design a novel evaluation framework, MATCHA. In domains like education and healthcare, reasoning is key for model trustworthiness. MATCHA reveals that LLMs under input perturbations can give inconsistent or nonsensical reasoning. Additionally, we use LLM judges to assess reasoning robustness across models. Our results show that LLMs are more vulnerable to input perturbations on multi-step and commonsense tasks than on logical tasks. We also show non-trivial transfer rates of our successful examples to black-box models. Our evaluation framework helps to better understand LLM reasoning mechanisms and guides future models toward more robust, reasoning-driven architectures that enforce answer-reasoning consistency.
