Picture for Kaya Stechly

Kaya Stechly

Performative Thinking? The Brittle Correlation Between CoT Length and Problem Complexity

Add code
Sep 09, 2025
Viaarxiv icon

RL in Name Only? Analyzing the Structural Assumptions in RL post-training for LLMs

Add code
May 19, 2025
Figure 1 for RL in Name Only? Analyzing the Structural Assumptions in RL post-training for LLMs
Figure 2 for RL in Name Only? Analyzing the Structural Assumptions in RL post-training for LLMs
Figure 3 for RL in Name Only? Analyzing the Structural Assumptions in RL post-training for LLMs
Viaarxiv icon

Beyond Semantics: The Unreasonable Effectiveness of Reasonless Intermediate Tokens

Add code
May 19, 2025
Figure 1 for Beyond Semantics: The Unreasonable Effectiveness of Reasonless Intermediate Tokens
Figure 2 for Beyond Semantics: The Unreasonable Effectiveness of Reasonless Intermediate Tokens
Figure 3 for Beyond Semantics: The Unreasonable Effectiveness of Reasonless Intermediate Tokens
Figure 4 for Beyond Semantics: The Unreasonable Effectiveness of Reasonless Intermediate Tokens
Viaarxiv icon

(How) Do reasoning models reason?

Add code
Apr 14, 2025
Viaarxiv icon

Planning in Strawberry Fields: Evaluating and Improving the Planning and Scheduling Capabilities of LRM o1

Add code
Oct 03, 2024
Figure 1 for Planning in Strawberry Fields: Evaluating and Improving the Planning and Scheduling Capabilities of LRM o1
Figure 2 for Planning in Strawberry Fields: Evaluating and Improving the Planning and Scheduling Capabilities of LRM o1
Figure 3 for Planning in Strawberry Fields: Evaluating and Improving the Planning and Scheduling Capabilities of LRM o1
Figure 4 for Planning in Strawberry Fields: Evaluating and Improving the Planning and Scheduling Capabilities of LRM o1
Viaarxiv icon

Chain of Thoughtlessness: An Analysis of CoT in Planning

Add code
May 08, 2024
Figure 1 for Chain of Thoughtlessness: An Analysis of CoT in Planning
Figure 2 for Chain of Thoughtlessness: An Analysis of CoT in Planning
Figure 3 for Chain of Thoughtlessness: An Analysis of CoT in Planning
Figure 4 for Chain of Thoughtlessness: An Analysis of CoT in Planning
Viaarxiv icon

On the Self-Verification Limitations of Large Language Models on Reasoning and Planning Tasks

Add code
Feb 12, 2024
Figure 1 for On the Self-Verification Limitations of Large Language Models on Reasoning and Planning Tasks
Figure 2 for On the Self-Verification Limitations of Large Language Models on Reasoning and Planning Tasks
Figure 3 for On the Self-Verification Limitations of Large Language Models on Reasoning and Planning Tasks
Figure 4 for On the Self-Verification Limitations of Large Language Models on Reasoning and Planning Tasks
Viaarxiv icon

LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks

Add code
Feb 06, 2024
Figure 1 for LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks
Figure 2 for LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks
Figure 3 for LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks
Viaarxiv icon

GPT-4 Doesn't Know It's Wrong: An Analysis of Iterative Prompting for Reasoning Problems

Add code
Oct 19, 2023
Figure 1 for GPT-4 Doesn't Know It's Wrong: An Analysis of Iterative Prompting for Reasoning Problems
Figure 2 for GPT-4 Doesn't Know It's Wrong: An Analysis of Iterative Prompting for Reasoning Problems
Figure 3 for GPT-4 Doesn't Know It's Wrong: An Analysis of Iterative Prompting for Reasoning Problems
Figure 4 for GPT-4 Doesn't Know It's Wrong: An Analysis of Iterative Prompting for Reasoning Problems
Viaarxiv icon