Karthik Valmeekam

RL in Name Only? Analyzing the Structural Assumptions in RL post-training for LLMs

May 19, 2025
Via arXiv

Beyond Semantics: The Unreasonable Effectiveness of Reasonless Intermediate Tokens

May 19, 2025
Via arXiv

(How) Do reasoning models reason?

Apr 14, 2025
Via arXiv

Robust Planning with Compound LLM Architectures: An LLM-Modulo Approach

Nov 20, 2024
Via arXiv

Planning in Strawberry Fields: Evaluating and Improving the Planning and Scheduling Capabilities of LRM o1

Oct 03, 2024
Via arXiv

Robust Planning with LLM-Modulo Framework: Case Study in Travel Planning

May 31, 2024
Via arXiv

Chain of Thoughtlessness: An Analysis of CoT in Planning

May 08, 2024
Via arXiv

On the Self-Verification Limitations of Large Language Models on Reasoning and Planning Tasks

Feb 12, 2024
Via arXiv

LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks

Feb 06, 2024
Via arXiv

Can Large Language Models Really Improve by Self-critiquing Their Own Plans?

Oct 12, 2023
Via arXiv