Picture for Kuikun Liu

Kuikun Liu

Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving

Add code
Dec 12, 2025
Viaarxiv icon

Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning

Add code
Dec 12, 2025
Viaarxiv icon

Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning

Add code
Jul 22, 2025
Figure 1 for Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning
Figure 2 for Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning
Figure 3 for Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning
Figure 4 for Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning
Viaarxiv icon

The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner

Add code
Jul 17, 2025
Viaarxiv icon

RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy

Add code
Mar 31, 2025
Viaarxiv icon

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Add code
Feb 10, 2025
Viaarxiv icon

Are Your LLMs Capable of Stable Reasoning?

Add code
Dec 17, 2024
Figure 1 for Are Your LLMs Capable of Stable Reasoning?
Figure 2 for Are Your LLMs Capable of Stable Reasoning?
Figure 3 for Are Your LLMs Capable of Stable Reasoning?
Figure 4 for Are Your LLMs Capable of Stable Reasoning?
Viaarxiv icon

MindSearch: Mimicking Human Minds Elicits Deep AI Searcher

Add code
Jul 29, 2024
Figure 1 for MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
Figure 2 for MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
Figure 3 for MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
Figure 4 for MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
Viaarxiv icon

CIBench: Evaluating Your LLMs with a Code Interpreter Plugin

Add code
Jul 15, 2024
Figure 1 for CIBench: Evaluating Your LLMs with a Code Interpreter Plugin
Figure 2 for CIBench: Evaluating Your LLMs with a Code Interpreter Plugin
Figure 3 for CIBench: Evaluating Your LLMs with a Code Interpreter Plugin
Figure 4 for CIBench: Evaluating Your LLMs with a Code Interpreter Plugin
Viaarxiv icon

AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data

Add code
May 29, 2024
Viaarxiv icon