Picture for Minhyuk Kim

Minhyuk Kim

Beyond Penalizing Mistakes: Stabilizing Efficiency Training in Large Reasoning Models via Adaptive Correct-Only Rewards

Add code
Jun 21, 2026
Viaarxiv icon

CLEAR: Cross-Lingual Enhancement in Alignment via Reverse-training

Add code
Apr 07, 2026
Viaarxiv icon

TORSO: Template-Oriented Reasoning Towards General Tasks

Add code
Sep 11, 2025
Viaarxiv icon

Exploring Coding Spot: Understanding Parametric Contributions to LLM Coding Performance

Add code
Dec 10, 2024
Viaarxiv icon