Picture for Sangmook Lee

Sangmook Lee

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Add code
Mar 25, 2026
Viaarxiv icon

Understanding Reasoning in LLMs through Strategic Information Allocation under Uncertainty

Add code
Mar 16, 2026
Viaarxiv icon

Beyond Normalization: Rethinking the Partition Function as a Difficulty Scheduler for RLVR

Add code
Feb 13, 2026
Viaarxiv icon

Confidence-Guided Stepwise Model Routing for Cost-Efficient Reasoning

Add code
Nov 09, 2025
Viaarxiv icon

ReflAct: World-Grounded Decision Making in LLM Agents via Goal-State Reflection

Add code
May 21, 2025
Viaarxiv icon

PLM-Based Discrete Diffusion Language Models with Entropy-Adaptive Gibbs Sampling

Add code
Nov 10, 2024
Figure 1 for PLM-Based Discrete Diffusion Language Models with Entropy-Adaptive Gibbs Sampling
Figure 2 for PLM-Based Discrete Diffusion Language Models with Entropy-Adaptive Gibbs Sampling
Figure 3 for PLM-Based Discrete Diffusion Language Models with Entropy-Adaptive Gibbs Sampling
Figure 4 for PLM-Based Discrete Diffusion Language Models with Entropy-Adaptive Gibbs Sampling
Viaarxiv icon