Picture for Jeonghye Kim

Jeonghye Kim

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Add code
Mar 25, 2026
Viaarxiv icon

Understanding Reasoning in LLMs through Strategic Information Allocation under Uncertainty

Add code
Mar 16, 2026
Viaarxiv icon

Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization

Add code
Feb 26, 2026
Viaarxiv icon

Beyond Normalization: Rethinking the Partition Function as a Difficulty Scheduler for RLVR

Add code
Feb 13, 2026
Viaarxiv icon

Align While Search: Belief-Guided Exploratory Inference for World-Grounded Embodied Agents

Add code
Dec 30, 2025
Viaarxiv icon

RAISE: Enhancing Scientific Reasoning in LLMs via Step-by-Step Retrieval

Add code
Jun 10, 2025
Viaarxiv icon

ReflAct: World-Grounded Decision Making in LLM Agents via Goal-State Reflection

Add code
May 21, 2025
Viaarxiv icon

Value-Aided Conditional Supervised Learning for Offline RL

Add code
Feb 03, 2024
Viaarxiv icon

Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making

Add code
Oct 06, 2023
Figure 1 for Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making
Figure 2 for Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making
Figure 3 for Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making
Figure 4 for Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making
Viaarxiv icon

LESSON: Learning to Integrate Exploration Strategies for Reinforcement Learning via an Option Framework

Add code
Oct 05, 2023
Viaarxiv icon