Picture for Junghyun Lee

Junghyun Lee

Regularized Online RLHF with Generalized Bilinear Preferences

Add code
Feb 26, 2026
Viaarxiv icon

A Jointly Efficient and Optimal Algorithm for Heteroskedastic Generalized Linear Bandits with Adversarial Corruptions

Add code
Feb 11, 2026
Viaarxiv icon

Learning to Reason in LLMs by Expectation Maximization

Add code
Dec 23, 2025
Figure 1 for Learning to Reason in LLMs by Expectation Maximization
Figure 2 for Learning to Reason in LLMs by Expectation Maximization
Figure 3 for Learning to Reason in LLMs by Expectation Maximization
Figure 4 for Learning to Reason in LLMs by Expectation Maximization
Viaarxiv icon

AdaSTaR: Adaptive Data Sampling for Training Self-Taught Reasoners

Add code
May 22, 2025
Viaarxiv icon

Probability-Flow ODE in Infinite-Dimensional Function Spaces

Add code
Mar 13, 2025
Viaarxiv icon

FlickerFusion: Intra-trajectory Domain Generalizing Multi-Agent RL

Add code
Oct 21, 2024
Figure 1 for FlickerFusion: Intra-trajectory Domain Generalizing Multi-Agent RL
Figure 2 for FlickerFusion: Intra-trajectory Domain Generalizing Multi-Agent RL
Figure 3 for FlickerFusion: Intra-trajectory Domain Generalizing Multi-Agent RL
Figure 4 for FlickerFusion: Intra-trajectory Domain Generalizing Multi-Agent RL
Viaarxiv icon

A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits

Add code
Jul 19, 2024
Viaarxiv icon

Querying Easily Flip-flopped Samples for Deep Active Learning

Add code
Jan 18, 2024
Figure 1 for Querying Easily Flip-flopped Samples for Deep Active Learning
Figure 2 for Querying Easily Flip-flopped Samples for Deep Active Learning
Figure 3 for Querying Easily Flip-flopped Samples for Deep Active Learning
Figure 4 for Querying Easily Flip-flopped Samples for Deep Active Learning
Viaarxiv icon

Large Catapults in Momentum Gradient Descent with Warmup: An Empirical Study

Add code
Nov 25, 2023
Figure 1 for Large Catapults in Momentum Gradient Descent with Warmup: An Empirical Study
Figure 2 for Large Catapults in Momentum Gradient Descent with Warmup: An Empirical Study
Figure 3 for Large Catapults in Momentum Gradient Descent with Warmup: An Empirical Study
Figure 4 for Large Catapults in Momentum Gradient Descent with Warmup: An Empirical Study
Viaarxiv icon

Fair Streaming Principal Component Analysis: Statistical and Algorithmic Viewpoint

Add code
Oct 28, 2023
Viaarxiv icon