Picture for Min-hwan Oh

Min-hwan Oh

Seoul National University

Blessings of Multiple Good Arms in Multi-Objective Linear Bandits

Add code
Feb 13, 2026
Viaarxiv icon

EUGens: Efficient, Unified, and General Dense Layers

Add code
Jan 30, 2026
Viaarxiv icon

Convergence of Muon with Newton-Schulz

Add code
Jan 27, 2026
Viaarxiv icon

Tractable Multinomial Logit Contextual Bandits with Non-Linear Utilities

Add code
Jan 11, 2026
Viaarxiv icon

Infrequent Exploration in Linear Bandits

Add code
Oct 29, 2025
Figure 1 for Infrequent Exploration in Linear Bandits
Figure 2 for Infrequent Exploration in Linear Bandits
Figure 3 for Infrequent Exploration in Linear Bandits
Figure 4 for Infrequent Exploration in Linear Bandits
Viaarxiv icon

Batched Stochastic Matching Bandits

Add code
Sep 04, 2025
Viaarxiv icon

AI Should Sense Better, Not Just Scale Bigger: Adaptive Sensing as a Paradigm Shift

Add code
Jul 10, 2025
Viaarxiv icon

Experimental Design for Semiparametric Bandits

Add code
Jun 16, 2025
Figure 1 for Experimental Design for Semiparametric Bandits
Figure 2 for Experimental Design for Semiparametric Bandits
Figure 3 for Experimental Design for Semiparametric Bandits
Figure 4 for Experimental Design for Semiparametric Bandits
Viaarxiv icon

Dynamic Assortment Selection and Pricing with Censored Preference Feedback

Add code
Apr 03, 2025
Figure 1 for Dynamic Assortment Selection and Pricing with Censored Preference Feedback
Figure 2 for Dynamic Assortment Selection and Pricing with Censored Preference Feedback
Figure 3 for Dynamic Assortment Selection and Pricing with Censored Preference Feedback
Viaarxiv icon

Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning

Add code
Mar 07, 2025
Figure 1 for Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
Figure 2 for Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
Figure 3 for Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
Figure 4 for Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
Viaarxiv icon