Min-hwan Oh

Seoul National University

EUGens: Efficient, Unified, and General Dense Layers

Jan 30, 2026
Via arXiv

Convergence of Muon with Newton-Schulz

Jan 27, 2026
Via arXiv

Tractable Multinomial Logit Contextual Bandits with Non-Linear Utilities

Jan 11, 2026
Via arXiv

Infrequent Exploration in Linear Bandits

Oct 29, 2025
Via arXiv

Batched Stochastic Matching Bandits

Sep 04, 2025
Via arXiv

AI Should Sense Better, Not Just Scale Bigger: Adaptive Sensing as a Paradigm Shift

Jul 10, 2025
Via arXiv

Experimental Design for Semiparametric Bandits

Jun 16, 2025
Via arXiv

Dynamic Assortment Selection and Pricing with Censored Preference Feedback

Apr 03, 2025
Via arXiv

Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning

Mar 07, 2025
Via arXiv

Linear Bandits with Partially Observable Features

Feb 10, 2025
Via arXiv