Jiheng Zhang

OR-R1: Automating Modeling and Solving of Operations Research Optimization Problem via Test-Time Reinforcement Learning

Nov 12, 2025
Via arXiv

Make Optimization Once and for All with Fine-grained Guidance

Mar 14, 2025
Via arXiv

Parameter-Adaptive Dynamic Pricing

Mar 02, 2025
Via arXiv

Minimax Optimality in Contextual Dynamic Pricing with General Valuation Models

Jun 24, 2024
Via arXiv

RL in Markov Games with Independent Function Approximation: Improved Sample Complexity Bound under the Local Access Model

Mar 20, 2024
Via arXiv

Stochastic Graph Bandit Learning with Side-Observations

Aug 29, 2023
Via arXiv

Provably Efficient Learning in Partially Observable Contextual Bandit

Aug 07, 2023
Via arXiv

Debiasing Recommendation by Learning Identifiable Latent Confounders

Feb 10, 2023
Via arXiv

Single-Trajectory Distributionally Robust Reinforcement Learning

Jan 27, 2023
Via arXiv

Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation

Sep 29, 2022
Via arXiv