Alert button
Picture for Randy Jia

Randy Jia

Alert button

Learning an Inventory Control Policy with General Inventory Arrival Dynamics

Add code
Bookmark button
Alert button
Oct 26, 2023
Sohrab Andaz, Carson Eisenach, Dhruv Madeka, Kari Torkkola, Randy Jia, Dean Foster, Sham Kakade

Figure 1 for Learning an Inventory Control Policy with General Inventory Arrival Dynamics
Figure 2 for Learning an Inventory Control Policy with General Inventory Arrival Dynamics
Figure 3 for Learning an Inventory Control Policy with General Inventory Arrival Dynamics
Figure 4 for Learning an Inventory Control Policy with General Inventory Arrival Dynamics
Viaarxiv icon

Contextual Bandits for Evaluating and Improving Inventory Control Policies

Add code
Bookmark button
Alert button
Oct 24, 2023
Dean Foster, Randy Jia, Dhruv Madeka

Figure 1 for Contextual Bandits for Evaluating and Improving Inventory Control Policies
Viaarxiv icon

Linear Reinforcement Learning with Ball Structure Action Space

Add code
Bookmark button
Alert button
Nov 14, 2022
Zeyu Jia, Randy Jia, Dhruv Madeka, Dean P. Foster

Viaarxiv icon

Learning in structured MDPs with convex cost functions: Improved regret bounds for inventory management

Add code
Bookmark button
Alert button
May 10, 2019
Shipra Agrawal, Randy Jia

Figure 1 for Learning in structured MDPs with convex cost functions: Improved regret bounds for inventory management
Viaarxiv icon

Posterior sampling for reinforcement learning: worst-case regret bounds

Add code
Bookmark button
Alert button
May 19, 2017
Shipra Agrawal, Randy Jia

Viaarxiv icon