Alert button
Picture for Yanwei Jia

Yanwei Jia

Alert button

Continuous-time Risk-sensitive Reinforcement Learning via Quadratic Variation Penalty

Add code
Bookmark button
Alert button
Apr 19, 2024
Yanwei Jia

Viaarxiv icon

Learning Merton's Strategies in an Incomplete Market: Recursive Entropy Regularization and Biased Gaussian Exploration

Add code
Bookmark button
Alert button
Dec 19, 2023
Min Dai, Yuchao Dong, Yanwei Jia, Xun Yu Zhou

Viaarxiv icon

q-Learning in Continuous Time

Add code
Bookmark button
Alert button
Jul 02, 2022
Yanwei Jia, Xun Yu Zhou

Figure 1 for q-Learning in Continuous Time
Figure 2 for q-Learning in Continuous Time
Figure 3 for q-Learning in Continuous Time
Figure 4 for q-Learning in Continuous Time
Viaarxiv icon

Policy Gradient and Actor-Critic Learning in Continuous Time and Space: Theory and Algorithms

Add code
Bookmark button
Alert button
Nov 22, 2021
Yanwei Jia, Xun Yu Zhou

Figure 1 for Policy Gradient and Actor-Critic Learning in Continuous Time and Space: Theory and Algorithms
Figure 2 for Policy Gradient and Actor-Critic Learning in Continuous Time and Space: Theory and Algorithms
Viaarxiv icon

Policy Evaluation and Temporal-Difference Learning in Continuous Time and Space: A Martingale Approach

Add code
Bookmark button
Alert button
Aug 15, 2021
Yanwei Jia, Xun Yu Zhou

Figure 1 for Policy Evaluation and Temporal-Difference Learning in Continuous Time and Space: A Martingale Approach
Figure 2 for Policy Evaluation and Temporal-Difference Learning in Continuous Time and Space: A Martingale Approach
Figure 3 for Policy Evaluation and Temporal-Difference Learning in Continuous Time and Space: A Martingale Approach
Figure 4 for Policy Evaluation and Temporal-Difference Learning in Continuous Time and Space: A Martingale Approach
Viaarxiv icon