Li Xia

Global Algorithms for Mean-Variance Optimization in Markov Decision Processes

Feb 27, 2023
Li Xia, Shuai Ma

Via arXiv

Risk-Sensitive Markov Decision Processes with Long-Run CVaR Criterion

Oct 17, 2022
Li Xia, Peter W. Glynn

Via arXiv

Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation

Sep 29, 2022
Xiaoteng Ma, Zhipeng Liang, Jose Blanchet, Mingwen Liu, Li Xia, Jiheng Zhang, Qianchuan Zhao, Zhengyuan Zhou

Via arXiv

Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning

Jun 15, 2022
Xiaoteng Ma, Shuai Ma, Li Xia, Qianchuan Zhao

Via arXiv

A unified algorithm framework for mean-variance optimization in discounted Markov decision processes

Jan 15, 2022
Shuai Ma, Xiaoteng Ma, Li Xia

Via arXiv

Average-Reward Reinforcement Learning with Trust Region Methods

Jun 07, 2021
Xiaoteng Ma, Xiaohang Tang, Li Xia, Jun Yang, Qianchuan Zhao

Via arXiv

Risk-Sensitive Markov Decision Processes with Combined Metrics of Mean and Variance

Aug 09, 2020
Li Xia

Via arXiv

Embedding-based Retrieval in Facebook Search

Jul 29, 2020
Jui-Ting Huang, Ashish Sharma, Shuying Sun, Li Xia, David Zhang, Philip Pronin, Janani Padmanabhan, Giuseppe Ottaviano, Linjun Yang

Via arXiv

SOAC: The Soft Option Actor-Critic Architecture

Jun 25, 2020
Chenghao Li, Xiaoteng Ma, Chongjie Zhang, Jun Yang, Li Xia, Qianchuan Zhao

Via arXiv