Miao Lu

Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm

Apr 04, 2024
Miao Lu, Han Zhong, Tong Zhang, Jose Blanchet

Benign Oscillation of Stochastic Gradient Descent with Large Learning Rates

Oct 26, 2023
Miao Lu, Beining Wu, Xiaodong Yang, Difan Zou

One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration

May 29, 2023
Zhihan Liu, Miao Lu, Wei Xiong, Han Zhong, Hao Hu, Shenao Zhang, Sirui Zheng, Zhuoran Yang, Zhaoran Wang

Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage

May 16, 2023
Jose Blanchet, Miao Lu, Tong Zhang, Han Zhong

Robust Consensus Clustering and its Applications for Advertising Forecasting

Dec 27, 2022
Deguang Kong, Miao Lu, Konstantin Shmakov, Jian Yang

Video Background Music Generation: Dataset, Method and Evaluation

Nov 21, 2022
Le Zhuo, Zhaokai Wang, Baisen Wang, Yue Liao, Stanley Peng, Chenxi Bao, Miao Lu, Xiaobo Li, Si Liu

Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach

Sep 12, 2022
Miao Lu, Wenhao Yang, Liangyu Zhang, Zhihua Zhang

Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes

May 26, 2022
Miao Lu, Yifei Min, Zhaoran Wang, Zhuoran Yang

GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection

Apr 14, 2022
Yue Liao, Aixi Zhang, Miao Lu, Yongliang Wang, Xiaobo Li, Si Liu
