Alert button
Picture for Xiao-Yue Gong

Xiao-Yue Gong

Alert button

Asymptotically Optimal Pure Exploration for Infinite-Armed Bandits

Add code
Bookmark button
Alert button
Jun 03, 2023
Xiao-Yue Gong, Mark Sellke

Viaarxiv icon

Provably More Efficient Q-Learning in the Full-Feedback/One-Sided-Feedback Settings

Add code
Bookmark button
Alert button
Jun 30, 2020
Xiao-Yue Gong, David Simchi-Levi

Figure 1 for Provably More Efficient Q-Learning in the Full-Feedback/One-Sided-Feedback Settings
Figure 2 for Provably More Efficient Q-Learning in the Full-Feedback/One-Sided-Feedback Settings
Figure 3 for Provably More Efficient Q-Learning in the Full-Feedback/One-Sided-Feedback Settings
Figure 4 for Provably More Efficient Q-Learning in the Full-Feedback/One-Sided-Feedback Settings
Viaarxiv icon

Efficient Entropy for Policy Gradient with Multidimensional Action Space

Add code
Bookmark button
Alert button
Jun 02, 2018
Yiming Zhang, Quan Ho Vuong, Kenny Song, Xiao-Yue Gong, Keith W. Ross

Figure 1 for Efficient Entropy for Policy Gradient with Multidimensional Action Space
Figure 2 for Efficient Entropy for Policy Gradient with Multidimensional Action Space
Figure 3 for Efficient Entropy for Policy Gradient with Multidimensional Action Space
Figure 4 for Efficient Entropy for Policy Gradient with Multidimensional Action Space
Viaarxiv icon