Alert button
Picture for Sirui Zheng

Sirui Zheng

Alert button

How Can LLM Guide RL? A Value-Based Approach

Add code
Bookmark button
Alert button
Feb 25, 2024
Shenao Zhang, Sirui Zheng, Shuqi Ke, Zhihan Liu, Wanxin Jin, Jianbo Yuan, Yingxiang Yang, Hongxia Yang, Zhaoran Wang

Viaarxiv icon

One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration

Add code
Bookmark button
Alert button
May 29, 2023
Zhihan Liu, Miao Lu, Wei Xiong, Han Zhong, Hao Hu, Shenao Zhang, Sirui Zheng, Zhuoran Yang, Zhaoran Wang

Figure 1 for One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration
Figure 2 for One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration
Figure 3 for One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration
Figure 4 for One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration
Viaarxiv icon

A Posterior Sampling Framework for Interactive Decision Making

Add code
Bookmark button
Alert button
Nov 03, 2022
Han Zhong, Wei Xiong, Sirui Zheng, Liwei Wang, Zhaoran Wang, Zhuoran Yang, Tong Zhang

Figure 1 for A Posterior Sampling Framework for Interactive Decision Making
Figure 2 for A Posterior Sampling Framework for Interactive Decision Making
Viaarxiv icon