Shenao Zhang

How Can LLM Guide RL? A Value-Based Approach

Feb 25, 2024
Shenao Zhang, Sirui Zheng, Shuqi Ke, Zhihan Liu, Wanxin Jin, Jianbo Yuan, Yingxiang Yang, Hongxia Yang, Zhaoran Wang

Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms

Oct 30, 2023
Shenao Zhang, Boyi Liu, Zhaoran Wang, Tuo Zhao

Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency

Oct 11, 2023
Zhihan Liu, Hao Hu, Shenao Zhang, Hongyi Guo, Shuqi Ke, Boyi Liu, Zhaoran Wang

One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration

May 29, 2023
Zhihan Liu, Miao Lu, Wei Xiong, Han Zhong, Hao Hu, Shenao Zhang, Sirui Zheng, Zhuoran Yang, Zhaoran Wang

Asking Before Action: Gather Information in Embodied Decision Making with Language Models

May 25, 2023
Xiaoyu Chen, Shenao Zhang, Pushi Zhang, Li Zhao, Jianyu Chen

Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning

Sep 16, 2022
Shenao Zhang

Learning Meta Representations for Agents in Multi-Agent Reinforcement Learning

Aug 30, 2021
Shenao Zhang, Li Shen, Lei Han, Li Shen

Structure-Regularized Attention for Deformable Object Representation

Jun 12, 2021
Shenao Zhang, Li Shen, Zhifeng Li, Wei Liu
