Alert button
Picture for Zhaoran Wang

Zhaoran Wang

Alert button

Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms

Add code
Bookmark button
Alert button
Oct 30, 2023
Shenao Zhang, Boyi Liu, Zhaoran Wang, Tuo Zhao

Viaarxiv icon

Posterior Sampling for Competitive RL: Function Approximation and Partial Observation

Add code
Bookmark button
Alert button
Oct 30, 2023
Shuang Qiu, Ziyu Dai, Han Zhong, Zhaoran Wang, Zhuoran Yang, Tong Zhang

Viaarxiv icon

Learning Regularized Graphon Mean-Field Games with Unknown Graphons

Add code
Bookmark button
Alert button
Oct 26, 2023
Fengzhuo Zhang, Vincent Y. F. Tan, Zhaoran Wang, Zhuoran Yang

Viaarxiv icon

Learning Regularized Monotone Graphon Mean-Field Games

Add code
Bookmark button
Alert button
Oct 12, 2023
Fengzhuo Zhang, Vincent Y. F. Tan, Zhaoran Wang, Zhuoran Yang

Figure 1 for Learning Regularized Monotone Graphon Mean-Field Games
Figure 2 for Learning Regularized Monotone Graphon Mean-Field Games
Viaarxiv icon

Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency

Add code
Bookmark button
Alert button
Oct 11, 2023
Zhihan Liu, Hao Hu, Shenao Zhang, Hongyi Guo, Shuqi Ke, Boyi Liu, Zhaoran Wang

Figure 1 for Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency
Figure 2 for Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency
Figure 3 for Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency
Figure 4 for Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency
Viaarxiv icon

Let Models Speak Ciphers: Multiagent Debate through Embeddings

Add code
Bookmark button
Alert button
Oct 10, 2023
Chau Pham, Boyi Liu, Yingxiang Yang, Zhengyu Chen, Tianyi Liu, Jianbo Yuan, Bryan A. Plummer, Zhaoran Wang, Hongxia Yang

Figure 1 for Let Models Speak Ciphers: Multiagent Debate through Embeddings
Figure 2 for Let Models Speak Ciphers: Multiagent Debate through Embeddings
Figure 3 for Let Models Speak Ciphers: Multiagent Debate through Embeddings
Figure 4 for Let Models Speak Ciphers: Multiagent Debate through Embeddings
Viaarxiv icon

Sample-Efficient Multi-Agent RL: An Optimization Perspective

Add code
Bookmark button
Alert button
Oct 10, 2023
Nuoya Xiong, Zhihan Liu, Zhaoran Wang, Zhuoran Yang

Viaarxiv icon

Contextual Dynamic Pricing with Strategic Buyers

Add code
Bookmark button
Alert button
Jul 08, 2023
Pangpang Liu, Zhuoran Yang, Zhaoran Wang, Will Wei Sun

Figure 1 for Contextual Dynamic Pricing with Strategic Buyers
Figure 2 for Contextual Dynamic Pricing with Strategic Buyers
Figure 3 for Contextual Dynamic Pricing with Strategic Buyers
Figure 4 for Contextual Dynamic Pricing with Strategic Buyers
Viaarxiv icon

A General Framework for Sequential Decision-Making under Adaptivity Constraints

Add code
Bookmark button
Alert button
Jun 27, 2023
Nuoya Xiong, Zhaoran Wang, Zhuoran Yang

Viaarxiv icon