Boyi Liu

$\mathbf{(N,K)}$-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model

Mar 11, 2024
Yufeng Zhang, Liyu Chen, Boyi Liu, Yingxiang Yang, Qiwen Cui, Yunzhe Tao, Hongxia Yang

Via arXiv

Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning

Feb 16, 2024
Zihao Li, Boyi Liu, Zhuoran Yang, Zhaoran Wang, Mengdi Wang

Via arXiv

Improving Efficiency of DNN-based Relocalization Module for Autonomous Driving with Server-side Computing

Dec 01, 2023
Dengbo Li, Jieren Cheng, Boyi Liu

Via arXiv

Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms

Oct 30, 2023
Shenao Zhang, Boyi Liu, Zhaoran Wang, Tuo Zhao

Via arXiv

Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency

Oct 11, 2023
Zhihan Liu, Hao Hu, Shenao Zhang, Hongyi Guo, Shuqi Ke, Boyi Liu, Zhaoran Wang

Via arXiv

Let Models Speak Ciphers: Multiagent Debate through Embeddings

Oct 10, 2023
Chau Pham, Boyi Liu, Yingxiang Yang, Zhengyu Chen, Tianyi Liu, Jianbo Yuan, Bryan A. Plummer, Zhaoran Wang, Hongxia Yang

Via arXiv

Differentiable Arbitrating in Zero-sum Markov Games

Feb 20, 2023
Jing Wang, Meichen Song, Feng Gao, Boyi Liu, Zhaoran Wang, Yi Wu

Via arXiv

An Efficient Approach to the Online Multi-Agent Path Finding Problem by Using Sustainable Information

Jan 11, 2023
Mingkai Tang, Boyi Liu, Yuanhang Li, Hongji Liu, Ming Liu, Lujia Wang

Via arXiv

An Analysis of Attention via the Lens of Exchangeability and Latent Variable Models

Dec 30, 2022
Yufeng Zhang, Boyi Liu, Qi Cai, Lingxiao Wang, Zhaoran Wang

Via arXiv