Alert button
Picture for Weichao Mao

Weichao Mao

Alert button

Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction

Add code
Bookmark button
Alert button
Apr 12, 2024
Haoran Qiu, Weichao Mao, Archit Patke, Shengkun Cui, Saurabh Jha, Chen Wang, Hubertus Franke, Zbigniew T. Kalbarczyk, Tamer Başar, Ravishankar K. Iyer

Viaarxiv icon

Decision Transformer as a Foundation Model for Partially Observable Continuous Control

Add code
Bookmark button
Alert button
Apr 03, 2024
Xiangyuan Zhang, Weichao Mao, Haoran Qiu, Tamer Başar

Viaarxiv icon

Controlgym: Large-Scale Safety-Critical Control Environments for Benchmarking Reinforcement Learning Algorithms

Add code
Bookmark button
Alert button
Nov 30, 2023
Xiangyuan Zhang, Weichao Mao, Saviz Mowlavi, Mouhacine Benosman, Tamer Başar

Figure 1 for Controlgym: Large-Scale Safety-Critical Control Environments for Benchmarking Reinforcement Learning Algorithms
Figure 2 for Controlgym: Large-Scale Safety-Critical Control Environments for Benchmarking Reinforcement Learning Algorithms
Figure 3 for Controlgym: Large-Scale Safety-Critical Control Environments for Benchmarking Reinforcement Learning Algorithms
Figure 4 for Controlgym: Large-Scale Safety-Critical Control Environments for Benchmarking Reinforcement Learning Algorithms
Viaarxiv icon

Provably Efficient Reinforcement Learning in Decentralized General-Sum Markov Games

Add code
Bookmark button
Alert button
Oct 21, 2021
Weichao Mao, Tamer Başar

Viaarxiv icon

Decentralized Cooperative Multi-Agent Reinforcement Learning with Exploration

Add code
Bookmark button
Alert button
Oct 12, 2021
Weichao Mao, Tamer Başar, Lin F. Yang, Kaiqing Zhang

Figure 1 for Decentralized Cooperative Multi-Agent Reinforcement Learning with Exploration
Figure 2 for Decentralized Cooperative Multi-Agent Reinforcement Learning with Exploration
Figure 3 for Decentralized Cooperative Multi-Agent Reinforcement Learning with Exploration
Figure 4 for Decentralized Cooperative Multi-Agent Reinforcement Learning with Exploration
Viaarxiv icon

Near-Optimal Regret Bounds for Model-Free RL in Non-Stationary Episodic MDPs

Add code
Bookmark button
Alert button
Oct 07, 2020
Weichao Mao, Kaiqing Zhang, Ruihao Zhu, David Simchi-Levi, Tamer Başar

Figure 1 for Near-Optimal Regret Bounds for Model-Free RL in Non-Stationary Episodic MDPs
Figure 2 for Near-Optimal Regret Bounds for Model-Free RL in Non-Stationary Episodic MDPs
Figure 3 for Near-Optimal Regret Bounds for Model-Free RL in Non-Stationary Episodic MDPs
Viaarxiv icon

POLY-HOOT: Monte-Carlo Planning in Continuous Space MDPs with Non-Asymptotic Analysis

Add code
Bookmark button
Alert button
Jun 08, 2020
Weichao Mao, Kaiqing Zhang, Qiaomin Xie, Tamer Başar

Figure 1 for POLY-HOOT: Monte-Carlo Planning in Continuous Space MDPs with Non-Asymptotic Analysis
Figure 2 for POLY-HOOT: Monte-Carlo Planning in Continuous Space MDPs with Non-Asymptotic Analysis
Viaarxiv icon

Information State Embedding in Partially Observable Cooperative Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 18, 2020
Weichao Mao, Kaiqing Zhang, Erik Miehling, Tamer Başar

Figure 1 for Information State Embedding in Partially Observable Cooperative Multi-Agent Reinforcement Learning
Figure 2 for Information State Embedding in Partially Observable Cooperative Multi-Agent Reinforcement Learning
Figure 3 for Information State Embedding in Partially Observable Cooperative Multi-Agent Reinforcement Learning
Viaarxiv icon