Alert button
Picture for Ying Wen

Ying Wen

Alert button

Critic-Guided Decision Transformer for Offline Reinforcement Learning

Dec 21, 2023
Yuanfu Wang, Chao Yang, Ying Wen, Yu Liu, Yu Qiao

Viaarxiv icon

Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach

Nov 23, 2023
Bin Zhang, Hangyu Mao, Jingqing Ruan, Ying Wen, Yang Li, Shao Zhang, Zhiwei Xu, Dapeng Li, Ziyue Li, Rui Zhao, Lijuan Li, Guoliang Fan

Viaarxiv icon

Quantifying Zero-shot Coordination Capability with Behavior Preferring Partners

Oct 08, 2023
Xihuai Wang, Shao Zhang, Wenhao Zhang, Wentao Dong, Jingxiao Chen, Ying Wen, Weinan Zhang

Figure 1 for Quantifying Zero-shot Coordination Capability with Behavior Preferring Partners
Figure 2 for Quantifying Zero-shot Coordination Capability with Behavior Preferring Partners
Figure 3 for Quantifying Zero-shot Coordination Capability with Behavior Preferring Partners
Figure 4 for Quantifying Zero-shot Coordination Capability with Behavior Preferring Partners
Viaarxiv icon

GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models

Oct 08, 2023
Hanjing Wang, Man-Kit Sit, Congjie He, Ying Wen, Weinan Zhang, Jun Wang, Yaodong Yang, Luo Mai

Figure 1 for GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models
Figure 2 for GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models
Figure 3 for GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models
Figure 4 for GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models
Viaarxiv icon

Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training

Sep 29, 2023
Xidong Feng, Ziyu Wan, Muning Wen, Ying Wen, Weinan Zhang, Jun Wang

Figure 1 for Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training
Figure 2 for Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training
Figure 3 for Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training
Figure 4 for Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training
Viaarxiv icon

Cross-Utterance Conditioned VAE for Speech Generation

Sep 08, 2023
Yang Li, Cheng Yu, Guangzhi Sun, Weiqin Zu, Zheng Tian, Ying Wen, Wei Pan, Chao Zhang, Jun Wang, Yang Yang, Fanglei Sun

Figure 1 for Cross-Utterance Conditioned VAE for Speech Generation
Figure 2 for Cross-Utterance Conditioned VAE for Speech Generation
Figure 3 for Cross-Utterance Conditioned VAE for Speech Generation
Figure 4 for Cross-Utterance Conditioned VAE for Speech Generation
Viaarxiv icon

Large Sequence Models for Sequential Decision-Making: A Survey

Jun 24, 2023
Muning Wen, Runji Lin, Hanjing Wang, Yaodong Yang, Ying Wen, Luo Mai, Jun Wang, Haifeng Zhang, Weinan Zhang

Figure 1 for Large Sequence Models for Sequential Decision-Making: A Survey
Figure 2 for Large Sequence Models for Sequential Decision-Making: A Survey
Figure 3 for Large Sequence Models for Sequential Decision-Making: A Survey
Figure 4 for Large Sequence Models for Sequential Decision-Making: A Survey
Viaarxiv icon

Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination

Jun 05, 2023
Yang Li, Shao Zhang, Jichen Sun, Wenhao Zhang, Yali Du, Ying Wen, Xinbing Wang, Wei Pan

Figure 1 for Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination
Figure 2 for Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination
Figure 3 for Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination
Figure 4 for Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination
Viaarxiv icon

Order Matters: Agent-by-agent Policy Optimization

Feb 26, 2023
Xihuai Wang, Zheng Tian, Ziyu Wan, Ying Wen, Jun Wang, Weinan Zhang

Figure 1 for Order Matters: Agent-by-agent Policy Optimization
Figure 2 for Order Matters: Agent-by-agent Policy Optimization
Figure 3 for Order Matters: Agent-by-agent Policy Optimization
Figure 4 for Order Matters: Agent-by-agent Policy Optimization
Viaarxiv icon