Alert button
Picture for Xidong Feng

Xidong Feng

Alert button

Natural Language Reinforcement Learning

Feb 14, 2024
Xidong Feng, Ziyu Wan, Mengyue Yang, Ziyan Wang, Girish A. Koushik, Yali Du, Ying Wen, Jun Wang

Viaarxiv icon

Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models

Feb 05, 2024
Zhiyuan Hu, Chumin Liu, Xidong Feng, Yilun Zhao, See-Kiong Ng, Anh Tuan Luu, Junxian He, Pang Wei Koh, Bryan Hooi

Viaarxiv icon

Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning

Dec 22, 2023
Filippos Christianos, Georgios Papoudakis, Matthieu Zimmer, Thomas Coste, Zhihao Wu, Jingxuan Chen, Khyati Khandelwal, James Doran, Xidong Feng, Jiacheng Liu, Zheng Xiong, Yicheng Luo, Jianye Hao, Kun Shao, Haitham Bou-Ammar, Jun Wang

Viaarxiv icon

Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training

Sep 29, 2023
Xidong Feng, Ziyu Wan, Muning Wen, Ying Wen, Weinan Zhang, Jun Wang

Figure 1 for Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training
Figure 2 for Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training
Figure 3 for Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training
Figure 4 for Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training
Viaarxiv icon

ChessGPT: Bridging Policy Learning and Language Modeling

Jun 15, 2023
Xidong Feng, Yicheng Luo, Ziyan Wang, Hongrui Tang, Mengyue Yang, Kun Shao, David Mguni, Yali Du, Jun Wang

Viaarxiv icon

Contextual Transformer for Offline Meta Reinforcement Learning

Nov 15, 2022
Runji Lin, Ye Li, Xidong Feng, Zhaowei Zhang, Xian Hong Wu Fung, Haifeng Zhang, Jun Wang, Yali Du, Yaodong Yang

Figure 1 for Contextual Transformer for Offline Meta Reinforcement Learning
Figure 2 for Contextual Transformer for Offline Meta Reinforcement Learning
Figure 3 for Contextual Transformer for Offline Meta Reinforcement Learning
Figure 4 for Contextual Transformer for Offline Meta Reinforcement Learning
Viaarxiv icon

TorchOpt: An Efficient Library for Differentiable Optimization

Nov 13, 2022
Jie Ren, Xidong Feng, Bo Liu, Xuehai Pan, Yao Fu, Luo Mai, Yaodong Yang

Figure 1 for TorchOpt: An Efficient Library for Differentiable Optimization
Figure 2 for TorchOpt: An Efficient Library for Differentiable Optimization
Figure 3 for TorchOpt: An Efficient Library for Differentiable Optimization
Figure 4 for TorchOpt: An Efficient Library for Differentiable Optimization
Viaarxiv icon

Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL

Aug 02, 2022
Jakub Grudzien Kuba, Xidong Feng, Shiyao Ding, Hao Dong, Jun Wang, Yaodong Yang

Figure 1 for Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
Figure 2 for Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
Figure 3 for Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
Figure 4 for Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
Viaarxiv icon

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning

Jun 17, 2022
Yuanpei Chen, Yaodong Yang, Tianhao Wu, Shengjie Wang, Xidong Feng, Jiechuang Jiang, Stephen Marcus McAleer, Hao Dong, Zongqing Lu, Song-Chun Zhu

Figure 1 for Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Figure 2 for Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Figure 3 for Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Figure 4 for Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Viaarxiv icon