Alert button
Picture for Jiechao Xiong

Jiechao Xiong

Alert button

TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 30, 2020
Peng Sun, Jiechao Xiong, Lei Han, Xinghai Sun, Shuxing Li, Jiawei Xu, Meng Fang, Zhengyou Zhang

Figure 1 for TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning
Figure 2 for TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning
Figure 3 for TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning
Figure 4 for TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning
Viaarxiv icon

TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full Game

Add code
Bookmark button
Alert button
Nov 27, 2020
Lei Han, Jiechao Xiong, Peng Sun, Xinghai Sun, Meng Fang, Qingwei Guo, Qiaobo Chen, Tengfei Shi, Hongsheng Yu, Zhengyou Zhang

Figure 1 for TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full Game
Figure 2 for TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full Game
Figure 3 for TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full Game
Figure 4 for TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full Game
Viaarxiv icon

Zeroth-Order Supervised Policy Improvement

Add code
Bookmark button
Alert button
Jun 11, 2020
Hao Sun, Ziping Xu, Yuhang Song, Meng Fang, Jiechao Xiong, Bo Dai, Zhengyou Zhang, Bolei Zhou

Figure 1 for Zeroth-Order Supervised Policy Improvement
Figure 2 for Zeroth-Order Supervised Policy Improvement
Figure 3 for Zeroth-Order Supervised Policy Improvement
Figure 4 for Zeroth-Order Supervised Policy Improvement
Viaarxiv icon

Arena: a toolkit for Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 20, 2019
Qing Wang, Jiechao Xiong, Lei Han, Meng Fang, Xinghai Sun, Zhuobin Zheng, Peng Sun, Zhengyou Zhang

Figure 1 for Arena: a toolkit for Multi-Agent Reinforcement Learning
Figure 2 for Arena: a toolkit for Multi-Agent Reinforcement Learning
Figure 3 for Arena: a toolkit for Multi-Agent Reinforcement Learning
Figure 4 for Arena: a toolkit for Multi-Agent Reinforcement Learning
Viaarxiv icon

TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game

Add code
Bookmark button
Alert button
Nov 02, 2018
Peng Sun, Xinghai Sun, Lei Han, Jiechao Xiong, Qing Wang, Bo Li, Yang Zheng, Ji Liu, Yongsheng Liu, Han Liu, Tong Zhang

Figure 1 for TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game
Figure 2 for TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game
Figure 3 for TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game
Figure 4 for TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game
Viaarxiv icon

Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space

Add code
Bookmark button
Alert button
Oct 10, 2018
Jiechao Xiong, Qing Wang, Zhuoran Yang, Peng Sun, Lei Han, Yang Zheng, Haobo Fu, Tong Zhang, Ji Liu, Han Liu

Figure 1 for Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space
Figure 2 for Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space
Figure 3 for Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space
Figure 4 for Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space
Viaarxiv icon

A Margin-based MLE for Crowdsourced Partial Ranking

Add code
Bookmark button
Alert button
Jul 29, 2018
Qianqian Xu, Jiechao Xiong, Xinwei Sun, Zhiyong Yang, Xiaochun Cao, Qingming Huang, Yuan Yao

Figure 1 for A Margin-based MLE for Crowdsourced Partial Ranking
Figure 2 for A Margin-based MLE for Crowdsourced Partial Ranking
Figure 3 for A Margin-based MLE for Crowdsourced Partial Ranking
Figure 4 for A Margin-based MLE for Crowdsourced Partial Ranking
Viaarxiv icon

From Social to Individuals: a Parsimonious Path of Multi-level Models for Crowdsourced Preference Aggregation

Add code
Bookmark button
Alert button
Mar 08, 2018
Qianqian Xu, Jiechao Xiong, Xiaochun Cao, Qingming Huang, Yuan Yao

Figure 1 for From Social to Individuals: a Parsimonious Path of Multi-level Models for Crowdsourced Preference Aggregation
Figure 2 for From Social to Individuals: a Parsimonious Path of Multi-level Models for Crowdsourced Preference Aggregation
Figure 3 for From Social to Individuals: a Parsimonious Path of Multi-level Models for Crowdsourced Preference Aggregation
Figure 4 for From Social to Individuals: a Parsimonious Path of Multi-level Models for Crowdsourced Preference Aggregation
Viaarxiv icon

Stochastic Non-convex Ordinal Embedding with Stabilized Barzilai-Borwein Step Size

Add code
Bookmark button
Alert button
Jan 31, 2018
Ke Ma, Jinshan Zeng, Jiechao Xiong, Qianqian Xu, Xiaochun Cao, Wei Liu, Yuan Yao

Figure 1 for Stochastic Non-convex Ordinal Embedding with Stabilized Barzilai-Borwein Step Size
Figure 2 for Stochastic Non-convex Ordinal Embedding with Stabilized Barzilai-Borwein Step Size
Figure 3 for Stochastic Non-convex Ordinal Embedding with Stabilized Barzilai-Borwein Step Size
Figure 4 for Stochastic Non-convex Ordinal Embedding with Stabilized Barzilai-Borwein Step Size
Viaarxiv icon

HodgeRank with Information Maximization for Crowdsourced Pairwise Ranking Aggregation

Add code
Bookmark button
Alert button
Nov 16, 2017
Qianqian Xu, Jiechao Xiong, Xi Chen, Qingming Huang, Yuan Yao

Figure 1 for HodgeRank with Information Maximization for Crowdsourced Pairwise Ranking Aggregation
Figure 2 for HodgeRank with Information Maximization for Crowdsourced Pairwise Ranking Aggregation
Figure 3 for HodgeRank with Information Maximization for Crowdsourced Pairwise Ranking Aggregation
Figure 4 for HodgeRank with Information Maximization for Crowdsourced Pairwise Ranking Aggregation
Viaarxiv icon