Alert button
Picture for Dan Qiao

Dan Qiao

Alert button

Differentially Private Reinforcement Learning with Self-Play

Add code
Bookmark button
Alert button
Apr 11, 2024
Dan Qiao, Yu-Xiang Wang

Viaarxiv icon

Near-Optimal Reinforcement Learning with Self-Play under Adaptivity Constraints

Add code
Bookmark button
Alert button
Feb 02, 2024
Dan Qiao, Yu-Xiang Wang

Viaarxiv icon

OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model Pre-trained from Scratch

Add code
Bookmark button
Alert button
Oct 01, 2023
Juntao Li, Zecheng Tang, Yuyang Ding, Pinzheng Wang, Pei Guo, Wangjie You, Dan Qiao, Wenliang Chen, Guohong Fu, Qiaoming Zhu, Guodong Zhou, Min Zhang

Figure 1 for OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model Pre-trained from Scratch
Figure 2 for OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model Pre-trained from Scratch
Figure 3 for OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model Pre-trained from Scratch
Figure 4 for OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model Pre-trained from Scratch
Viaarxiv icon

GameEval: Evaluating LLMs on Conversational Games

Add code
Bookmark button
Alert button
Aug 19, 2023
Dan Qiao, Chenfei Wu, Yaobo Liang, Juntao Li, Nan Duan

Figure 1 for GameEval: Evaluating LLMs on Conversational Games
Figure 2 for GameEval: Evaluating LLMs on Conversational Games
Figure 3 for GameEval: Evaluating LLMs on Conversational Games
Figure 4 for GameEval: Evaluating LLMs on Conversational Games
Viaarxiv icon

Semantically Aligned Task Decomposition in Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
May 18, 2023
Wenhao Li, Dan Qiao, Baoxiang Wang, Xiangfeng Wang, Bo Jin, Hongyuan Zha

Figure 1 for Semantically Aligned Task Decomposition in Multi-Agent Reinforcement Learning
Figure 2 for Semantically Aligned Task Decomposition in Multi-Agent Reinforcement Learning
Figure 3 for Semantically Aligned Task Decomposition in Multi-Agent Reinforcement Learning
Figure 4 for Semantically Aligned Task Decomposition in Multi-Agent Reinforcement Learning
Viaarxiv icon

Logarithmic Switching Cost in Reinforcement Learning beyond Linear MDPs

Add code
Bookmark button
Alert button
Feb 24, 2023
Dan Qiao, Ming Yin, Yu-Xiang Wang

Figure 1 for Logarithmic Switching Cost in Reinforcement Learning beyond Linear MDPs
Viaarxiv icon

Near-Optimal Differentially Private Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 09, 2022
Dan Qiao, Yu-Xiang Wang

Figure 1 for Near-Optimal Differentially Private Reinforcement Learning
Viaarxiv icon

SelfMix: Robust Learning Against Textual Label Noise with Self-Mixup Training

Add code
Bookmark button
Alert button
Oct 11, 2022
Dan Qiao, Chenchen Dai, Yuyang Ding, Juntao Li, Qiang Chen, Wenliang Chen, Min Zhang

Figure 1 for SelfMix: Robust Learning Against Textual Label Noise with Self-Mixup Training
Figure 2 for SelfMix: Robust Learning Against Textual Label Noise with Self-Mixup Training
Figure 3 for SelfMix: Robust Learning Against Textual Label Noise with Self-Mixup Training
Figure 4 for SelfMix: Robust Learning Against Textual Label Noise with Self-Mixup Training
Viaarxiv icon

Near-Optimal Deployment Efficiency in Reward-Free Reinforcement Learning with Linear Function Approximation

Add code
Bookmark button
Alert button
Oct 03, 2022
Dan Qiao, Yu-Xiang Wang

Figure 1 for Near-Optimal Deployment Efficiency in Reward-Free Reinforcement Learning with Linear Function Approximation
Figure 2 for Near-Optimal Deployment Efficiency in Reward-Free Reinforcement Learning with Linear Function Approximation
Viaarxiv icon