Jing-Cheng Pang

Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts

Apr 14, 2024
Jing-Cheng Pang, Si-Hang Yang, Kaiyuan Li, Jiaji Zhang, Xiong-Hui Chen, Nan Tang, Yang Yu

Empowering Language Models with Active Inquiry for Deeper Understanding

Feb 06, 2024
Jing-Cheng Pang, Heng-Bo Fan, Pengyuan Wang, Jia-Hao Xiao, Nan Tang, Si-Hang Yang, Chengxing Jia, Sheng-Jun Huang, Yang Yu

Language Model Self-improvement by Reinforcement Learning Contemplation

May 23, 2023
Jing-Cheng Pang, Pengyuan Wang, Kaiyuan Li, Xiong-Hui Chen, Jiacheng Xu, Zongzhang Zhang, Yang Yu

Natural Language-conditioned Reinforcement Learning with Inside-out Task Language Development and Translation

Feb 18, 2023
Jing-Cheng Pang, Xin-Yu Yang, Si-Hang Yang, Yang Yu

Regret Minimization Experience Replay

Jun 06, 2021
Zhenghai Xue, Xu-Hui Liu, Jing-Cheng Pang, Shengyi Jiang, Feng Xu, Yang Yu

Sparsity Prior Regularized Q-learning for Sparse Action Tasks

May 19, 2021
Jing-Cheng Pang, Tian Xu, Sheng-Yi Jiang, Yu-Ren Liu, Yang Yu

Improving Fictitious Play Reinforcement Learning with Expanding Models

Nov 28, 2019
Rong-Jun Qin, Jing-Cheng Pang, Yang Yu
