Alert button
Picture for Tianchi Cai

Tianchi Cai

Alert button

ULMA: Unified Language Model Alignment with Demonstration and Point-wise Human Preference

Add code
Bookmark button
Alert button
Dec 05, 2023
Tianchi Cai, Xierui Song, Jiyan Jiang, Fei Teng, Jinjie Gu, Guannan Zhang

Figure 1 for ULMA: Unified Language Model Alignment with Demonstration and Point-wise Human Preference
Figure 2 for ULMA: Unified Language Model Alignment with Demonstration and Point-wise Human Preference
Figure 3 for ULMA: Unified Language Model Alignment with Demonstration and Point-wise Human Preference
Figure 4 for ULMA: Unified Language Model Alignment with Demonstration and Point-wise Human Preference
Viaarxiv icon

Marketing Budget Allocation with Offline Constrained Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Sep 06, 2023
Tianchi Cai, Jiyan Jiang, Wenpeng Zhang, Shiji Zhou, Xierui Song, Li Yu, Lihong Gu, Xiaodong Zeng, Jinjie Gu, Guannan Zhang

Figure 1 for Marketing Budget Allocation with Offline Constrained Deep Reinforcement Learning
Figure 2 for Marketing Budget Allocation with Offline Constrained Deep Reinforcement Learning
Figure 3 for Marketing Budget Allocation with Offline Constrained Deep Reinforcement Learning
Figure 4 for Marketing Budget Allocation with Offline Constrained Deep Reinforcement Learning
Viaarxiv icon

Model-free Reinforcement Learning with Stochastic Reward Stabilization for Recommender Systems

Add code
Bookmark button
Alert button
Aug 25, 2023
Tianchi Cai, Shenliao Bao, Jiyan Jiang, Shiji Zhou, Wenpeng Zhang, Lihong Gu, Jinjie Gu, Guannan Zhang

Figure 1 for Model-free Reinforcement Learning with Stochastic Reward Stabilization for Recommender Systems
Figure 2 for Model-free Reinforcement Learning with Stochastic Reward Stabilization for Recommender Systems
Figure 3 for Model-free Reinforcement Learning with Stochastic Reward Stabilization for Recommender Systems
Figure 4 for Model-free Reinforcement Learning with Stochastic Reward Stabilization for Recommender Systems
Viaarxiv icon

A Policy Efficient Reduction Approach to Convex Constrained Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Aug 29, 2021
Tianchi Cai, Wenpeng Zhang, Lihong Gu, Xiaodong Zeng, Jinjie Gu

Figure 1 for A Policy Efficient Reduction Approach to Convex Constrained Deep Reinforcement Learning
Figure 2 for A Policy Efficient Reduction Approach to Convex Constrained Deep Reinforcement Learning
Figure 3 for A Policy Efficient Reduction Approach to Convex Constrained Deep Reinforcement Learning
Figure 4 for A Policy Efficient Reduction Approach to Convex Constrained Deep Reinforcement Learning
Viaarxiv icon

LinkLouvain: Link-Aware A/B Testing and Its Application on Online Marketing Campaign

Add code
Bookmark button
Alert button
Feb 03, 2021
Tianchi Cai, Daxi Cheng, Chen Liang, Ziqi Liu, Lihong Gu, Huizhi Xie, Zhiqiang Zhang, Xiaodong Zeng, Jinjie Gu

Figure 1 for LinkLouvain: Link-Aware A/B Testing and Its Application on Online Marketing Campaign
Figure 2 for LinkLouvain: Link-Aware A/B Testing and Its Application on Online Marketing Campaign
Figure 3 for LinkLouvain: Link-Aware A/B Testing and Its Application on Online Marketing Campaign
Figure 4 for LinkLouvain: Link-Aware A/B Testing and Its Application on Online Marketing Campaign
Viaarxiv icon