Alert button
Picture for Bo Tang

Bo Tang

Alert button

Long and Short-Term Constraints Driven Safe Reinforcement Learning for Autonomous Driving

Add code
Bookmark button
Alert button
Mar 27, 2024
Xuemin Hu, Pan Chen, Yijun Wen, Bo Tang, Long Chen

Viaarxiv icon

Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model with Proxy

Add code
Bookmark button
Alert button
Mar 07, 2024
Yu Zhu, Chuxiong Sun, Wenfei Yang, Wenqiang Wei, Bo Tang, Tianzhu Zhang, Zhiyu Li, Shifeng Zhang, Feiyu Xiong, Jie Hu, Mingchuan yang

Figure 1 for Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model with Proxy
Figure 2 for Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model with Proxy
Figure 3 for Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model with Proxy
Figure 4 for Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model with Proxy
Viaarxiv icon

NewsBench: Systematic Evaluation of LLMs for Writing Proficiency and Safety Adherence in Chinese Journalistic Editorial Applications

Add code
Bookmark button
Alert button
Feb 29, 2024
Miao Li, Ming-Bin Chen, Bo Tang, Shengbin Hou, Pengyu Wang, Haiying Deng, Zhiyu Li, Feiyu Xiong, Keming Mao, Peng Cheng, Yi Luo

Figure 1 for NewsBench: Systematic Evaluation of LLMs for Writing Proficiency and Safety Adherence in Chinese Journalistic Editorial Applications
Figure 2 for NewsBench: Systematic Evaluation of LLMs for Writing Proficiency and Safety Adherence in Chinese Journalistic Editorial Applications
Figure 3 for NewsBench: Systematic Evaluation of LLMs for Writing Proficiency and Safety Adherence in Chinese Journalistic Editorial Applications
Figure 4 for NewsBench: Systematic Evaluation of LLMs for Writing Proficiency and Safety Adherence in Chinese Journalistic Editorial Applications
Viaarxiv icon

Debiasing Recommendation with Personal Popularity

Add code
Bookmark button
Alert button
Feb 21, 2024
Wentao Ning, Reynold Cheng, Xiao Yan, Ben Kao, Nan Huo, Nur AI Hasan Haldar, Bo Tang

Viaarxiv icon

Controlled Text Generation for Large Language Model with Dynamic Attribute Graphs

Add code
Bookmark button
Alert button
Feb 17, 2024
Xun Liang, Hanyu Wang, Shichao Song, Mengting Hu, Xunzhi Wang, Zhiyu Li, Feiyu Xiong, Bo Tang

Viaarxiv icon

CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models

Add code
Bookmark button
Alert button
Jan 30, 2024
Yuanjie Lyu, Zhiyu Li, Simin Niu, Feiyu Xiong, Bo Tang, Wenjin Wang, Hao Wu, Huanyong Liu, Tong Xu, Enhong Chen

Viaarxiv icon

Off-Policy Primal-Dual Safe Reinforcement Learning

Add code
Bookmark button
Alert button
Jan 26, 2024
Zifan Wu, Bo Tang, Qian Lin, Chao Yu, Shangqin Mao, Qianlong Xie, Xingxing Wang, Dong Wang

Viaarxiv icon

Grimoire is All You Need for Enhancing Large Language Models

Add code
Bookmark button
Alert button
Jan 10, 2024
Ding Chen, Shichao Song, Qingchen Yu, Zhiyu Li, Wenjin Wang, Feiyu Xiong, Bo Tang

Viaarxiv icon

HiBid: A Cross-Channel Constrained Bidding System with Budget Allocation by Hierarchical Offline Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 29, 2023
Hao Wang, Bo Tang, Chi Harold Liu, Shangqin Mao, Jiahong Zhou, Zipeng Dai, Yaqi Sun, Qianlong Xie, Xingxing Wang, Dong Wang

Viaarxiv icon