Linfeng Song

The Trickle-down Impact of Reward (In-)consistency on RLHF

Sep 28, 2023
Lingfeng Shen, Sihao Chen, Linfeng Song, Lifeng Jin, Baolin Peng, Haitao Mi, Daniel Khashabi, Dong Yu

Stabilizing RLHF through Advantage Model and Selective Rehearsal

Sep 18, 2023
Baolin Peng, Linfeng Song, Ye Tian, Lifeng Jin, Haitao Mi, Dong Yu

Discrete Conditional Diffusion for Reranking in Recommendation

Aug 14, 2023
Xiao Lin, Xiaokai Chen, Chenyang Wang, Hantao Shu, Linfeng Song, Biao Li, Peng Jiang

Tree based Progressive Regression Model for Watch-Time Prediction in Short-video Recommendation

Jun 06, 2023
Xiao Lin, Xiaokai Chen, Linfeng Song, Jingwei Liu, Biao Li, Peng Jiang

A Survey on Zero Pronoun Translation

May 17, 2023
Longyue Wang, Siyou Liu, Mingzhou Xu, Linfeng Song, Shuming Shi, Zhaopeng Tu

Search-Engine-augmented Dialogue Response Generation with Cheaply Supervised Query Production

Feb 16, 2023
Ante Wang, Linfeng Song, Qi Liu, Haitao Mi, Longyue Wang, Zhaopeng Tu, Jinsong Su, Dong Yu

Friend-training: Learning from Models of Different but Related Tasks

Jan 31, 2023
Mian Zhang, Lifeng Jin, Linfeng Song, Haitao Mi, Xiabing Zhou, Dong Yu

Getting the Most out of Simile Recognition

Nov 11, 2022
Xiaoyue Wang, Linfeng Song, Xin Liu, Chulun Zhou, Jinsong Su

Discover, Explanation, Improvement: Automatic Slice Detection Framework for Natural Language Processing

Nov 08, 2022
Wenyue Hua, Lifeng Jin, Linfeng Song, Haitao Mi, Yongfeng Zhang, Dong Yu

Semantic-based Pre-training for Dialogue Understanding

Sep 19, 2022
Xuefeng Bai, Linfeng Song, Yue Zhang
