Alert button
Picture for Linfeng Song

Linfeng Song

Alert button

Self-Consistency Boosts Calibration for Math Reasoning

Add code
Bookmark button
Alert button
Mar 14, 2024
Ante Wang, Linfeng Song, Ye Tian, Baolin Peng, Lifeng Jin, Haitao Mi, Jinsong Su, Dong Yu

Figure 1 for Self-Consistency Boosts Calibration for Math Reasoning
Figure 2 for Self-Consistency Boosts Calibration for Math Reasoning
Figure 3 for Self-Consistency Boosts Calibration for Math Reasoning
Figure 4 for Self-Consistency Boosts Calibration for Math Reasoning
Viaarxiv icon

A Knowledge Plug-and-Play Test Bed for Open-domain Dialogue Generation

Add code
Bookmark button
Alert button
Mar 06, 2024
Xiangci Li, Linfeng Song, Lifeng Jin, Haitao Mi, Jessica Ouyang, Dong Yu

Figure 1 for A Knowledge Plug-and-Play Test Bed for Open-domain Dialogue Generation
Figure 2 for A Knowledge Plug-and-Play Test Bed for Open-domain Dialogue Generation
Figure 3 for A Knowledge Plug-and-Play Test Bed for Open-domain Dialogue Generation
Figure 4 for A Knowledge Plug-and-Play Test Bed for Open-domain Dialogue Generation
Viaarxiv icon

Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal

Add code
Bookmark button
Alert button
Mar 02, 2024
Jianheng Huang, Leyang Cui, Ante Wang, Chengyi Yang, Xinting Liao, Linfeng Song, Junfeng Yao, Jinsong Su

Figure 1 for Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal
Figure 2 for Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal
Figure 3 for Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal
Figure 4 for Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal
Viaarxiv icon

Collaborative decoding of critical tokens for boosting factuality of large language models

Add code
Bookmark button
Alert button
Feb 28, 2024
Lifeng Jin, Baolin Peng, Linfeng Song, Haitao Mi, Ye Tian, Dong Yu

Viaarxiv icon

Fine-Grained Self-Endorsement Improves Factuality and Reasoning

Add code
Bookmark button
Alert button
Feb 23, 2024
Ante Wang, Linfeng Song, Baolin Peng, Ye Tian, Lifeng Jin, Haitao Mi, Jinsong Su, Dong Yu

Viaarxiv icon

Self-Alignment for Factuality: Mitigating Hallucinations in LLMs via Self-Evaluation

Add code
Bookmark button
Alert button
Feb 14, 2024
Xiaoying Zhang, Baolin Peng, Ye Tian, Jingyan Zhou, Lifeng Jin, Linfeng Song, Haitao Mi, Helen Meng

Viaarxiv icon

Inconsistent dialogue responses and how to recover from them

Add code
Bookmark button
Alert button
Jan 18, 2024
Mian Zhang, Lifeng Jin, Linfeng Song, Haitao Mi, Dong Yu

Viaarxiv icon

Response Enhanced Semi-Supervised Dialogue Query Generation

Add code
Bookmark button
Alert button
Dec 20, 2023
Jianheng Huang, Ante Wang, Linfeng Gao, Linfeng Song, Jinsong Su

Viaarxiv icon

The Trickle-down Impact of Reward (In-)consistency on RLHF

Add code
Bookmark button
Alert button
Sep 28, 2023
Lingfeng Shen, Sihao Chen, Linfeng Song, Lifeng Jin, Baolin Peng, Haitao Mi, Daniel Khashabi, Dong Yu

Figure 1 for The Trickle-down Impact of Reward (In-)consistency on RLHF
Figure 2 for The Trickle-down Impact of Reward (In-)consistency on RLHF
Figure 3 for The Trickle-down Impact of Reward (In-)consistency on RLHF
Figure 4 for The Trickle-down Impact of Reward (In-)consistency on RLHF
Viaarxiv icon

Stabilizing RLHF through Advantage Model and Selective Rehearsal

Add code
Bookmark button
Alert button
Sep 18, 2023
Baolin Peng, Linfeng Song, Ye Tian, Lifeng Jin, Haitao Mi, Dong Yu

Figure 1 for Stabilizing RLHF through Advantage Model and Selective Rehearsal
Figure 2 for Stabilizing RLHF through Advantage Model and Selective Rehearsal
Figure 3 for Stabilizing RLHF through Advantage Model and Selective Rehearsal
Figure 4 for Stabilizing RLHF through Advantage Model and Selective Rehearsal
Viaarxiv icon