Picture for Lingfeng Shen

Lingfeng Shen

It Takes Two: On the Seamlessness between Reward and Policy Model in RLHF

Add code
Jun 12, 2024
Viaarxiv icon

DiffNorm: Self-Supervised Normalization for Non-autoregressive Speech-to-speech Translation

Add code
May 22, 2024
Viaarxiv icon

AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies

Add code
Feb 19, 2024
Viaarxiv icon

Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation

Add code
Feb 02, 2024
Figure 1 for Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation
Figure 2 for Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation
Figure 3 for Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation
Figure 4 for Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation
Viaarxiv icon

The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts

Add code
Jan 23, 2024
Viaarxiv icon

Narrowing the Gap between Zero- and Few-shot Machine Translation by Matching Styles

Add code
Nov 04, 2023
Viaarxiv icon

Do pretrained Transformers Really Learn In-context by Gradient Descent?

Add code
Oct 12, 2023
Figure 1 for Do pretrained Transformers Really Learn In-context by Gradient Descent?
Figure 2 for Do pretrained Transformers Really Learn In-context by Gradient Descent?
Figure 3 for Do pretrained Transformers Really Learn In-context by Gradient Descent?
Figure 4 for Do pretrained Transformers Really Learn In-context by Gradient Descent?
Viaarxiv icon

SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation

Add code
Oct 06, 2023
Figure 1 for SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation
Figure 2 for SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation
Figure 3 for SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation
Figure 4 for SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation
Viaarxiv icon

The Trickle-down Impact of Reward consistency on RLHF

Add code
Sep 28, 2023
Figure 1 for The Trickle-down Impact of Reward consistency on RLHF
Figure 2 for The Trickle-down Impact of Reward consistency on RLHF
Figure 3 for The Trickle-down Impact of Reward consistency on RLHF
Figure 4 for The Trickle-down Impact of Reward consistency on RLHF
Viaarxiv icon

Sen2Pro: A Probabilistic Perspective to Sentence Embedding from Pre-trained Language Model

Add code
Jun 04, 2023
Figure 1 for Sen2Pro: A Probabilistic Perspective to Sentence Embedding from Pre-trained Language Model
Figure 2 for Sen2Pro: A Probabilistic Perspective to Sentence Embedding from Pre-trained Language Model
Figure 3 for Sen2Pro: A Probabilistic Perspective to Sentence Embedding from Pre-trained Language Model
Figure 4 for Sen2Pro: A Probabilistic Perspective to Sentence Embedding from Pre-trained Language Model
Viaarxiv icon