Picture for Shuming Shi

Shuming Shi

Not All Preference Pairs Are Created Equal: A Recipe for Annotation-Efficient Iterative Preference Learning

Add code
Jun 25, 2024
Viaarxiv icon

On the Transformations across Reward Model, Parameter Update, and In-Context Prompt

Add code
Jun 24, 2024
Viaarxiv icon

Disperse-Then-Merge: Pushing the Limits of Instruction Tuning via Alignment Tax Reduction

Add code
May 22, 2024
Viaarxiv icon

Spotting AI's Touch: Identifying LLM-Paraphrased Spans in Text

Add code
May 21, 2024
Figure 1 for Spotting AI's Touch: Identifying LLM-Paraphrased Spans in Text
Figure 2 for Spotting AI's Touch: Identifying LLM-Paraphrased Spans in Text
Figure 3 for Spotting AI's Touch: Identifying LLM-Paraphrased Spans in Text
Figure 4 for Spotting AI's Touch: Identifying LLM-Paraphrased Spans in Text
Viaarxiv icon

Sequence can Secretly Tell You What to Discard

Add code
Apr 24, 2024
Figure 1 for Sequence can Secretly Tell You What to Discard
Figure 2 for Sequence can Secretly Tell You What to Discard
Figure 3 for Sequence can Secretly Tell You What to Discard
Figure 4 for Sequence can Secretly Tell You What to Discard
Viaarxiv icon

Retrieval is Accurate Generation

Add code
Feb 29, 2024
Viaarxiv icon

Mitigating Hallucinations of Large Language Models via Knowledge Consistent Alignment

Add code
Jan 28, 2024
Viaarxiv icon

Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model

Add code
Jan 23, 2024
Figure 1 for Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model
Figure 2 for Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model
Figure 3 for Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model
Figure 4 for Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model
Viaarxiv icon

Benchmarking LLMs via Uncertainty Quantification

Add code
Jan 23, 2024
Viaarxiv icon

Knowledge Fusion of Large Language Models

Add code
Jan 22, 2024
Viaarxiv icon