Lin Gui

Weak Reward Model Transforms Generative Models into Robust Causal Event Extraction Systems

Jun 27, 2024
Via arXiv

Encourage or Inhibit Monosemanticity? Revisit Monosemanticity from a Feature Decorrelation Perspective

Jun 25, 2024
Via arXiv

Multi-Layer Ranking with Large Language Models for News Source Recommendation

Jun 17, 2024
Via arXiv

BoNBoN Alignment for Large Language Models and the Sweetness of Best-of-n Sampling

Jun 02, 2024
Via arXiv

PLAYER*: Enhancing LLM-based Multi-Agent Communication and Interaction in Murder Mystery Games

Apr 26, 2024
Via arXiv

COPR: Continual Human Preference Learning via Optimal Policy Regularization

Feb 27, 2024
Via arXiv

Addressing Order Sensitivity of In-Context Demonstration Examples in Causal Language Models

Feb 23, 2024
Via arXiv

Counterfactual Generation with Identifiability Guarantees

Feb 23, 2024
Via arXiv

Mirror: A Multiple-perspective Self-Reflection Method for Knowledge-rich Reasoning

Feb 22, 2024
Via arXiv

Mitigating Biases of Large Language Models in Stance Detection with Calibration

Feb 22, 2024
Via arXiv