
Yue Wu

Delving into the Reversal Curse: How Far Can Large Language Models Generalize?

Oct 24, 2024

TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling

Oct 18, 2024

A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement

Oct 17, 2024

Llama SLayer 8B: Shallow Layers Hold the Key to Knowledge Injection

Oct 03, 2024

General Preference Modeling with Preference Representations for Aligning Language Models

Oct 03, 2024

Infer Human's Intentions Before Following Natural Language Instructions

Sep 26, 2024

Triple Point Masking

Sep 26, 2024

2DSig-Detect: a semi-supervised framework for anomaly detection on image data using 2D-signatures

Sep 08, 2024

OCTCube: A 3D foundation model for optical coherence tomography that improves cross-dataset, cross-disease, cross-device and cross-modality analysis

Aug 20, 2024

Educating LLMs like Human Students: Structure-aware Injection of Domain Knowledge

Jul 23, 2024