Picture for Hamish Ivison

Hamish Ivison

Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback

Add code
Jun 13, 2024
Viaarxiv icon

OLMo: Accelerating the Science of Language Models

Add code
Feb 07, 2024
Figure 1 for OLMo: Accelerating the Science of Language Models
Figure 2 for OLMo: Accelerating the Science of Language Models
Figure 3 for OLMo: Accelerating the Science of Language Models
Figure 4 for OLMo: Accelerating the Science of Language Models
Viaarxiv icon

Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2

Add code
Nov 20, 2023
Figure 1 for Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2
Figure 2 for Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2
Figure 3 for Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2
Figure 4 for Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2
Viaarxiv icon

How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources

Add code
Jun 07, 2023
Figure 1 for How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources
Figure 2 for How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources
Figure 3 for How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources
Figure 4 for How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources
Viaarxiv icon

TESS: Text-to-Text Self-Conditioned Simplex Diffusion

Add code
May 15, 2023
Figure 1 for TESS: Text-to-Text Self-Conditioned Simplex Diffusion
Figure 2 for TESS: Text-to-Text Self-Conditioned Simplex Diffusion
Figure 3 for TESS: Text-to-Text Self-Conditioned Simplex Diffusion
Figure 4 for TESS: Text-to-Text Self-Conditioned Simplex Diffusion
Viaarxiv icon

HINT: Hypernetwork Instruction Tuning for Efficient Zero-Shot Generalisation

Add code
Dec 20, 2022
Figure 1 for HINT: Hypernetwork Instruction Tuning for Efficient Zero-Shot Generalisation
Figure 2 for HINT: Hypernetwork Instruction Tuning for Efficient Zero-Shot Generalisation
Figure 3 for HINT: Hypernetwork Instruction Tuning for Efficient Zero-Shot Generalisation
Figure 4 for HINT: Hypernetwork Instruction Tuning for Efficient Zero-Shot Generalisation
Viaarxiv icon

Data-Efficient Finetuning Using Cross-Task Nearest Neighbors

Add code
Dec 01, 2022
Figure 1 for Data-Efficient Finetuning Using Cross-Task Nearest Neighbors
Figure 2 for Data-Efficient Finetuning Using Cross-Task Nearest Neighbors
Figure 3 for Data-Efficient Finetuning Using Cross-Task Nearest Neighbors
Figure 4 for Data-Efficient Finetuning Using Cross-Task Nearest Neighbors
Viaarxiv icon

Hyperdecoders: Instance-specific decoders for multi-task NLP

Add code
Mar 15, 2022
Figure 1 for Hyperdecoders: Instance-specific decoders for multi-task NLP
Figure 2 for Hyperdecoders: Instance-specific decoders for multi-task NLP
Figure 3 for Hyperdecoders: Instance-specific decoders for multi-task NLP
Figure 4 for Hyperdecoders: Instance-specific decoders for multi-task NLP
Viaarxiv icon

Local Interpretations for Explainable Natural Language Processing: A Survey

Add code
Mar 20, 2021
Figure 1 for Local Interpretations for Explainable Natural Language Processing: A Survey
Figure 2 for Local Interpretations for Explainable Natural Language Processing: A Survey
Figure 3 for Local Interpretations for Explainable Natural Language Processing: A Survey
Figure 4 for Local Interpretations for Explainable Natural Language Processing: A Survey
Viaarxiv icon