Headless Language Models: Learning without Predicting with Contrastive Weight Tying

Add code
Sep 15, 2023
Figure 1 for Headless Language Models: Learning without Predicting with Contrastive Weight Tying
Figure 2 for Headless Language Models: Learning without Predicting with Contrastive Weight Tying
Figure 3 for Headless Language Models: Learning without Predicting with Contrastive Weight Tying
Figure 4 for Headless Language Models: Learning without Predicting with Contrastive Weight Tying

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: