Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Universal Sentence Representation Learning with Conditional Masked Language Model

Dec 29, 2020

Ziyi Yang, Yinfei Yang, Daniel Cer, Jax Law, Eric Darve

Figure 1 for Universal Sentence Representation Learning with Conditional Masked Language Model

Figure 2 for Universal Sentence Representation Learning with Conditional Masked Language Model

Figure 3 for Universal Sentence Representation Learning with Conditional Masked Language Model

Figure 4 for Universal Sentence Representation Learning with Conditional Masked Language Model

Share this with someone who'll enjoy it:

Abstract:This paper presents a novel training method, Conditional Masked Language Modeling (CMLM), to effectively learn sentence representations on large scale unlabeled corpora. CMLM integrates sentence representation learning into MLM training by conditioning on the encoded vectors of adjacent sentences. Our English CMLM model achieves state-of-the-art performance on SentEval, even outperforming models learned using (semi-)supervised signals. As a fully unsupervised learning method, CMLM can be conveniently extended to a broad range of languages and domains. We find that a multilingual CMLM model co-trained with bitext retrieval~(BR) and natural language inference~(NLI) tasks outperforms the previous state-of-the-art multilingual models by a large margin. We explore the same language bias of the learned representations, and propose a principle component based approach to remove the language identifying information from the representation while still retaining sentence semantics.

* preprint, updated license

View paper on

Share this with someone who'll enjoy it:

Title:Universal Sentence Representation Learning with Conditional Masked Language Model

Paper and Code