Title: Demystifying Self-Supervised Learning: An Information-Theoretical Framework

Jun 11, 2020

Yao-Hung Hubert Tsai, Yue Wu, Ruslan Salakhutdinov, Louis-Philippe Morency

Abstract: Self-supervised representation learning adopts self-defined signals as supervision and uses the learned representation for downstream tasks; examples include masked language modeling (e.g., BERT) for natural language processing and contrastive visual representation learning (e.g., SimCLR) for computer vision. In this paper, we present a theoretical framework explaining that self-supervised learning is likely to work under the assumption that only the shared information (e.g., contextual information or content) between the input (e.g., non-masked words or original images) and the self-supervised signals (e.g., masked words or augmented images) contributes to downstream tasks. Under this assumption, we demonstrate that self-supervised learned representations can extract task-relevant information and discard task-irrelevant information. We further connect our theoretical analysis to popular contrastive and predictive (self-supervised) learning objectives. In the experimental section, we provide controlled experiments on two popular tasks: 1) visual representation learning with various self-supervised learning objectives, to empirically support our analysis; and 2) visual-textual representation learning, to challenge the framework in a setting where the input and the self-supervised signal lie in different modalities.
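
As a rough illustration of the contrastive objectives the abstract refers to, the sketch below computes a one-directional InfoNCE loss between representations of the input and representations of the self-supervised signal. It is a generic textbook form, not the authors' implementation: the function names (`info_nce_loss`, `mi_lower_bound`), the cosine similarity, and the `temperature` value are all assumptions made for this example. InfoNCE is commonly used as a lower bound on mutual information, which is one way a contrastive loss can be read as capturing information shared between the two views.

```python
import math


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def normalize(v):
    """Scale a vector to unit length so that dot products are cosine similarities."""
    norm = math.sqrt(dot(v, v))
    return [a / norm for a in v]


def info_nce_loss(z_x, z_s, temperature=0.1):
    """Average one-directional InfoNCE loss over a batch.

    z_x[i] is the representation of input i, and z_s[i] is the representation
    of its self-supervised signal (the positive). Every z_s[j] with j != i
    serves as a negative for z_x[i]. The temperature is a hypothetical default.
    """
    z_x = [normalize(v) for v in z_x]
    z_s = [normalize(v) for v in z_s]
    n = len(z_x)
    total = 0.0
    for i in range(n):
        logits = [dot(z_x[i], z_s[j]) / temperature for j in range(n)]
        # Numerically stable log-sum-exp over all candidates.
        m = max(logits)
        log_sum_exp = m + math.log(sum(math.exp(l - m) for l in logits))
        # Cross-entropy with the positive pair as the correct class.
        total += log_sum_exp - logits[i]
    return total / n


def mi_lower_bound(loss, batch_size):
    """InfoNCE estimate of a lower bound on mutual information, in nats."""
    return math.log(batch_size) - loss


# Toy batch: four inputs whose signals either match them or are mismatched.
inputs = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]]
matched_signals = [list(v) for v in inputs]
mismatched_signals = inputs[1:] + inputs[:1]

loss_matched = info_nce_loss(inputs, matched_signals)
loss_mismatched = info_nce_loss(inputs, mismatched_signals)
```

On this toy batch, the matched pairs give a much lower loss than the mismatched ones, and the resulting estimate can never exceed the logarithm of the batch size. Methods such as SimCLR use a symmetric variant with additional within-view negatives, which this sketch omits for brevity.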
