Title: Sparse, Dense, and Attentional Representations for Text Retrieval

May 01, 2020

Yi Luan, Jacob Eisenstein, Kristina Toutanova, Michael Collins


Abstract: Dual encoder architectures perform retrieval by encoding documents and queries into dense low-dimensional vectors, and selecting the document that has the highest inner product with the query. We investigate the capacity of this architecture relative to sparse bag-of-words retrieval models and attentional neural networks. We establish new connections between the encoding dimension and the number of unique terms in each document and query, using both theoretical and empirical analysis. We show an upper bound on the encoding size, which may be unsustainably large for long documents. For cross-attention models, we show an upper bound using much smaller encodings per token, but such models are difficult to scale to realistic retrieval problems due to computational cost. Building on these insights, we propose a simple neural model that combines the efficiency of dual encoders with some of the expressiveness of attentional architectures, and explore a sparse-dense hybrid to capitalize on the precision of sparse retrieval. These models outperform strong alternatives in open retrieval.
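
The dual encoder scoring rule described in the abstract (rank documents by inner product with the query vector) can be illustrated with a minimal sketch. This is not the authors' code: the function name `retrieve`, the toy 3-dimensional vectors, and the use of NumPy are assumptions made purely for illustration, and a real system would obtain the vectors from trained query and document encoders.

```python
import numpy as np

def retrieve(query_vec, doc_matrix, k=1):
    """Return (indices, scores) of the k documents with the highest inner product.

    query_vec:  shape (d,)   dense query encoding (hypothetical toy values below)
    doc_matrix: shape (n, d) one dense encoding per document
    """
    scores = doc_matrix @ query_vec      # one inner product per document
    top = np.argsort(-scores)[:k]        # sort descending, keep the best k
    return top, scores[top]

# Toy encodings standing in for the output of trained encoders (assumed values).
docs = np.array([
    [1.0, 0.0, 0.0],
    [0.0, 2.0, 0.0],
    [0.5, 0.5, 1.0],
])
query = np.array([0.0, 1.0, 0.2])

top, top_scores = retrieve(query, docs, k=2)
print(top, top_scores)
```

Because each document is scored independently of the others, the document encodings can be computed and indexed ahead of time, which is the efficiency property the abstract attributes to dual encoders.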
