Title: Evaluating Token-Level and Passage-Level Dense Retrieval Models for Math Information Retrieval

Mar 21, 2022

Wei Zhong, Jheng-Hong Yang, Jimmy Lin

Abstract: With the recent success of dense retrieval methods based on bi-encoders, a number of studies have applied this approach to various downstream retrieval tasks with good efficiency and in-domain effectiveness. Dense retrieval models have recently also appeared in Math Information Retrieval (MIR) tasks, but the most effective systems remain "classic" retrieval methods that consider rich structure features. In this work, we try to combine the best of both worlds: a well-defined structure search method for effective formula search and bi-encoder dense retrieval models to capture contextual similarities in mathematical documents. Specifically, we evaluate two representative bi-encoder models (ColBERT and DPR) for token-level and passage-level dense retrieval on recent MIR tasks. To the best of our knowledge, this is the first time a DPR model has been evaluated in the MIR domain. Our results show that bi-encoder models are complementary to existing structure search methods, and we are able to advance the state of the art on a recent MIR dataset. We have made our model checkpoints and source code publicly available for the reproduction of our results.
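
To make the token-level versus passage-level distinction concrete, here is a minimal sketch of the two scoring styles. It is not the authors' code. The passage-level score is a single dot product between one query vector and one passage vector (DPR-style), and the token-level score sums, over the query tokens, the best-matching passage token similarity (ColBERT-style late interaction). The function names are hypothetical, the vectors are toy values, and the `interpolate` step is only an assumed way of combining a structure search score with a dense score; the abstract does not say how the two are fused.

```python
from typing import Sequence

Vector = Sequence[float]


def dot(u: Vector, v: Vector) -> float:
    """Plain dot product between two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def passage_level_score(query_vec: Vector, passage_vec: Vector) -> float:
    """DPR-style scoring: one vector per query, one vector per passage."""
    return dot(query_vec, passage_vec)


def token_level_score(query_tokens: Sequence[Vector],
                      passage_tokens: Sequence[Vector]) -> float:
    """ColBERT-style scoring: for each query token, take the maximum
    similarity over all passage tokens, then sum over query tokens."""
    return sum(max(dot(q, p) for p in passage_tokens) for q in query_tokens)


def interpolate(structure_score: float, dense_score: float,
                alpha: float = 0.5) -> float:
    """Assumed fusion for illustration only: a weighted sum of a
    structure search score and a dense retrieval score."""
    return alpha * structure_score + (1.0 - alpha) * dense_score


if __name__ == "__main__":
    # Toy 2-dimensional embeddings, chosen by hand.
    query_tokens = [[1.0, 0.0], [0.0, 1.0]]
    passage_tokens = [[1.0, 0.0], [0.5, 0.5]]
    print(passage_level_score([1.0, 0.0], [0.5, 0.5]))
    print(token_level_score(query_tokens, passage_tokens))
    print(interpolate(structure_score=2.0, dense_score=1.0, alpha=0.25))
```

In practice the embeddings would come from trained encoders and the scores from the two systems would usually need normalizing before any weighted combination; both of those steps are left out here.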
