Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Shanxiu He

Dynamic Superblock Pruning for Fast Learned Sparse Retrieval

Apr 23, 2025

Parker Carlson, Wentai Xie, Shanxiu He, Tao Yang

Abstract:This paper proposes superblock pruning (SP) during top-k online document retrieval for learned sparse representations. SP structures the sparse index as a set of superblocks on a sequence of document blocks and conducts a superblock-level selection to decide if some superblocks can be pruned before visiting their child blocks. SP generalizes the previous flat block or cluster-based pruning, allowing the early detection of groups of documents that cannot or are less likely to appear in the final top-k list. SP can accelerate sparse retrieval in a rank-safe or approximate manner under a high-relevance competitiveness constraint. Our experiments show that the proposed scheme significantly outperforms state-of-the-art baselines on MS MARCO passages on a single-threaded CPU.

* 6 pages, 3 figures, SIGIR 25

Via

Access Paper or Ask Questions

Weighted KL-Divergence for Document Ranking Model Refinement

Jun 10, 2024

Yingrui Yang, Yifan Qiao, Shanxiu He, Tao Yang

Figure 1 for Weighted KL-Divergence for Document Ranking Model Refinement

Figure 2 for Weighted KL-Divergence for Document Ranking Model Refinement

Figure 3 for Weighted KL-Divergence for Document Ranking Model Refinement

Figure 4 for Weighted KL-Divergence for Document Ranking Model Refinement

Abstract:Transformer-based retrieval and reranking models for text document search are often refined through knowledge distillation together with contrastive learning. A tight distribution matching between the teacher and student models can be hard as over-calibration may degrade training effectiveness when a teacher does not perform well. This paper contrastively reweights KL divergence terms to prioritize the alignment between a student and a teacher model for proper separation of positive and negative documents. This paper analyzes and evaluates the proposed loss function on the MS MARCO and BEIR datasets to demonstrate its effectiveness in improving the relevance of tested student models.

Via

Access Paper or Ask Questions

Approximate Cluster-Based Sparse Document Retrieval with Segmented Maximum Term Weights

Apr 13, 2024

Yifan Qiao, Shanxiu He, Yingrui Yang, Parker Carlson, Tao Yang

Figure 1 for Approximate Cluster-Based Sparse Document Retrieval with Segmented Maximum Term Weights

Figure 2 for Approximate Cluster-Based Sparse Document Retrieval with Segmented Maximum Term Weights

Figure 3 for Approximate Cluster-Based Sparse Document Retrieval with Segmented Maximum Term Weights

Figure 4 for Approximate Cluster-Based Sparse Document Retrieval with Segmented Maximum Term Weights

Abstract:This paper revisits cluster-based retrieval that partitions the inverted index into multiple groups and skips the index partially at cluster and document levels during online inference using a learned sparse representation. It proposes an approximate search scheme with two parameters to control the rank-safeness competitiveness of pruning with segmented maximum term weights within each cluster. Cluster-level maximum weight segmentation allows an improvement in the rank score bound estimation and threshold-based pruning to be approximately adaptive to bound estimation tightness, resulting in better relevance and efficiency. The experiments with MS MARCO passage ranking and BEIR datasets demonstrate the usefulness of the proposed scheme with a comparison to the baselines. This paper presents the design of this approximate retrieval scheme with rank-safeness analysis, compares clustering and segmentation options, and reports evaluation results.

Via

Access Paper or Ask Questions

Representation Sparsification with Hybrid Thresholding for Fast SPLADE-based Document Retrieval

Jun 20, 2023

Yifan Qiao, Yingrui Yang, Shanxiu He, Tao Yang

Figure 1 for Representation Sparsification with Hybrid Thresholding for Fast SPLADE-based Document Retrieval

Figure 2 for Representation Sparsification with Hybrid Thresholding for Fast SPLADE-based Document Retrieval

Figure 3 for Representation Sparsification with Hybrid Thresholding for Fast SPLADE-based Document Retrieval

Figure 4 for Representation Sparsification with Hybrid Thresholding for Fast SPLADE-based Document Retrieval

Abstract:Learned sparse document representations using a transformer-based neural model has been found to be attractive in both relevance effectiveness and time efficiency. This paper describes a representation sparsification scheme based on hard and soft thresholding with an inverted index approximation for faster SPLADE-based document retrieval. It provides analytical and experimental results on the impact of this learnable hybrid thresholding scheme.

* Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval 2023
* This paper is published in SIGIR'23

Via

Access Paper or Ask Questions