Abstract: Traditional information retrieval (IR) methods excel at textual and semantic matching but struggle with reasoning-intensive retrieval tasks that require multi-hop inference or complex semantic understanding between queries and documents. One promising solution is to explicitly rewrite or augment queries with large language models (LLMs) to elicit reasoning-relevant content prior to retrieval. However, relying on large-scale models such as GPT-4 or LLaMA3-70B remains impractical due to their high inference cost and limited deployability in real-world systems. In this work, we introduce TongSearch QR (previously known as "TongSearch Reasoner"), a family of small-scale language models for query reasoning and rewriting in reasoning-intensive retrieval. Using a novel semi-rule-based reward function, we apply reinforcement learning to enable smaller language models, e.g., Qwen2.5-7B-Instruct and Qwen2.5-1.5B-Instruct, to achieve query reasoning performance rivaling that of large-scale language models without their prohibitive inference costs. Experimental results on the BRIGHT benchmark show that, with BM25 as the retriever, both TongSearch QR-7B and TongSearch QR-1.5B significantly outperform existing baselines, including prompt-based query reasoners and several recent dense retrievers trained for reasoning-intensive retrieval, offering superior adaptability for real-world deployment.
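As a rough illustration of the pipeline this abstract describes, the sketch below has a small instruct model generate reasoning-relevant content for a query and appends it before BM25 retrieval. The prompt wording, the `rank_bm25` library, the toy corpus, and the use of the public Qwen2.5-1.5B-Instruct checkpoint (rather than the TongSearch QR weights) are assumptions for illustration, not the paper's released recipe.

```python
# Minimal sketch: a small instruct model expands the query with reasoning-relevant
# content, then BM25 retrieves over the augmented query. Assumptions noted above.
from transformers import pipeline
from rank_bm25 import BM25Okapi

# Hypothetical toy corpus; in BRIGHT these would be long, reasoning-heavy documents.
corpus = [
    "Dynamic programming solves problems by combining solutions to overlapping subproblems.",
    "Binary search halves the search interval at every step of the loop.",
    "Dijkstra's algorithm finds shortest paths in graphs with non-negative edge weights.",
]
bm25 = BM25Okapi([doc.lower().split() for doc in corpus])

# Small-scale query reasoner (assumed public checkpoint; a trained query-rewriting
# model would be swapped in here).
reasoner = pipeline("text-generation", model="Qwen/Qwen2.5-1.5B-Instruct")

def rewrite_query(query: str) -> str:
    """Ask the model what background knowledge is needed, then append it to the query."""
    prompt = (
        "Think step by step about what background knowledge and key concepts are "
        f"needed to answer the question, then list them.\nQuestion: {query}"
    )
    out = reasoner(prompt, max_new_tokens=128, do_sample=False)[0]["generated_text"]
    # The pipeline returns prompt + continuation; keep only the generated reasoning.
    return query + " " + out[len(prompt):]

query = "Which algorithm repeatedly splits the remaining range in half?"
augmented = rewrite_query(query)
scores = bm25.get_scores(augmented.lower().split())
best = max(range(len(corpus)), key=lambda i: scores[i])
print(corpus[best])
```

The appended reasoning adds lexical overlap with the relevant document that the raw query lacks, which is why a purely lexical retriever like BM25 can benefit from it.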
Abstract: Although BERT-based ranking models are widely used in commercial search engines, they are usually too time-consuming for online ranking tasks. Knowledge distillation, which aims to learn a smaller model with performance comparable to a larger one, is a common strategy for reducing online inference latency. In this paper, we investigate the effect of different loss functions for uniform-architecture distillation of BERT-based ranking models. Here, "uniform-architecture" means that both the teacher and student models are cross-encoders, with the students built on small-scale pre-trained language models. Our experimental results reveal that the optimal distillation configuration for ranking tasks differs substantially from that of general natural language processing tasks. Specifically, when the student models are cross-encoders, a pairwise loss on hard labels is critical for training them, whereas distillation objectives on intermediate Transformer layers may hurt performance. These findings emphasize the need to carefully design a distillation strategy for cross-encoder student models tailored to document ranking with pairwise training samples.
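As a rough sketch of the objective this abstract highlights, the snippet below combines a RankNet-style pairwise loss on hard relevance labels with an optional soft-label term matching the teacher's scores for a cross-encoder student. The specific loss forms, the weight `alpha`, and the toy score tensors are assumptions for illustration, not the paper's exact configuration.

```python
# Minimal PyTorch sketch of a distillation objective for a cross-encoder student:
# a pairwise loss over hard labels plus an optional soft-label term from the teacher.
import torch
import torch.nn.functional as F

def pairwise_hard_label_loss(pos_scores: torch.Tensor, neg_scores: torch.Tensor) -> torch.Tensor:
    """RankNet-style pairwise loss: the relevant document should outscore the irrelevant one."""
    return F.softplus(neg_scores - pos_scores).mean()

def distillation_loss(student_pos, student_neg, teacher_pos, teacher_neg, alpha: float = 0.5):
    """Pairwise hard-label term plus an MSE soft-label term toward the teacher's scores.

    alpha is an illustrative weight, not a value reported in the paper.
    """
    hard = pairwise_hard_label_loss(student_pos, student_neg)
    soft = F.mse_loss(student_pos, teacher_pos) + F.mse_loss(student_neg, teacher_neg)
    return hard + alpha * soft

# Toy usage with fake relevance scores for a batch of (query, doc+) / (query, doc-) pairs.
student_pos = torch.randn(8, requires_grad=True)
student_neg = torch.randn(8, requires_grad=True)
teacher_pos, teacher_neg = torch.randn(8), torch.randn(8)
loss = distillation_loss(student_pos, student_neg, teacher_pos, teacher_neg)
loss.backward()
```

The pairwise term directly encodes the document-ranking objective on hard labels, which is the component the abstract identifies as critical; intermediate-layer matching terms, which the abstract reports can hurt performance, are deliberately omitted here.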