Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Linfeng Cao

Provably Efficient Personalized Multi-Objective Bandits with Proactive Conversational Queries

Jun 07, 2026

Linfeng Cao, Ming Shi, Ness B. Shroff

Abstract:Personalized decision-making in multi-objective bandits requires learning user-specific trade-offs among competing objectives. Since arm utility depends on both unknown rewards and unknown preferences, existing methods infer preferences only from utility feedback, entangling preference learning with reward exploration. In practice, however, users often reveal their priorities through proactive conversational queries (e.g., "cheap and clean hotel"), yet this structured signal is not leveraged. We formalize a proactive query-based framework in which user queries provide structured preference signals. Modeling these signals via a Plackett-Luce subset choice model, we show that query-only learning is insufficient due to a fundamental shift-invariance barrier. To resolve this, we introduce MO-PQUCB, a hybrid algorithm that integrates query-based preference anchoring with bandit feedback through shift-invariant regularization and dual-exploration UCB. We prove that proactive queries accelerate preference estimation and yield improved regret scaling over prior preference-aware MO-MAB methods. Under corrupted queries, we further characterize statistical limits and design a robust estimator achieving near-optimal performance when the corruption is sparse. Experiments validate both theoretical and practical gains.

* UAI 2026

Via

Access Paper or Ask Questions

Provably Efficient Multi-Objective Bandit Algorithms under Preference-Centric Customization

Feb 19, 2025

Linfeng Cao, Ming Shi, Ness B. Shroff

Figure 1 for Provably Efficient Multi-Objective Bandit Algorithms under Preference-Centric Customization

Figure 2 for Provably Efficient Multi-Objective Bandit Algorithms under Preference-Centric Customization

Figure 3 for Provably Efficient Multi-Objective Bandit Algorithms under Preference-Centric Customization

Figure 4 for Provably Efficient Multi-Objective Bandit Algorithms under Preference-Centric Customization

Abstract:Multi-objective multi-armed bandit (MO-MAB) problems traditionally aim to achieve Pareto optimality. However, real-world scenarios often involve users with varying preferences across objectives, resulting in a Pareto-optimal arm that may score high for one user but perform quite poorly for another. This highlights the need for customized learning, a factor often overlooked in prior research. To address this, we study a preference-aware MO-MAB framework in the presence of explicit user preference. It shifts the focus from achieving Pareto optimality to further optimizing within the Pareto front under preference-centric customization. To our knowledge, this is the first theoretical study of customized MO-MAB optimization with explicit user preferences. Motivated by practical applications, we explore two scenarios: unknown preference and hidden preference, each presenting unique challenges for algorithm design and analysis. At the core of our algorithms are preference estimation and preference-aware optimization mechanisms to adapt to user preferences effectively. We further develop novel analytical techniques to establish near-optimal regret of the proposed algorithms. Strong empirical performance confirm the effectiveness of our approach.

Via

Access Paper or Ask Questions

Graph-Skeleton: ~1% Nodes are Sufficient to Represent Billion-Scale Graph

Feb 14, 2024

Linfeng Cao, Haoran Deng, Chunping Wang, Lei Chen, Yang Yang

Figure 1 for Graph-Skeleton: ~1% Nodes are Sufficient to Represent Billion-Scale Graph

Figure 2 for Graph-Skeleton: ~1% Nodes are Sufficient to Represent Billion-Scale Graph

Figure 3 for Graph-Skeleton: ~1% Nodes are Sufficient to Represent Billion-Scale Graph

Figure 4 for Graph-Skeleton: ~1% Nodes are Sufficient to Represent Billion-Scale Graph

Abstract:Due to the ubiquity of graph data on the web, web graph mining has become a hot research spot. Nonetheless, the prevalence of large-scale web graphs in real applications poses significant challenges to storage, computational capacity and graph model design. Despite numerous studies to enhance the scalability of graph models, a noticeable gap remains between academic research and practical web graph mining applications. One major cause is that in most industrial scenarios, only a small part of nodes in a web graph are actually required to be analyzed, where we term these nodes as target nodes, while others as background nodes. In this paper, we argue that properly fetching and condensing the background nodes from massive web graph data might be a more economical shortcut to tackle the obstacles fundamentally. To this end, we make the first attempt to study the problem of massive background nodes compression for target nodes classification. Through extensive experiments, we reveal two critical roles played by the background nodes in target node classification: enhancing structural connectivity between target nodes, and feature correlation with target nodes. Followingthis, we propose a novel Graph-Skeleton1 model, which properly fetches the background nodes, and further condenses the semantic and topological information of background nodes within similar target-background local structures. Extensive experiments on various web graph datasets demonstrate the effectiveness and efficiency of the proposed method. In particular, for MAG240M dataset with 0.24 billion nodes, our generated skeleton graph achieves highly comparable performance while only containing 1.8% nodes of the original graph.

* 21 pages, 11 figures, In Proceedings of the ACM Web Conference 2024 (WWW'24)

Via

Access Paper or Ask Questions