Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Fuli Feng

University of Science and Technology of China

Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning

Dec 19, 2024

Ziang Ye, Zhenru Zhang, Yang Zhang, Jianxin Ma, Junyang Lin, Fuli Feng

Figure 1 for Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning

Figure 2 for Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning

Figure 3 for Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning

Figure 4 for Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning

Abstract:When using agent-task datasets to enhance agent capabilities for Large Language Models (LLMs), current methodologies often treat all tokens within a sample equally. However, we argue that tokens serving different roles - specifically, reasoning tokens versus boilerplate tokens (e.g., those governing output format) - differ significantly in importance and learning complexity, necessitating their disentanglement and distinct treatment. To address this, we propose a novel Shuffle-Aware Discriminator (SHAD) for adaptive token discrimination. SHAD classifies tokens by exploiting predictability differences observed after shuffling input-output combinations across samples: boilerplate tokens, due to their repetitive nature among samples, maintain predictability, whereas reasoning tokens do not. Using SHAD, we propose the Reasoning-highlighted Fine-Tuning (RFT) method, which adaptively emphasizes reasoning tokens during fine-tuning, yielding notable performance gains over common Supervised Fine-Tuning (SFT).

Via

Access Paper or Ask Questions

Real-Time Personalization for LLM-based Recommendation with Customized In-Context Learning

Oct 30, 2024

Keqin Bao, Ming Yan, Yang Zhang, Jizhi Zhang, Wenjie Wang, Fuli Feng, Xiangnan He

Figure 1 for Real-Time Personalization for LLM-based Recommendation with Customized In-Context Learning

Figure 2 for Real-Time Personalization for LLM-based Recommendation with Customized In-Context Learning

Figure 3 for Real-Time Personalization for LLM-based Recommendation with Customized In-Context Learning

Figure 4 for Real-Time Personalization for LLM-based Recommendation with Customized In-Context Learning

Abstract:Frequently updating Large Language Model (LLM)-based recommender systems to adapt to new user interests -- as done for traditional ones -- is impractical due to high training costs, even with acceleration methods. This work explores adapting to dynamic user interests without any model updates by leveraging In-Context Learning (ICL), which allows LLMs to learn new tasks from few-shot examples provided in the input. Using new-interest examples as the ICL few-shot examples, LLMs may learn real-time interest directly, avoiding the need for model updates. However, existing LLM-based recommenders often lose the in-context learning ability during recommendation tuning, while the original LLM's in-context learning lacks recommendation-specific focus. To address this, we propose RecICL, which customizes recommendation-specific in-context learning for real-time recommendations. RecICL organizes training examples in an in-context learning format, ensuring that in-context learning ability is preserved and aligned with the recommendation task during tuning. Extensive experiments demonstrate RecICL's effectiveness in delivering real-time recommendations without requiring model updates. Our code is available at https://github.com/ym689/rec_icl.

Via

Access Paper or Ask Questions

FLOW: A Feedback LOop FrameWork for Simultaneously Enhancing Recommendation and User Agents

Oct 26, 2024

Shihao Cai, Jizhi Zhang, Keqin Bao, Chongming Gao, Fuli Feng

Figure 1 for FLOW: A Feedback LOop FrameWork for Simultaneously Enhancing Recommendation and User Agents

Figure 2 for FLOW: A Feedback LOop FrameWork for Simultaneously Enhancing Recommendation and User Agents

Figure 3 for FLOW: A Feedback LOop FrameWork for Simultaneously Enhancing Recommendation and User Agents

Figure 4 for FLOW: A Feedback LOop FrameWork for Simultaneously Enhancing Recommendation and User Agents

Abstract:Agents powered by large language models have shown remarkable reasoning and execution capabilities, attracting researchers to explore their potential in the recommendation domain. Previous studies have primarily focused on enhancing the capabilities of either recommendation agents or user agents independently, but have not considered the interaction and collaboration between recommendation agents and user agents. To address this gap, we propose a novel framework named FLOW, which achieves collaboration between the recommendation agent and the user agent by introducing a feedback loop. Specifically, the recommendation agent refines its understanding of the user's preferences by analyzing the user agent's feedback on previously suggested items, while the user agent leverages suggested items to uncover deeper insights into the user's latent interests. This iterative refinement process enhances the reasoning capabilities of both the recommendation agent and the user agent, enabling more precise recommendations and a more accurate simulation of user behavior. To demonstrate the effectiveness of the feedback loop, we evaluate both recommendation performance and user simulation performance on three widely used recommendation domain datasets. The experimental results indicate that the feedback loop can simultaneously improve the performance of both the recommendation and user agents.

Via

Access Paper or Ask Questions

MMDocBench: Benchmarking Large Vision-Language Models for Fine-Grained Visual Document Understanding

Oct 25, 2024

Fengbin Zhu, Ziyang Liu, Xiang Yao Ng, Haohui Wu, Wenjie Wang, Fuli Feng, Chao Wang, Huanbo Luan, Tat Seng Chua

Figure 1 for MMDocBench: Benchmarking Large Vision-Language Models for Fine-Grained Visual Document Understanding

Figure 2 for MMDocBench: Benchmarking Large Vision-Language Models for Fine-Grained Visual Document Understanding

Figure 3 for MMDocBench: Benchmarking Large Vision-Language Models for Fine-Grained Visual Document Understanding

Figure 4 for MMDocBench: Benchmarking Large Vision-Language Models for Fine-Grained Visual Document Understanding

Abstract:Large Vision-Language Models (LVLMs) have achieved remarkable performance in many vision-language tasks, yet their capabilities in fine-grained visual understanding remain insufficiently evaluated. Existing benchmarks either contain limited fine-grained evaluation samples that are mixed with other data, or are confined to object-level assessments in natural images. To holistically assess LVLMs' fine-grained visual understanding capabilities, we propose using document images with multi-granularity and multi-modal information to supplement natural images. In this light, we construct MMDocBench, a benchmark with various OCR-free document understanding tasks for the evaluation of fine-grained visual perception and reasoning abilities. MMDocBench defines 15 main tasks with 4,338 QA pairs and 11,353 supporting regions, covering various document images such as research papers, receipts, financial reports, Wikipedia tables, charts, and infographics. Based on MMDocBench, we conduct extensive experiments using 13 open-source and 3 proprietary advanced LVLMs, assessing their strengths and weaknesses across different tasks and document image types. The benchmark, task instructions, and evaluation code will be made publicly available.

* Under review

Via

Access Paper or Ask Questions

Personalized Image Generation with Large Multimodal Models

Oct 18, 2024

Yiyan Xu, Wenjie Wang, Yang Zhang, Tang Biao, Peng Yan, Fuli Feng, Xiangnan He

Figure 1 for Personalized Image Generation with Large Multimodal Models

Figure 2 for Personalized Image Generation with Large Multimodal Models

Figure 3 for Personalized Image Generation with Large Multimodal Models

Figure 4 for Personalized Image Generation with Large Multimodal Models

Abstract:Personalized content filtering, such as recommender systems, has become a critical infrastructure to alleviate information overload. However, these systems merely filter existing content and are constrained by its limited diversity, making it difficult to meet users' varied content needs. To address this limitation, personalized content generation has emerged as a promising direction with broad applications. Nevertheless, most existing research focuses on personalized text generation, with relatively little attention given to personalized image generation. The limited work in personalized image generation faces challenges in accurately capturing users' visual preferences and needs from noisy user-interacted images and complex multimodal instructions. Worse still, there is a lack of supervised data for training personalized image generation models. To overcome the challenges, we propose a Personalized Image Generation Framework named Pigeon, which adopts exceptional large multimodal models with three dedicated modules to capture users' visual preferences and needs from noisy user history and multimodal instructions. To alleviate the data scarcity, we introduce a two-stage preference alignment scheme, comprising masked preference reconstruction and pairwise preference alignment, to align Pigeon with the personalized image generation task. We apply Pigeon to personalized sticker and movie poster generation, where extensive quantitative results and human evaluation highlight its superiority over various generative baselines.

Via

Access Paper or Ask Questions

Efficient Inference for Large Language Model-based Generative Recommendation

Oct 07, 2024

Xinyu Lin, Chaoqun Yang, Wenjie Wang, Yongqi Li, Cunxiao Du, Fuli Feng, See-Kiong Ng, Tat-Seng Chua

Figure 1 for Efficient Inference for Large Language Model-based Generative Recommendation

Figure 2 for Efficient Inference for Large Language Model-based Generative Recommendation

Figure 3 for Efficient Inference for Large Language Model-based Generative Recommendation

Figure 4 for Efficient Inference for Large Language Model-based Generative Recommendation

Abstract:Large Language Model (LLM)-based generative recommendation has achieved notable success, yet its practical deployment is costly particularly due to excessive inference latency caused by autoregressive decoding. For lossless LLM decoding acceleration, Speculative Decoding (SD) has emerged as a promising solution. However, applying SD to generative recommendation presents unique challenges due to the requirement of generating top-K items (i.e., K distinct token sequences) as a recommendation list by beam search. This leads to more stringent verification in SD, where all the top-K sequences from the target LLM must be successfully drafted by the draft model at each decoding step. To alleviate this, we consider 1) boosting top-K sequence alignment between the draft model and the target LLM, and 2) relaxing the verification strategy to reduce trivial LLM calls. To this end, we propose an alignment framework named AtSpeed, which presents the AtSpeed-S optimization objective for top-K alignment under the strict top-K verification. Moreover, we introduce a relaxed sampling verification strategy that allows high-probability non-top-K drafted sequences to be accepted, significantly reducing LLM calls. Correspondingly, we propose AtSpeed-R for top-K alignment under this relaxed sampling verification. Empirical results on two real-world datasets demonstrate that AtSpeed significantly accelerates LLM-based generative recommendation, e.g., near 2x speedup under strict top-K verification and up to 2.5 speedup under relaxed sampling verification. The codes and datasets will be released in the near future.

Via

Access Paper or Ask Questions

Proactive Recommendation in Social Networks: Steering User Interest via Neighbor Influence

Sep 13, 2024

Hang Pan, Shuxian Bi, Wenjie Wang, Haoxuan Li, Peng Wu, Fuli Feng, Xiangnan He

Figure 1 for Proactive Recommendation in Social Networks: Steering User Interest via Neighbor Influence

Figure 2 for Proactive Recommendation in Social Networks: Steering User Interest via Neighbor Influence

Figure 3 for Proactive Recommendation in Social Networks: Steering User Interest via Neighbor Influence

Figure 4 for Proactive Recommendation in Social Networks: Steering User Interest via Neighbor Influence

Abstract:Recommending items solely catering to users' historical interests narrows users' horizons. Recent works have considered steering target users beyond their historical interests by directly adjusting items exposed to them. However, the recommended items for direct steering might not align perfectly with users' interests evolution, detrimentally affecting target users' experience. To avoid this issue, we propose a new task named Proactive Recommendation in Social Networks (PRSN) that indirectly steers users' interest by utilizing the influence of social neighbors, i.e., indirect steering by adjusting the exposure of a target item to target users' neighbors. The key to PRSN lies in answering an interventional question: what would a target user's feedback be on a target item if the item is exposed to the user's different neighbors? To answer this question, we resort to causal inference and formalize PRSN as: (1) estimating the potential feedback of a user on an item, under the network interference by the item's exposure to the user's neighbors; and (2) adjusting the exposure of a target item to target users' neighbors to trade off steering performance and the damage to the neighbors' experience. To this end, we propose a Neighbor Interference Recommendation (NIRec) framework with two key modules: (1)an interference representation-based estimation module for modeling potential feedback; and (2) a post-learning-based optimization module for optimizing a target item's exposure to trade off steering performance and the neighbors' experience by greedy search. We conduct extensive semi-simulation experiments based on three real-world datasets, validating the steering effectiveness of NIRec.

Via

Access Paper or Ask Questions

Negative Sampling in Recommendation: A Survey and Future Directions

Sep 11, 2024

Haokai Ma, Ruobing Xie, Lei Meng, Fuli Feng, Xiaoyu Du, Xingwu Sun, Zhanhui Kang, Xiangxu Meng

Figure 1 for Negative Sampling in Recommendation: A Survey and Future Directions

Figure 2 for Negative Sampling in Recommendation: A Survey and Future Directions

Figure 3 for Negative Sampling in Recommendation: A Survey and Future Directions

Figure 4 for Negative Sampling in Recommendation: A Survey and Future Directions

Abstract:Recommender systems aim to capture users' personalized preferences from the cast amount of user behaviors, making them pivotal in the era of information explosion. However, the presence of the dynamic preference, the "information cocoons", and the inherent feedback loops in recommendation make users interact with a limited number of items. Conventional recommendation algorithms typically focus on the positive historical behaviors, while neglecting the essential role of negative feedback in user interest understanding. As a promising but easy-to-ignored area, negative sampling is proficients in revealing the genuine negative aspect inherent in user behaviors, emerging as an inescapable procedure in recommendation. In this survey, we first discuss the role of negative sampling in recommendation and thoroughly analyze challenges that consistently impede its progress. Then, we conduct an extensive literature review on the existing negative sampling strategies in recommendation and classify them into five categories with their discrepant techniques. Finally, we detail the insights of the tailored negative sampling strategies in diverse recommendation scenarios and outline an overview of the prospective research directions toward which the community may engage and benefit.

* 38 pages, 9 figures; Under review

Via

Access Paper or Ask Questions

Incorporate LLMs with Influential Recommender System

Sep 07, 2024

Mingze Wang, Shuxian Bi, Wenjie Wang, Chongming Gao, Yangyang Li, Fuli Feng

Figure 1 for Incorporate LLMs with Influential Recommender System

Figure 2 for Incorporate LLMs with Influential Recommender System

Figure 3 for Incorporate LLMs with Influential Recommender System

Abstract:Recommender systems have achieved increasing accuracy over the years. However, this precision often leads users to narrow their interests, resulting in issues such as limited diversity and the creation of echo chambers. Current research addresses these challenges through proactive recommender systems by recommending a sequence of items (called influence path) to guide user interest in the target item. However, existing methods struggle to construct a coherent influence path that builds up with items the user is likely to enjoy. In this paper, we leverage the Large Language Model's (LLMs) exceptional ability for path planning and instruction following, introducing a novel approach named LLM-based Influence Path Planning (LLM-IPP). Our approach maintains coherence between consecutive recommendations and enhances user acceptability of the recommended items. To evaluate LLM-IPP, we implement various user simulators and metrics to measure user acceptability and path coherence. Experimental results demonstrate that LLM-IPP significantly outperforms traditional proactive recommender systems. This study pioneers the integration of LLMs into proactive recommender systems, offering a reliable and user-engaging methodology for future recommendation technologies.

* 5 pages, 1 figure

Via

Access Paper or Ask Questions

Debias Can be Unreliable: Mitigating Bias Issue in Evaluating Debiasing Recommendation

Sep 07, 2024

Chengbing Wang, Wentao Shi, Jizhi Zhang, Wenjie Wang, Hang Pan, Fuli Feng

Figure 1 for Debias Can be Unreliable: Mitigating Bias Issue in Evaluating Debiasing Recommendation

Figure 2 for Debias Can be Unreliable: Mitigating Bias Issue in Evaluating Debiasing Recommendation

Figure 3 for Debias Can be Unreliable: Mitigating Bias Issue in Evaluating Debiasing Recommendation

Figure 4 for Debias Can be Unreliable: Mitigating Bias Issue in Evaluating Debiasing Recommendation

Abstract:Recent work has improved recommendation models remarkably by equipping them with debiasing methods. Due to the unavailability of fully-exposed datasets, most existing approaches resort to randomly-exposed datasets as a proxy for evaluating debiased models, employing traditional evaluation scheme to represent the recommendation performance. However, in this study, we reveal that traditional evaluation scheme is not suitable for randomly-exposed datasets, leading to inconsistency between the Recall performance obtained using randomly-exposed datasets and that obtained using fully-exposed datasets. Such inconsistency indicates the potential unreliability of experiment conclusions on previous debiasing techniques and calls for unbiased Recall evaluation using randomly-exposed datasets. To bridge the gap, we propose the Unbiased Recall Evaluation (URE) scheme, which adjusts the utilization of randomly-exposed datasets to unbiasedly estimate the true Recall performance on fully-exposed datasets. We provide theoretical evidence to demonstrate the rationality of URE and perform extensive experiments on real-world datasets to validate its soundness.

* 11 pages, 5 figures

Via

Access Paper or Ask Questions