Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Min-Yen Kan

Columbia University

Incorporating External Knowledge and Goal Guidance for LLM-based Conversational Recommender Systems

May 03, 2024

Chuang Li, Yang Deng, Hengchang Hu, Min-Yen Kan, Haizhou Li

Figure 1 for Incorporating External Knowledge and Goal Guidance for LLM-based Conversational Recommender Systems

Figure 2 for Incorporating External Knowledge and Goal Guidance for LLM-based Conversational Recommender Systems

Figure 3 for Incorporating External Knowledge and Goal Guidance for LLM-based Conversational Recommender Systems

Figure 4 for Incorporating External Knowledge and Goal Guidance for LLM-based Conversational Recommender Systems

Abstract:This paper aims to efficiently enable large language models (LLMs) to use external knowledge and goal guidance in conversational recommender system (CRS) tasks. Advanced LLMs (e.g., ChatGPT) are limited in domain-specific CRS tasks for 1) generating grounded responses with recommendation-oriented knowledge, or 2) proactively leading the conversations through different dialogue goals. In this work, we first analyze those limitations through a comprehensive evaluation, showing the necessity of external knowledge and goal guidance which contribute significantly to the recommendation accuracy and language quality. In light of this finding, we propose a novel ChatCRS framework to decompose the complex CRS task into several sub-tasks through the implementation of 1) a knowledge retrieval agent using a tool-augmented approach to reason over external Knowledge Bases and 2) a goal-planning agent for dialogue goal prediction. Experimental results on two multi-goal CRS datasets reveal that ChatCRS sets new state-of-the-art benchmarks, improving language quality of informativeness by 17% and proactivity by 27%, and achieving a tenfold enhancement in recommendation accuracy.

* Main paper 8 pages; References and Appendix 9 pages; 7 figures and 14 tables

Via

Access Paper or Ask Questions

Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning

May 01, 2024

Yuxi Xie, Anirudh Goyal, Wenyue Zheng, Min-Yen Kan, Timothy P. Lillicrap, Kenji Kawaguchi, Michael Shieh

Figure 1 for Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning

Figure 2 for Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning

Figure 3 for Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning

Figure 4 for Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning

Abstract:We introduce an approach aimed at enhancing the reasoning capabilities of Large Language Models (LLMs) through an iterative preference learning process inspired by the successful strategy employed by AlphaZero. Our work leverages Monte Carlo Tree Search (MCTS) to iteratively collect preference data, utilizing its look-ahead ability to break down instance-level rewards into more granular step-level signals. To enhance consistency in intermediate steps, we combine outcome validation and stepwise self-evaluation, continually updating the quality assessment of newly generated data. The proposed algorithm employs Direct Preference Optimization (DPO) to update the LLM policy using this newly generated step-level preference data. Theoretical analysis reveals the critical importance of using on-policy sampled data for successful self-improving. Extensive evaluations on various arithmetic and commonsense reasoning tasks demonstrate remarkable performance improvements over existing models. For instance, our approach outperforms the Mistral-7B Supervised Fine-Tuning (SFT) baseline on GSM8K, MATH, and SciQ, with substantial percentage increases in accuracy to $80.7\%$ (+$4.8\%$), $32.2\%$ (+$3.3\%$), and $88.5\%$ (+$7.7\%$), respectively. Additionally, our research delves into the training and inference compute tradeoff, providing insights into how our method effectively maximizes performance gains.

Via

Access Paper or Ask Questions

ISQA: Informative Factuality Feedback for Scientific Summarization

Apr 20, 2024

Zekai Li, Yanxia Qin, Qian Liu, Min-Yen Kan

Figure 1 for ISQA: Informative Factuality Feedback for Scientific Summarization

Figure 2 for ISQA: Informative Factuality Feedback for Scientific Summarization

Figure 3 for ISQA: Informative Factuality Feedback for Scientific Summarization

Figure 4 for ISQA: Informative Factuality Feedback for Scientific Summarization

Abstract:We propose Iterative Facuality Refining on Informative Scientific Question-Answering (ISQA) feedback\footnote{Code is available at \url{https://github.com/lizekai-richard/isqa}}, a method following human learning theories that employs model-generated feedback consisting of both positive and negative information. Through iterative refining of summaries, it probes for the underlying rationale of statements to enhance the factuality of scientific summarization. ISQA does this in a fine-grained manner by asking a summarization agent to reinforce validated statements in positive feedback and fix incorrect ones in negative feedback. Our findings demonstrate that the ISQA feedback mechanism significantly improves the factuality of various open-source LLMs on the summarization task, as evaluated across multiple scientific datasets.

* 18 pages, 4 figures

Via

Access Paper or Ask Questions

Discrete Semantic Tokenization for Deep CTR Prediction

Mar 21, 2024

Qijiong Liu, Hengchang Hu, Jiahao Wu, Jieming Zhu, Min-Yen Kan, Xiao-Ming Wu

Abstract:Incorporating item content information into click-through rate (CTR) prediction models remains a challenge, especially with the time and space constraints of industrial scenarios. The content-encoding paradigm, which integrates user and item encoders directly into CTR models, prioritizes space over time. In contrast, the embedding-based paradigm transforms item and user semantics into latent embeddings, subsequently caching them to optimize processing time at the expense of space. In this paper, we introduce a new semantic-token paradigm and propose a discrete semantic tokenization approach, namely UIST, for user and item representation. UIST facilitates swift training and inference while maintaining a conservative memory footprint. Specifically, UIST quantizes dense embedding vectors into discrete tokens with shorter lengths and employs a hierarchical mixture inference module to weigh the contribution of each user--item token pair. Our experimental results on news recommendation showcase the effectiveness and efficiency (about 200-fold space compression) of UIST for CTR prediction.

* TheWebConf 2024 accepted paper

Via

Access Paper or Ask Questions

Beyond Memorization: The Challenge of Random Memory Access in Language Models

Mar 13, 2024

Tongyao Zhu, Qian Liu, Liang Pang, Zhengbao Jiang, Min-Yen Kan, Min Lin

Figure 1 for Beyond Memorization: The Challenge of Random Memory Access in Language Models

Figure 2 for Beyond Memorization: The Challenge of Random Memory Access in Language Models

Figure 3 for Beyond Memorization: The Challenge of Random Memory Access in Language Models

Figure 4 for Beyond Memorization: The Challenge of Random Memory Access in Language Models

Abstract:Recent developments in Language Models (LMs) have shown their effectiveness in NLP tasks, particularly in knowledge-intensive tasks. However, the mechanisms underlying knowledge storage and memory access within their parameters remain elusive. In this paper, we investigate whether a generative LM (e.g., GPT-2) is able to access its memory sequentially or randomly. Through carefully-designed synthetic tasks, covering the scenarios of full recitation, selective recitation and grounded question answering, we reveal that LMs manage to sequentially access their memory while encountering challenges in randomly accessing memorized content. We find that techniques including recitation and permutation improve the random memory access capability of LMs. Furthermore, by applying this intervention to realistic scenarios of open-domain question answering, we validate that enhancing random access by recitation leads to notable improvements in question answering. The code to reproduce our experiments can be found at https://github.com/sail-sg/lm-random-memory-access.

* 8 pages, 4 figures; fixed typos

Via

Access Paper or Ask Questions

NNOSE: Nearest Neighbor Occupational Skill Extraction

Jan 30, 2024

Mike Zhang, Rob van der Goot, Min-Yen Kan, Barbara Plank

Abstract:The labor market is changing rapidly, prompting increased interest in the automatic extraction of occupational skills from text. With the advent of English benchmark job description datasets, there is a need for systems that handle their diversity well. We tackle the complexity in occupational skill datasets tasks -- combining and leveraging multiple datasets for skill extraction, to identify rarely observed skills within a dataset, and overcoming the scarcity of skills across datasets. In particular, we investigate the retrieval-augmentation of language models, employing an external datastore for retrieving similar skills in a dataset-unifying manner. Our proposed method, \textbf{N}earest \textbf{N}eighbor \textbf{O}ccupational \textbf{S}kill \textbf{E}xtraction (NNOSE) effectively leverages multiple datasets by retrieving neighboring skills from other datasets in the datastore. This improves skill extraction \emph{without} additional fine-tuning. Crucially, we observe a performance gain in predicting infrequent patterns, with substantial gains of up to 30\% span-F1 in cross-dataset settings.

* Accepted at EACL 2024 Main

Via

Access Paper or Ask Questions

Lightweight Modality Adaptation to Sequential Recommendation via Correlation Supervision

Jan 14, 2024

Hengchang Hu, Qijiong Liu, Chuang Li, Min-Yen Kan

Figure 1 for Lightweight Modality Adaptation to Sequential Recommendation via Correlation Supervision

Figure 2 for Lightweight Modality Adaptation to Sequential Recommendation via Correlation Supervision

Figure 3 for Lightweight Modality Adaptation to Sequential Recommendation via Correlation Supervision

Figure 4 for Lightweight Modality Adaptation to Sequential Recommendation via Correlation Supervision

Abstract:In Sequential Recommenders (SR), encoding and utilizing modalities in an end-to-end manner is costly in terms of modality encoder sizes. Two-stage approaches can mitigate such concerns, but they suffer from poor performance due to modality forgetting, where the sequential objective overshadows modality representation. We propose a lightweight knowledge distillation solution that preserves both merits: retaining modality information and maintaining high efficiency. Specifically, we introduce a novel method that enhances the learning of embeddings in SR through the supervision of modality correlations. The supervision signals are distilled from the original modality representations, including both (1) holistic correlations, which quantify their overall associations, and (2) dissected correlation types, which refine their relationship facets (honing in on specific aspects like color or shape consistency). To further address the issue of modality forgetting, we propose an asynchronous learning step, allowing the original information to be retained longer for training the representation learning module. Our approach is compatible with various backbone architectures and outperforms the top baselines by 6.8% on average. We empirically demonstrate that preserving original feature associations from modality encoders significantly boosts task-specific recommendation adaptation. Additionally, we find that larger modality encoders (e.g., Large Language Models) contain richer feature sets which necessitate more fine-grained modeling to reach their full performance potential.

* Accepted by ECIR 2024

Via

Access Paper or Ask Questions

ChOiRe: Characterizing and Predicting Human Opinions with Chain of Opinion Reasoning

Nov 15, 2023

Xuan Long Do, Kenji Kawaguchi, Min-Yen Kan, Nancy F. Chen

Figure 1 for ChOiRe: Characterizing and Predicting Human Opinions with Chain of Opinion Reasoning

Figure 2 for ChOiRe: Characterizing and Predicting Human Opinions with Chain of Opinion Reasoning

Figure 3 for ChOiRe: Characterizing and Predicting Human Opinions with Chain of Opinion Reasoning

Figure 4 for ChOiRe: Characterizing and Predicting Human Opinions with Chain of Opinion Reasoning

Abstract:Aligning language models (LMs) with human opinion is challenging yet vital to enhance their grasp of human values, preferences, and beliefs. We present ChOiRe, a four-step solution framework to predict human opinion that differentiates between the user explicit personae (i.e. demographic or ideological attributes) that are manually declared and implicit personae inferred from user historical opinions. Specifically, it consists of (i) an LM analyzing the user explicit personae to filter out irrelevant attributes; (ii) the LM ranking the implicit persona opinions into a preferential list; (iii) Chain-of-Opinion (CoO) reasoning, where the LM sequentially analyzes the explicit personae and the most relevant implicit personae to perform opinion prediction; (iv) and where ChOiRe executes Step (iii) CoO multiple times with increasingly larger lists of implicit personae to overcome insufficient personae information to infer a final result. ChOiRe achieves new state-of-the-art effectiveness with limited inference calls, improving previous LLM-based techniques significantly by 3.22%.

* 17 pages

Via

Access Paper or Ask Questions

CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation

Oct 24, 2023

Minzhi Li, Taiwei Shi, Caleb Ziems, Min-Yen Kan, Nancy F. Chen, Zhengyuan Liu, Diyi Yang

Figure 1 for CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation

Figure 2 for CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation

Figure 3 for CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation

Figure 4 for CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation

Abstract:Annotated data plays a critical role in Natural Language Processing (NLP) in training models and evaluating their performance. Given recent developments in Large Language Models (LLMs), models such as ChatGPT demonstrate zero-shot capability on many text-annotation tasks, comparable with or even exceeding human annotators. Such LLMs can serve as alternatives for manual annotation, due to lower costs and higher scalability. However, limited work has leveraged LLMs as complementary annotators, nor explored how annotation work is best allocated among humans and LLMs to achieve both quality and cost objectives. We propose CoAnnotating, a novel paradigm for Human-LLM co-annotation of unstructured texts at scale. Under this framework, we utilize uncertainty to estimate LLMs' annotation capability. Our empirical study shows CoAnnotating to be an effective means to allocate work from results on different datasets, with up to 21% performance improvement over random baseline. For code implementation, see https://github.com/SALT-NLP/CoAnnotating.

Via

Access Paper or Ask Questions

UNO-DST: Leveraging Unlabelled Data in Zero-Shot Dialogue State Tracking

Oct 16, 2023

Chuang Li, Yan Zhang, Min-Yen Kan, Haizhou Li

Abstract:Previous zero-shot dialogue state tracking (DST) methods only apply transfer learning, but ignore unlabelled data in the target domain. We transform zero-shot DST into few-shot DST by utilising such unlabelled data via joint and self-training methods. Our method incorporates auxiliary tasks that generate slot types as inverse prompts for main tasks, creating slot values during joint training. Cycle consistency between these two tasks enables the generation and selection of quality samples in unknown target domains for subsequent fine-tuning. This approach also facilitates automatic label creation, thereby optimizing the training and fine-tuning of DST models. We demonstrate this method's effectiveness on large language models in zero-shot scenarios, improving average joint goal accuracy by $8\%$ across all domains in MultiWOZ.

* 8 pages, 6 figures, 6 tables

Via

Access Paper or Ask Questions