Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yanhui Li

Decoding Hidden Deception in Reasoning LLMs: Activation Explainers for Deception Auditing

Jun 16, 2026

Kexin Chen, Yi Liu, Haonan Zhang, Yanhui Li, Xinyu Deng, Dongxia Wang

Abstract:As LLMs acquire stronger reasoning capabilities, deceptive behavior becomes an increasingly serious safety concern. Existing deception monitors either score visible transcripts or derive scalar probe scores from representation vectors, leaving little inspectable evidence about why a response is suspicious. We introduce STATEWITNESS, an activation explainer for deception auditing. A separate decoder reads a target model's hidden states, then answers natural-language queries or emits structured reports about them. We evaluate STATEWITNESS on two target reasoning LLMs across seven deception datasets. STATEWITNESS reaches 0.916 mean AUROC, a relative gain of 11.6% over the best black-box text monitor and 25.0% over the best activation-probe baseline under the same evaluation protocol. When combined with existing monitors, STATEWITNESS reduces missed deceptive examples in simple threshold ensembles. Beyond scalar detection, the decoder returns query-level answers, schema reports, and token- or sentence-level evidence traces for human inspection. We view this interface as a potential building block for broader interpretability and alignment tools.

* Under review

Via

Access Paper or Ask Questions

Enhancing Serendipity Recommendation System by Constructing Dynamic User Knowledge Graphs with Large Language Models

Aug 06, 2025

Qian Yong, Yanhui Li, Jialiang Shi, Yaguang Dou, Tian Qi

Figure 1 for Enhancing Serendipity Recommendation System by Constructing Dynamic User Knowledge Graphs with Large Language Models

Figure 2 for Enhancing Serendipity Recommendation System by Constructing Dynamic User Knowledge Graphs with Large Language Models

Figure 3 for Enhancing Serendipity Recommendation System by Constructing Dynamic User Knowledge Graphs with Large Language Models

Figure 4 for Enhancing Serendipity Recommendation System by Constructing Dynamic User Knowledge Graphs with Large Language Models

Abstract:The feedback loop in industrial recommendation systems reinforces homogeneous content, creates filter bubble effects, and diminishes user satisfaction. Recently, large language models(LLMs) have demonstrated potential in serendipity recommendation, thanks to their extensive world knowledge and superior reasoning capabilities. However, these models still face challenges in ensuring the rationality of the reasoning process, the usefulness of the reasoning results, and meeting the latency requirements of industrial recommendation systems (RSs). To address these challenges, we propose a method that leverages llm to dynamically construct user knowledge graphs, thereby enhancing the serendipity of recommendation systems. This method comprises a two stage framework:(1) two-hop interest reasoning, where user static profiles and historical behaviors are utilized to dynamically construct user knowledge graphs via llm. Two-hop reasoning, which can enhance the quality and accuracy of LLM reasoning results, is then performed on the constructed graphs to identify users' potential interests; and(2) Near-line adaptation, a cost-effective approach to deploying the aforementioned models in industrial recommendation systems. We propose a u2i (user-to-item) retrieval model that also incorporates i2i (item-to-item) retrieval capabilities, the retrieved items not only exhibit strong relevance to users' newly emerged interests but also retain the high conversion rate of traditional u2i retrieval. Our online experiments on the Dewu app, which has tens of millions of users, indicate that the method increased the exposure novelty rate by 4.62%, the click novelty rate by 4.85%, the average view duration per person by 0.15%, unique visitor click through rate by 0.07%, and unique visitor interaction penetration by 0.30%, enhancing user experience.

* 8 pages

Via

Access Paper or Ask Questions

LightKG: Efficient Knowledge-Aware Recommendations with Simplified GNN Architecture

Jun 12, 2025

Yanhui Li, Dongxia Wang, Zhu Sun, Haonan Zhang, Huizhong Guo

Figure 1 for LightKG: Efficient Knowledge-Aware Recommendations with Simplified GNN Architecture

Figure 2 for LightKG: Efficient Knowledge-Aware Recommendations with Simplified GNN Architecture

Figure 3 for LightKG: Efficient Knowledge-Aware Recommendations with Simplified GNN Architecture

Figure 4 for LightKG: Efficient Knowledge-Aware Recommendations with Simplified GNN Architecture

Abstract:Recently, Graph Neural Networks (GNNs) have become the dominant approach for Knowledge Graph-aware Recommender Systems (KGRSs) due to their proven effectiveness. Building upon GNN-based KGRSs, Self-Supervised Learning (SSL) has been incorporated to address the sparity issue, leading to longer training time. However, through extensive experiments, we reveal that: (1)compared to other KGRSs, the existing GNN-based KGRSs fail to keep their superior performance under sparse interactions even with SSL. (2) More complex models tend to perform worse in sparse interaction scenarios and complex mechanisms, like attention mechanism, can be detrimental as they often increase learning difficulty. Inspired by these findings, we propose LightKG, a simple yet powerful GNN-based KGRS to address sparsity issues. LightKG includes a simplified GNN layer that encodes directed relations as scalar pairs rather than dense embeddings and employs a linear aggregation framework, greatly reducing the complexity of GNNs. Additionally, LightKG incorporates an efficient contrastive layer to implement SSL. It directly minimizes the node similarity in original graph, avoiding the time-consuming subgraph generation and comparison required in previous SSL methods. Experiments on four benchmark datasets show that LightKG outperforms 12 competitive KGRSs in both sparse and dense scenarios while significantly reducing training time. Specifically, it surpasses the best baselines by an average of 5.8\% in recommendation accuracy and saves 84.3\% of training time compared to KGRSs with SSL. Our code is available at https://github.com/1371149/LightKG.

* Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining

Via

Access Paper or Ask Questions

Does Knowledge Graph Really Matter for Recommender Systems?

Apr 04, 2024

Haonan Zhang, Dongxia Wang, Zhu Sun, Yanhui Li, Youcheng Sun, Huizhi Liang, Wenhai Wang

Figure 1 for Does Knowledge Graph Really Matter for Recommender Systems?

Figure 2 for Does Knowledge Graph Really Matter for Recommender Systems?

Figure 3 for Does Knowledge Graph Really Matter for Recommender Systems?

Figure 4 for Does Knowledge Graph Really Matter for Recommender Systems?

Abstract:Recommender systems (RSs) are designed to provide personalized recommendations to users. Recently, knowledge graphs (KGs) have been widely introduced in RSs to improve recommendation accuracy. In this study, however, we demonstrate that RSs do not necessarily perform worse even if the KG is downgraded to the user-item interaction graph only (or removed). We propose an evaluation framework KG4RecEval to systematically evaluate how much a KG contributes to the recommendation accuracy of a KG-based RS, using our defined metric KGER (KG utilization efficiency in recommendation). We consider the scenarios where knowledge in a KG gets completely removed, randomly distorted and decreased, and also where recommendations are for cold-start users. Our extensive experiments on four commonly used datasets and a number of state-of-the-art KG-based RSs reveal that: to remove, randomly distort or decrease knowledge does not necessarily decrease recommendation accuracy, even for cold-start users. These findings inspire us to rethink how to better utilize knowledge from existing KGs, whereby we discuss and provide insights into what characteristics of datasets and KG-based RSs may help improve KG utilization efficiency.

Via

Access Paper or Ask Questions

Causality-Aided Trade-off Analysis for Machine Learning Fairness

May 22, 2023

Zhenlan Ji, Pingchuan Ma, Shuai Wang, Yanhui Li

Figure 1 for Causality-Aided Trade-off Analysis for Machine Learning Fairness

Figure 2 for Causality-Aided Trade-off Analysis for Machine Learning Fairness

Figure 3 for Causality-Aided Trade-off Analysis for Machine Learning Fairness

Figure 4 for Causality-Aided Trade-off Analysis for Machine Learning Fairness

Abstract:There has been an increasing interest in enhancing the fairness of machine learning (ML). Despite the growing number of fairness-improving methods, we lack a systematic understanding of the trade-offs among factors considered in the ML pipeline when fairness-improving methods are applied. This understanding is essential for developers to make informed decisions regarding the provision of fair ML services. Nonetheless, it is extremely difficult to analyze the trade-offs when there are multiple fairness parameters and other crucial metrics involved, coupled, and even in conflict with one another. This paper uses causality analysis as a principled method for analyzing trade-offs between fairness parameters and other crucial metrics in ML pipelines. To ractically and effectively conduct causality analysis, we propose a set of domain-specific optimizations to facilitate accurate causal discovery and a unified, novel interface for trade-off analysis based on well-established causal inference methods. We conduct a comprehensive empirical study using three real-world datasets on a collection of widelyused fairness-improving techniques. Our study obtains actionable suggestions for users and developers of fair ML. We further demonstrate the versatile usage of our approach in selecting the optimal fairness-improving method, paving the way for more ethical and socially responsible AI technologies.

Via

Access Paper or Ask Questions

Measuring Discrimination to Boost Comparative Testing for Multiple Deep Learning Models

Mar 09, 2021

Linghan Meng, Yanhui Li, Lin Chen, Zhi Wang, Di Wu, Yuming Zhou, Baowen Xu

Figure 1 for Measuring Discrimination to Boost Comparative Testing for Multiple Deep Learning Models

Figure 2 for Measuring Discrimination to Boost Comparative Testing for Multiple Deep Learning Models

Figure 3 for Measuring Discrimination to Boost Comparative Testing for Multiple Deep Learning Models

Figure 4 for Measuring Discrimination to Boost Comparative Testing for Multiple Deep Learning Models

Abstract:The boom of DL technology leads to massive DL models built and shared, which facilitates the acquisition and reuse of DL models. For a given task, we encounter multiple DL models available with the same functionality, which are considered as candidates to achieve this task. Testers are expected to compare multiple DL models and select the more suitable ones w.r.t. the whole testing context. Due to the limitation of labeling effort, testers aim to select an efficient subset of samples to make an as precise rank estimation as possible for these models. To tackle this problem, we propose Sample Discrimination based Selection (SDS) to select efficient samples that could discriminate multiple models, i.e., the prediction behaviors (right/wrong) of these samples would be helpful to indicate the trend of model performance. To evaluate SDS, we conduct an extensive empirical study with three widely-used image datasets and 80 real world DL models. The experimental results show that, compared with state-of-the-art baseline methods, SDS is an effective and efficient sample selection method to rank multiple DL models.

* ICSE'2021

Via

Access Paper or Ask Questions

Connecting Software Metrics across Versions to Predict Defects

Dec 28, 2017

Yibin Liu, Yanhui Li, Jianbo Guo, Yuming Zhou, Baowen Xu

Figure 1 for Connecting Software Metrics across Versions to Predict Defects

Figure 2 for Connecting Software Metrics across Versions to Predict Defects

Figure 3 for Connecting Software Metrics across Versions to Predict Defects

Figure 4 for Connecting Software Metrics across Versions to Predict Defects

Abstract:Accurate software defect prediction could help software practitioners allocate test resources to defect-prone modules effectively and efficiently. In the last decades, much effort has been devoted to build accurate defect prediction models, including developing quality defect predictors and modeling techniques. However, current widely used defect predictors such as code metrics and process metrics could not well describe how software modules change over the project evolution, which we believe is important for defect prediction. In order to deal with this problem, in this paper, we propose to use the Historical Version Sequence of Metrics (HVSM) in continuous software versions as defect predictors. Furthermore, we leverage Recurrent Neural Network (RNN), a popular modeling technique, to take HVSM as the input to build software prediction models. The experimental results show that, in most cases, the proposed HVSM-based RNN model has a significantly better effort-aware ranking effectiveness than the commonly used baseline models.

Via

Access Paper or Ask Questions