Zhuo Chen

refer to the report for detailed contributions

Self-Improvement Programming for Temporal Knowledge Graph Question Answering

Apr 02, 2024

Zhuo Chen, Zhao Zhang, Zixuan Li, Fei Wang, Yutao Zeng, Xiaolong Jin, Yongjun Xu

Abstract:Temporal Knowledge Graph Question Answering (TKGQA) aims to answer questions with temporal intent over Temporal Knowledge Graphs (TKGs). The core challenge of this task lies in understanding the complex semantic information regarding multiple types of time constraints (e.g., before, first) in questions. Existing end-to-end methods implicitly model the time constraints by learning time-aware embeddings of questions and candidate answers, which is far from understanding the question comprehensively. Motivated by semantic-parsing-based approaches that explicitly model constraints in questions by generating logical forms with symbolic operators, we design fundamental temporal operators for time constraints and introduce a novel self-improvement Programming method for TKGQA (Prog-TQA). Specifically, Prog-TQA leverages the in-context learning ability of Large Language Models (LLMs) to understand the combinatory time constraints in the questions and generate corresponding program drafts with a few examples given. Then, it aligns these drafts to TKGs with the linking module and subsequently executes them to generate the answers. To enhance the ability to understand questions, Prog-TQA is further equipped with a self-improvement strategy to effectively bootstrap LLMs using high-quality self-generated drafts. Extensive experiments demonstrate the superiority of the proposed Prog-TQA on MultiTQ and CronQuestions datasets, especially in the Hits@1 metric.

* Accepted by LREC-COLING 2024 (long paper)
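
The idea of answering a temporal question by executing a program built from temporal operators can be illustrated with a minimal sketch. The operator names (`match`, `before`, `first`), the `Fact` type, and the toy facts below are assumptions made for illustration; they are not the operator set or the data used by Prog-TQA, and the LLM drafting and linking steps are omitted.

```python
from typing import NamedTuple


class Fact(NamedTuple):
    """One timestamped edge of a toy temporal knowledge graph."""
    subject: str
    relation: str
    obj: str
    year: int


# Made-up facts, for illustration only.
TOY_TKG = [
    Fact("A", "visit", "D", 2012),
    Fact("A", "visit", "B", 2005),
    Fact("A", "visit", "C", 2008),
    Fact("E", "visit", "B", 2001),
]


def match(facts, subject, relation):
    """Keep facts with the given subject and relation."""
    return [f for f in facts if f.subject == subject and f.relation == relation]


def before(facts, year):
    """Time constraint 'before': keep facts strictly earlier than `year`."""
    return [f for f in facts if f.year < year]


def first(facts):
    """Time constraint 'first': return the earliest fact, or None if empty."""
    return min(facts, key=lambda f: f.year) if facts else None


# Program for: "Whom did A visit first, considering only visits before 2010?"
result = first(before(match(TOY_TKG, "A", "visit"), 2010))
answer = result.obj if result else None
print(answer)  # prints: B
```

Composing small operators this way keeps each time constraint explicit, which is the contrast the abstract draws with end-to-end embedding methods.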

Using Interpretation Methods for Model Enhancement

Apr 02, 2024

Zhuo Chen, Chengyue Jiang, Kewei Tu

Abstract:In the age of neural natural language processing, there are plenty of works trying to derive interpretations of neural models. Intuitively, when gold rationales exist during training, one can additionally train the model to match its interpretation with the rationales. However, this intuitive idea has not been fully explored. In this paper, we propose a framework of utilizing interpretation methods and gold rationales to enhance models. Our framework is very general in the sense that it can incorporate various interpretation methods. Previously proposed gradient-based methods can be shown as an instance of our framework. We also propose two novel instances utilizing two other types of interpretation methods, erasure/replace-based and extractor-based methods, for model enhancement. We conduct comprehensive experiments on a variety of tasks. Experimental results show that our framework is effective especially in low-resource settings in enhancing models with various interpretation methods, and our two newly-proposed methods outperform gradient-based methods in most settings. Code is available at https://github.com/Chord-Chen-30/UIMER.

* Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 424-438
* EMNLP 2023

Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphs

Apr 01, 2024

Xiaoze Liu, Feijie Wu, Tianyang Xu, Zhuo Chen, Yichi Zhang, Xiaoqian Wang, Jing Gao

Abstract:The advent of Large Language Models (LLMs) has significantly transformed the AI landscape, enhancing machine learning and AI capabilities. Factuality is a critical concern for LLMs, as they may generate factually incorrect responses. In this paper, we propose GraphEval to evaluate an LLM's performance using a substantially large test dataset. Specifically, the test dataset is retrieved from a large knowledge graph with more than 10 million facts without expensive human effort. Unlike conventional methods that evaluate LLMs based on generated responses, GraphEval streamlines the evaluation process by creating a judge model to estimate the correctness of the answers given by the LLM. Our experiments demonstrate that the judge model's factuality assessment aligns closely with the correctness of the LLM's generated outputs, while also substantially reducing evaluation costs. In addition, our findings offer valuable insights into LLM performance across different metrics and highlight the potential for future improvements in ensuring the factual integrity of LLM outputs. The code is publicly available at https://github.com/xz-liu/GraphEval.

The Power of Noise: Toward a Unified Multi-modal Knowledge Graph Representation Framework

Mar 20, 2024

Zhuo Chen, Yin Fang, Yichi Zhang, Lingbing Guo, Jiaoyan Chen, Huajun Chen, Wen Zhang

Abstract:The advancement of Multi-modal Pre-training highlights the necessity for a robust Multi-Modal Knowledge Graph (MMKG) representation learning framework. This framework is crucial for integrating structured knowledge into multi-modal Large Language Models (LLMs) at scale, aiming to alleviate issues like knowledge misconceptions and multi-modal hallucinations. In this work, to evaluate models' ability to accurately embed entities within MMKGs, we focus on two widely researched tasks: Multi-modal Knowledge Graph Completion (MKGC) and Multi-modal Entity Alignment (MMEA). Building on this foundation, we propose a novel SNAG method that utilizes a Transformer-based architecture equipped with modality-level noise masking for the robust integration of multi-modal entity features in KGs. By incorporating specific training objectives for both MKGC and MMEA, our approach achieves SOTA performance across a total of ten datasets (three for MKGC and seven for MMEA), demonstrating its robustness and versatility. In addition, SNAG can not only function as a standalone model but also enhance other existing methods, providing stable performance improvements. Our code and data are available at: https://github.com/zjukg/SNAG.

* Ongoing work; 10 pages, 6 Tables, 2 Figures; Repo is available at https://github.com/zjukg/SNAG
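
As a rough illustration of what modality-level noise masking can look like, the sketch below blends Gaussian noise into a whole modality vector with some probability. The function name `mask_modalities`, the parameters `mask_prob` and `noise_ratio`, and the blending rule are all assumptions for this example; the abstract does not specify SNAG's actual masking scheme, so this should not be read as the authors' implementation.

```python
import random


def mask_modalities(features, mask_prob, noise_ratio, rng):
    """Blend Gaussian noise into each modality vector with probability mask_prob.

    Hypothetical rule: masked = (1 - noise_ratio) * x + noise_ratio * noise.
    """
    masked = {}
    for name, vector in features.items():
        if rng.random() < mask_prob:
            # The whole modality is perturbed, not individual dimensions.
            masked[name] = [
                (1.0 - noise_ratio) * x + noise_ratio * rng.gauss(0.0, 1.0)
                for x in vector
            ]
        else:
            masked[name] = list(vector)
    return masked


rng = random.Random(0)  # fixed seed so the sketch is repeatable
entity = {"image": [0.2, 0.4, 0.6], "text": [0.1, 0.3, 0.5]}

# mask_prob=0.0 never masks; mask_prob=1.0 masks every modality.
untouched = mask_modalities(entity, mask_prob=0.0, noise_ratio=0.5, rng=rng)
noisy = mask_modalities(entity, mask_prob=1.0, noise_ratio=0.5, rng=rng)
```

Masking at the level of a whole modality, rather than per dimension, is what would force a model to cope with an entire unreliable input channel.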

Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey

Feb 26, 2024

Zhuo Chen, Yichi Zhang, Yin Fang, Yuxia Geng, Lingbing Guo, Xiang Chen, Qian Li, Wen Zhang, Jiaoyan Chen, Yushan Zhu(+5 more)

Abstract:Knowledge Graphs (KGs) play a pivotal role in advancing various AI applications, with the semantic web community's exploration into multi-modal dimensions unlocking new avenues for innovation. In this survey, we carefully review over 300 articles, focusing on KG-aware research in two principal aspects: KG-driven Multi-Modal (KG4MM) learning, where KGs support multi-modal tasks, and Multi-Modal Knowledge Graph (MM4KG), which extends KG studies into the MMKG realm. We begin by defining KGs and MMKGs, then explore their construction progress. Our review includes two primary task categories: KG-aware multi-modal learning tasks, such as Image Classification and Visual Question Answering, and intrinsic MMKG tasks like Multi-modal Knowledge Graph Completion and Entity Alignment, highlighting specific research trajectories. For most of these tasks, we provide definitions, evaluation benchmarks, and additionally outline essential insights for conducting relevant research. Finally, we discuss current challenges and identify emerging trends, such as progress in Large Language Modeling and Multi-modal Pre-training strategies. This survey aims to serve as a comprehensive reference for researchers already involved in or considering delving into KG and multi-modal learning research, offering insights into the evolving landscape of MMKG research and supporting future work.

* Ongoing work; 41 pages (Main Text), 55 pages (Total), 11 Tables, 13 Figures, 619 citations; Paper list is available at https://github.com/zjukg/KG-MM-Survey

Unleashing the Power of Imbalanced Modality Information for Multi-modal Knowledge Graph Completion

Feb 22, 2024

Yichi Zhang, Zhuo Chen, Lei Liang, Huajun Chen, Wen Zhang

Abstract:Multi-modal knowledge graph completion (MMKGC) aims to predict the missing triples in the multi-modal knowledge graphs by incorporating structural, visual, and textual information of entities into the discriminant models. The information from different modalities will work together to measure the triple plausibility. Existing MMKGC methods overlook the imbalance problem of modality information among entities, resulting in inadequate modal fusion and inefficient utilization of the raw modality information. To address the mentioned problems, we propose Adaptive Multi-modal Fusion and Modality Adversarial Training (AdaMF-MAT) to unleash the power of imbalanced modality information for MMKGC. AdaMF-MAT achieves multi-modal fusion with adaptive modality weights and further generates adversarial samples by modality-adversarial training to enhance the imbalanced modality information. Our approach is a co-design of the MMKGC model and training strategy which can outperform 19 recent MMKGC methods and achieve new state-of-the-art results on three public MMKGC benchmarks. Our code and data have been released at https://github.com/zjukg/AdaMF-MAT.

* Accepted by LREC-COLING 2024
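
Fusion with adaptive modality weights can be pictured as a weighted sum whose weights come from a softmax over per-modality scores. The sketch below assumes that form; the names `softmax` and `fuse`, the hand-set scores, and the toy embeddings are illustrative only. In AdaMF-MAT the weights are adaptive, whereas here the scores are fixed placeholders and no adversarial training is shown.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    shift = max(scores)
    exps = [math.exp(s - shift) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse(embeddings, scores):
    """Weighted sum of modality embeddings using softmax-normalised scores."""
    names = list(embeddings)
    weights = softmax([scores[name] for name in names])
    dim = len(embeddings[names[0]])
    fused = [0.0] * dim
    for name, weight in zip(names, weights):
        for i, x in enumerate(embeddings[name]):
            fused[i] += weight * x
    return fused, dict(zip(names, weights))


# Toy 2-dimensional embeddings for one entity (made up for illustration).
embeddings = {
    "structure": [1.0, 0.0],
    "image": [0.0, 1.0],
    "text": [1.0, 1.0],
}

# Equal scores give equal weights, so fusion reduces to a plain average.
equal_fused, equal_weights = fuse(
    embeddings, {"structure": 0.0, "image": 0.0, "text": 0.0}
)

# A higher score for one modality shifts weight toward it.
_, skewed_weights = fuse(
    embeddings, {"structure": 0.0, "image": 2.0, "text": 0.0}
)
```

With per-entity scores, an entity whose image is missing or uninformative could receive a low image weight while another entity relies on it heavily, which is the imbalance the abstract describes.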

ChatCell: Facilitating Single-Cell Analysis with Natural Language

Feb 20, 2024

Yin Fang, Kangwei Liu, Ningyu Zhang, Xinle Deng, Penghui Yang, Zhuo Chen, Xiangru Tang, Mark Gerstein, Xiaohui Fan, Huajun Chen

Abstract:As Large Language Models (LLMs) rapidly evolve, their influence in science is becoming increasingly prominent. The emerging capabilities of LLMs in task generalization and free-form dialogue can significantly advance fields like chemistry and biology. However, the field of single-cell biology, which forms the foundational building blocks of living organisms, still faces several challenges. High knowledge barriers and limited scalability in current methods restrict the full exploitation of LLMs in mastering single-cell data, impeding direct accessibility and rapid iteration. To this end, we introduce ChatCell, which signifies a paradigm shift by facilitating single-cell analysis with natural language. Leveraging vocabulary adaptation and unified sequence generation, ChatCell has acquired profound expertise in single-cell biology and the capability to accommodate a diverse range of analysis tasks. Extensive experiments further demonstrate ChatCell's robust performance and potential to deepen single-cell insights, paving the way for more accessible and intuitive exploration in this pivotal field. Our project homepage is available at https://zjunlp.github.io/project/ChatCell.

* I have decided to temporarily withdraw this draft as I am in the process of making further revisions to improve its content. Code: https://github.com/zjunlp/ChatCell Dataset: https://huggingface.co/datasets/zjunlp/ChatCell-Instructions Demo: https://chat.openai.com/g/g-vUwj222gQ-chatcell

ASGEA: Exploiting Logic Rules from Align-Subgraphs for Entity Alignment

Feb 16, 2024

Yangyifei Luo, Zhuo Chen, Lingbing Guo, Qian Li, Wenxuan Zeng, Zhixin Cai, Jianxin Li

Abstract:Entity alignment (EA) aims to identify entities across different knowledge graphs that represent the same real-world objects. Recent embedding-based EA methods have achieved state-of-the-art performance in EA yet faced interpretability challenges as they purely rely on the embedding distance and neglect the logic rules behind a pair of aligned entities. In this paper, we propose the Align-Subgraph Entity Alignment (ASGEA) framework to exploit logic rules from Align-Subgraphs. ASGEA uses anchor links as bridges to construct Align-Subgraphs and spreads along the paths across KGs, which distinguishes it from the embedding-based methods. Furthermore, we design an interpretable Path-based Graph Neural Network, ASGNN, to effectively identify and integrate the logic rules across KGs. We also introduce a node-level multi-modal attention mechanism coupled with multi-modal enriched anchors to augment the Align-Subgraph. Our experimental results demonstrate the superior performance of ASGEA over the existing embedding-based methods in both EA and Multi-Modal EA (MMEA) tasks. Our code will be available soon.

* Ongoing work; 16 pages, 9 Tables, 8 Figures

Power Transformer Fault Prediction Based on Knowledge Graphs

Feb 11, 2024

Chao Wang, Zhuo Chen, Ziyan Zhang, Chiyi Li, Kai Song

Abstract:In this paper, we address the challenge of learning with limited fault data for power transformers. Traditional operation and maintenance tools lack effective predictive capabilities for potential faults. The scarcity of extensive fault data makes it difficult to apply machine learning techniques effectively. To solve this problem, we propose a novel approach that leverages the knowledge graph (KG) technology in combination with gradient boosting decision trees (GBDT). This method is designed to efficiently learn from a small set of high-dimensional data, integrating various factors influencing transformer faults and historical operational data. Our approach enables accurate safe state assessments and fault analyses of power transformers despite the limited fault characteristic data. Experimental results demonstrate that this method outperforms other learning approaches in prediction accuracy, such as artificial neural networks (ANN) and logistic regression (LR). Furthermore, it offers significant improvements in progressiveness, practicality, and potential for widespread application.

StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion

Feb 07, 2024

Zhichao Wang, Yuanzhe Chen, Xinsheng Wang, Zhuo Chen, Lei Xie, Yuping Wang, Yuxuan Wang

Abstract:Recent language model (LM) advancements have showcased impressive zero-shot voice conversion (VC) performance. However, existing LM-based VC models usually apply offline conversion from source semantics to acoustic features, demanding the complete source speech, and limiting their deployment to real-time applications. In this paper, we introduce StreamVoice, a novel streaming LM-based model for zero-shot VC, facilitating real-time conversion given arbitrary speaker prompts and source speech. Specifically, to enable streaming capability, StreamVoice employs a fully causal context-aware LM with a temporal-independent acoustic predictor, while alternately processing semantic and acoustic features at each time step of autoregression which eliminates the dependence on complete source speech. To address the potential performance degradation from the incomplete context in streaming processing, we enhance the context-awareness of the LM through two strategies: 1) teacher-guided context foresight, using a teacher model to summarize the present and future semantic context during training to guide the model's forecasting for missing context; 2) semantic masking strategy, promoting acoustic prediction from preceding corrupted semantic and acoustic input, enhancing context-learning ability. Notably, StreamVoice is the first LM-based streaming zero-shot VC model without any future look-ahead. Experimental results demonstrate StreamVoice's streaming conversion capability while maintaining zero-shot performance comparable to non-streaming VC systems.

* There is an error in the submitted version. The author and institution information needs to be modified, and the company requires re-examination before submission
