Hong Cheng

All in One: Multi-Task Prompting for Graph Neural Networks

Mar 11, 2024

Xiangguo Sun, Hong Cheng, Jia Li, Bo Liu, Jihong Guan

Abstract: This paper is an extended abstract of our original work published in KDD 2023, where we won the best research paper award (Xiangguo Sun, Hong Cheng, Jia Li, Bo Liu, and Jihong Guan. All in One: Multi-Task Prompting for Graph Neural Networks. KDD 2023). The paper introduces a novel approach to bridging the gap between pre-trained graph models and the diverse tasks they are applied to, inspired by the success of prompt learning in NLP. Recognizing the challenge of aligning pre-trained models with varied graph tasks (node level, edge level, and graph level), which can lead to negative transfer and poor performance, we propose a multi-task prompting method for graphs. This method involves unifying graph and language prompt formats, enabling NLP's prompting strategies to be adapted for graph tasks. By analyzing the task space of graph applications, we reformulate problems to fit graph-level tasks and apply meta-learning to improve prompt initialization for multiple tasks. Experiments show our method's effectiveness in enhancing model performance across different graph tasks. Beyond the original work, in this extended abstract we further discuss graph prompting from a broader perspective and review some of the latest work in this area.

* submitted to IJCAI 2024 Sister Conferences Track. The original paper can be seen at arXiv:2307.01504
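
The abstract above mentions reformulating node-level and edge-level problems as graph-level tasks. One common way to do this, shown here only as a minimal sketch, is to replace each target node with the subgraph induced by its k-hop neighbourhood, so that a node label becomes a graph label. The function name `k_hop_subgraph`, the adjacency-dict input, and the toy graph are illustrative assumptions, not the authors' code.

```python
from collections import deque

def k_hop_subgraph(adj, center, k):
    """Return (nodes, edges) of the subgraph induced by all nodes
    within k hops of `center`. `adj` maps node -> list of neighbours
    (undirected graph, hypothetical input format)."""
    dist = {center: 0}
    queue = deque([center])
    while queue:
        u = queue.popleft()
        if dist[u] == k:
            continue  # do not expand beyond k hops
        for v in adj.get(u, ()):
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    nodes = set(dist)
    # keep each undirected edge once, as (smaller, larger)
    edges = {(u, v) for u in nodes for v in adj.get(u, ()) if v in nodes and u < v}
    return nodes, edges

# Toy path-like graph: 2 - 0 - 1 - 3 - 4
adj = {0: [1, 2], 1: [0, 3], 2: [0], 3: [1, 4], 4: [3]}
nodes, edges = k_hop_subgraph(adj, center=0, k=1)
```

Under this assumption, a node classification example for node 0 becomes a graph classification example on the returned subgraph; an edge-level task could be treated similarly by taking the union of the two endpoints' neighbourhoods.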

All in One and One for All: A Simple yet Effective Method towards Cross-domain Graph Pretraining

Feb 15, 2024

Haihong Zhao, Aochuan Chen, Xiangguo Sun, Hong Cheng, Jia Li

Abstract: Large Language Models (LLMs) have revolutionized the fields of computer vision (CV) and natural language processing (NLP). One of the most notable advancements of LLMs is that a single model is trained on vast and diverse datasets spanning multiple domains, a paradigm we term 'All in One'. This methodology gives LLMs strong generalization capabilities, facilitating an encompassing comprehension of varied data distributions. Leveraging these capabilities, a single LLM demonstrates remarkable versatility across a variety of domains, a paradigm we term 'One for All'. However, applying this idea to the graph field remains a formidable challenge, with cross-domain pretraining often resulting in negative transfer. This issue is particularly important in few-shot learning scenarios, where the paucity of training data necessitates the incorporation of external knowledge sources. In response to this challenge, we propose a novel approach called Graph COordinators for PrEtraining (GCOPE), which harnesses the underlying commonalities across diverse graph datasets to enhance few-shot learning. Our methodology involves a unification framework that amalgamates disparate graph datasets during the pretraining phase to distill and transfer meaningful knowledge to target tasks. Extensive experiments across multiple graph datasets demonstrate the superior efficacy of our approach. By successfully leveraging the synergistic potential of multiple graph datasets for pretraining, our work stands as a pioneering contribution to the realm of graph foundation models.

Graph Prompt Learning: A Comprehensive Survey and Beyond

Nov 28, 2023

Xiangguo Sun, Jiawen Zhang, Xixi Wu, Hong Cheng, Yun Xiong, Jia Li

Abstract: Artificial General Intelligence (AGI) has revolutionized numerous fields, yet its integration with graph data, a cornerstone in our interconnected world, remains nascent. This paper presents a pioneering survey on the emerging domain of graph prompts in AGI, addressing key challenges and opportunities in harnessing graph data for AGI applications. Despite substantial advancements in AGI across natural language processing and computer vision, the application to graph data is relatively underexplored. This survey critically evaluates the current landscape of AGI in handling graph data, highlighting the distinct challenges in cross-modality, cross-domain, and cross-task applications specific to graphs. Our work is the first to propose a unified framework for understanding graph prompt learning, offering clarity on prompt tokens, token structures, and insertion patterns in the graph domain. We delve into the intrinsic properties of graph prompts, exploring their flexibility, expressiveness, and interplay with existing graph models. A comprehensive taxonomy categorizes over 100 works in this field, aligning them with pre-training tasks across node-level, edge-level, and graph-level objectives. Additionally, we present ProG, a Python library, and an accompanying website to support and advance research in graph prompting. The survey culminates in a discussion of current challenges and future directions, offering a roadmap for research in graph prompting within AGI. Through this comprehensive analysis, we aim to catalyze further exploration and practical applications of AGI in graph data, underlining its potential to reshape AGI fields and beyond. ProG and the website can be accessed at https://github.com/sheldonresearch/ProG and https://github.com/WxxShirley/Awesome-Graph-Prompt, respectively.

A Survey of Graph Meets Large Language Model: Progress and Future Directions

Nov 28, 2023

Yuhan Li, Zhixun Li, Peisong Wang, Jia Li, Xiangguo Sun, Hong Cheng, Jeffrey Xu Yu

Abstract: Graphs play a significant role in representing and analyzing complex relationships in real-world applications such as citation networks, social networks, and biological data. Recently, Large Language Models (LLMs), which have achieved tremendous success in various domains, have also been leveraged in graph-related tasks to surpass traditional Graph Neural Network (GNN)-based methods and yield state-of-the-art performance. In this survey, we present a comprehensive review and analysis of existing methods that integrate LLMs with graphs. First, we propose a new taxonomy, which organizes existing methods into three categories based on the role (i.e., enhancer, predictor, and alignment component) played by LLMs in graph-related tasks. Then we systematically survey the representative methods along the three categories of the taxonomy. Finally, we discuss the remaining limitations of existing studies and highlight promising avenues for future research. The relevant papers are summarized and will be consistently updated at: https://github.com/yhLeeee/Awesome-LLMs-in-Graph-tasks.

* Work in progress; 13 pages, 5 figures

Counter-Empirical Attacking based on Adversarial Reinforcement Learning for Time-Relevant Scoring System

Nov 09, 2023

Xiangguo Sun, Hong Cheng, Hang Dong, Bo Qiao, Si Qin, Qingwei Lin

Abstract: Scoring systems are commonly seen on platforms in the era of big data. From credit scoring systems in financial services to membership scores on e-commerce shopping platforms, platform managers use such systems to guide users towards the encouraged activity pattern, and thereby manage resources more effectively and efficiently. To establish such scoring systems, several "empirical criteria" are first determined, followed by a dedicated top-down design for each factor of the score, which usually requires enormous effort to adjust and tune the scoring function in a new application scenario. What's worse, many fresh projects have no ground truth or prior experience with which to evaluate a reasonable scoring system, making the design even harder. To reduce the effort of manually adjusting the scoring function in every new scoring system, we study the scoring system from the preset empirical criteria without any ground truth, and propose a novel framework to improve the system from scratch. In this paper, we propose a "counter-empirical attacking" mechanism that can generate "attacking" behavior traces and try to break the empirical rules of the scoring system. Then an adversarial "enhancer" is applied to evaluate the scoring system and find the improvement strategy. By training on this adversarial learning problem, a proper scoring function can be learned that is robust to the attacking activity traces that try to violate the empirical criteria. Extensive experiments have been conducted on two scoring systems, including a shared computing resource platform and a financial credit system. The experimental results have validated the effectiveness of our proposed framework.

* submitted to TKDE on 08-Jun-2022, received the 1st round decision (major revision) on 20-Apr-2023, submitted to TKDE a 2nd time on 30-May-2023, received the 2nd round decision (major revision) on 30-Sep-2023, submitted to TKDE a 3rd time on 15-Oct-2023, now under the 3rd round of review

Language Agents for Detecting Implicit Stereotypes in Text-to-image Models at Scale

Nov 02, 2023

Qichao Wang, Tian Bian, Yian Yin, Tingyang Xu, Hong Cheng, Helen M. Meng, Zibin Zheng, Liang Chen, Bingzhe Wu

Abstract: The recent surge in research on diffusion models has accelerated the adoption of text-to-image models in various Artificial Intelligence Generated Content (AIGC) commercial products. While these exceptional AIGC products are gaining increasing recognition and sparking enthusiasm among consumers, the questions of whether, when, and how these models might unintentionally reinforce existing societal stereotypes remain largely unaddressed. Motivated by recent advancements in language agents, here we introduce a novel agent architecture tailored for stereotype detection in text-to-image models. This versatile agent architecture is capable of accommodating free-form detection tasks and can autonomously invoke various tools to facilitate the entire process, from generating corresponding instructions and images to detecting stereotypes. We build a stereotype-relevant benchmark based on multiple open-text datasets, and apply this architecture to commercial products and popular open-source text-to-image models. We find that these models often display serious stereotypes when it comes to certain prompts about personal characteristics, social and cultural context, and crime-related aspects. In summary, these empirical findings underscore the pervasive existence of stereotypes across social dimensions, including gender, race, and religion, which not only validate the effectiveness of our proposed approach, but also emphasize the critical necessity of addressing potential ethical risks in the burgeoning realm of AIGC. As AIGC continues its rapid expansion, with new models and plugins emerging daily in staggering numbers, the challenge lies in the timely detection and mitigation of potential biases within these models.

GPU Accelerated Color Correction and Frame Warping for Real-time Video Stitching

Aug 17, 2023

Lu Yang, Zhenglun Kong, Ting Li, Xinyi Bai, Zhiye Lin, Hong Cheng

Abstract: Traditional image stitching focuses on a single panorama frame without considering the spatial-temporal consistency in videos. The straightforward image stitching approach causes temporal flickering and color inconsistency when it is applied to the video stitching task. In addition, inaccurate camera parameters cause artifacts in the image warping. In this paper, we propose a real-time system to stitch multiple video sequences into a panoramic video, which is based on GPU-accelerated color correction and frame warping without accurate camera parameters. We extend the traditional 2D-Matrix (2D-M) color correction approach and present a spatio-temporal 3D-Matrix (3D-M) color correction method for the overlapping local regions, with online color balancing using a piecewise function on global frames. Furthermore, we use pairwise homography matrices given by coarse camera calibration for global warping, followed by accurate local warping based on the optical flow. Experimental results show that our system can generate high-quality panorama videos in real time.
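
To illustrate the global warping step mentioned above, the following is a minimal sketch of how a 3x3 homography matrix maps a pixel coordinate from one frame into another. It covers only the standard projective mapping of a single point; the function name `apply_homography` and the example matrix are illustrative assumptions, and the paper's GPU pipeline and optical-flow refinement are not reproduced here.

```python
def apply_homography(H, x, y):
    """Map point (x, y) through a 3x3 homography H (list of 3 rows).
    Uses homogeneous coordinates and divides by the third component."""
    xh = H[0][0] * x + H[0][1] * y + H[0][2]
    yh = H[1][0] * x + H[1][1] * y + H[1][2]
    w = H[2][0] * x + H[2][1] * y + H[2][2]
    if w == 0:
        raise ValueError("point maps to infinity under this homography")
    return xh / w, yh / w

# Hypothetical homography: a pure translation by (+5, -2) pixels
H_shift = [[1, 0, 5],
           [0, 1, -2],
           [0, 0, 1]]
warped = apply_homography(H_shift, 3, 4)
```

In a stitching system, a mapping of this kind would be evaluated for every pixel of a frame (typically in parallel on the GPU), after which a local refinement such as optical flow can correct residual misalignment.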

Modified Topological Image Preprocessing for Skin Lesion Classifications

Aug 13, 2023

Hong Cheng, Rebekah Leamons, Ahmad Al Shami

Abstract: This paper proposes a modified Topological Data Analysis model for skin image preprocessing and enhancement. The skin lesion dataset HAM10000 is used, with the intention of identifying the important objects in relevant regions of the images. To evaluate both the original dataset and the preprocessed dataset, Deep Convolutional Neural Network and Vision Transformer models were trained on each. After training, the experimental results demonstrate that the models trained on images preprocessed using the Modified Topological Data Analysis consistently perform better.

* Presented at CSCE 2022, The 2022 World Congress in Computer Science, Computer Engineering & Applied Computing, July 25-28, 2022, Las Vegas, USA

Target-point Attention Transformer: A novel trajectory predict network for end-to-end autonomous driving

Aug 03, 2023

Jingyu Du, Yang Zhao, Hong Cheng

Abstract: In the field of autonomous driving, there have been many excellent perception models for object detection, semantic segmentation, and other tasks, but how can we effectively use the perception models for vehicle planning? Traditional autonomous vehicle trajectory prediction methods not only need to obey traffic rules to avoid collisions, but also need to follow the prescribed route to reach the destination. In this paper, we propose a Transformer-based trajectory prediction network for end-to-end autonomous driving without rules, called the Target-point Attention Transformer network (TAT). We use the attention mechanism to realize the interaction between the predicted trajectory and the perception features as well as target points. We demonstrate that our proposed method outperforms existing conditional imitation learning and GRU-based methods, significantly reducing the occurrence of accidents and improving route completion. We evaluate our approach in complex closed-loop driving scenarios in cities using the CARLA simulator and achieve state-of-the-art performance.

* 7 pages, 4 figures, 44 conference

AutoAlign: Fully Automatic and Effective Knowledge Graph Alignment enabled by Large Language Models

Jul 18, 2023

Rui Zhang, Yixin Su, Bayu Distiawan Trisedya, Xiaoyan Zhao, Min Yang, Hong Cheng, Jianzhong Qi

Abstract: The task of entity alignment between knowledge graphs (KGs) aims to identify every pair of entities from two different KGs that represent the same entity. Many machine learning-based methods have been proposed for this task. However, to the best of our knowledge, existing methods all require manually crafted seed alignments, which are expensive to obtain. In this paper, we propose the first fully automatic alignment method, named AutoAlign, which does not require any manually crafted seed alignments. Specifically, for predicate embeddings, AutoAlign constructs a predicate-proximity graph with the help of large language models to automatically capture the similarity between predicates across two KGs. For entity embeddings, AutoAlign first computes the entity embeddings of each KG independently using TransE, and then shifts the two KGs' entity embeddings into the same vector space by computing the similarity between entities based on their attributes. Thus, both predicate alignment and entity alignment can be done without manually crafted seed alignments. AutoAlign is not only fully automatic, but also highly effective. Experiments using real-world KGs show that AutoAlign improves the performance of entity alignment significantly compared to state-of-the-art methods.

* 14 pages, 5 figures, 4 tables. arXiv admin note: substantial text overlap with arXiv:2210.08540
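
The abstract above relies on TransE for the per-KG entity embeddings. As a brief reminder of what that model scores, here is a minimal sketch of the standard TransE plausibility function, which treats a relation as a translation so that head + relation should land near tail. The function name `transe_score` and the toy vectors are illustrative assumptions; AutoAlign's predicate-proximity graph and attribute-based alignment are not shown.

```python
import math

def transe_score(h, r, t):
    """Standard TransE score with the L2 norm: -||h + r - t||.
    Higher (closer to 0) means the triple (h, r, t) is more plausible."""
    return -math.sqrt(sum((hi + ri - ti) ** 2 for hi, ri, ti in zip(h, r, t)))

# Toy 2-d embeddings (hypothetical values)
head = [1.0, 0.0]
relation = [0.0, 1.0]
good_tail = [1.0, 1.0]   # exactly head + relation
bad_tail = [4.0, 5.0]    # far from head + relation

good = transe_score(head, relation, good_tail)
bad = transe_score(head, relation, bad_tail)
```

Training adjusts the embeddings so that observed triples score higher than corrupted ones; the sketch only evaluates the score for fixed vectors.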
