Sunwoo Kim

A Survey on Hypergraph Neural Networks: An In-Depth and Step-By-Step Guide

Apr 01, 2024

Sunwoo Kim, Soo Yong Lee, Yue Gao, Alessia Antelmi, Mirko Polato, Kijung Shin

Abstract:Higher-order interactions (HOIs) are ubiquitous in real-world complex systems and applications, and thus investigation of deep learning for HOIs has become a valuable agenda for the data mining and machine learning communities. As networks of HOIs are expressed mathematically as hypergraphs, hypergraph neural networks (HNNs) have emerged as a powerful tool for representation learning on hypergraphs. Given the emerging trend, we present the first survey dedicated to HNNs, with an in-depth and step-by-step guide. Broadly, the present survey overviews HNN architectures, training strategies, and applications. First, we break existing HNNs down into four design components: (i) input features, (ii) input structures, (iii) message-passing schemes, and (iv) training strategies. Second, we examine how HNNs address and learn HOIs with each of their components. Third, we overview the recent applications of HNNs in recommendation, biological and medical science, time series analysis, and computer vision. Lastly, we conclude with a discussion on limitations and future directions.

HypeBoy: Generative Self-Supervised Representation Learning on Hypergraphs

Mar 31, 2024

Sunwoo Kim, Shinhwan Kang, Fanchen Bu, Soo Yong Lee, Jaemin Yoo, Kijung Shin

Abstract:Hypergraphs are marked by complex topology, expressing higher-order interactions among multiple nodes with hyperedges, and better capturing the topology is essential for effective representation learning. Recent advances in generative self-supervised learning (SSL) suggest that hypergraph neural networks learned from generative self-supervision have the potential to effectively encode the complex hypergraph topology. Designing a generative SSL strategy for hypergraphs, however, is not straightforward. Questions remain with regard to its generative SSL task, connection to downstream tasks, and empirical properties of learned representations. In light of the promises and challenges, we propose a novel generative SSL strategy for hypergraphs. We first formulate a generative SSL task on hypergraphs, hyperedge filling, and highlight its theoretical connection to node classification. Based on the generative SSL task, we propose a hypergraph SSL method, HypeBoy. HypeBoy learns effective general-purpose hypergraph representations, outperforming 16 baseline methods across 11 benchmark datasets.

* Published as a conference paper at ICLR 2024

FlowerFormer: Empowering Neural Architecture Encoding using a Flow-aware Graph Transformer

Mar 21, 2024

Dongyeong Hwang, Hyunju Kim, Sunwoo Kim, Kijung Shin

Abstract:The success of a specific neural network architecture is closely tied to the dataset and task it tackles; there is no one-size-fits-all solution. Thus, considerable efforts have been made to quickly and accurately estimate the performances of neural architectures, without full training or evaluation, for given tasks and datasets. Neural architecture encoding has played a crucial role in the estimation, and graph-based methods, which treat an architecture as a graph, have shown prominent performance. For enhanced representation learning of neural architectures, we introduce FlowerFormer, a powerful graph transformer that incorporates the information flows within a neural architecture. FlowerFormer consists of two key components: (a) bidirectional asynchronous message passing, inspired by the flows; (b) global attention built on flow-based masking. Our extensive experiments demonstrate the superiority of FlowerFormer over existing neural encoding methods, and its effectiveness extends beyond computer vision models to include graph neural networks and automatic speech recognition models. Our code is available at http://github.com/y0ngjaenius/CVPR2024_FLOWERFormer.

* CVPR 2024 Camera-Ready

SLADE: Detecting Dynamic Anomalies in Edge Streams without Labels via Self-Supervised Learning

Feb 19, 2024

Jongha Lee, Sunwoo Kim, Kijung Shin

Abstract:To detect anomalies in real-world graphs, such as social, email, and financial networks, various approaches have been developed. While they typically assume static input graphs, most real-world graphs grow over time, naturally represented as edge streams. In this context, we aim to achieve three goals: (a) instantly detecting anomalies as they occur, (b) adapting to dynamically changing states, and (c) handling the scarcity of dynamic anomaly labels. In this paper, we propose SLADE (Self-supervised Learning for Anomaly Detection in Edge Streams) for rapid detection of dynamic anomalies in edge streams, without relying on labels. SLADE detects the shifts of nodes into abnormal states by observing deviations in their interaction patterns over time. To this end, it trains a deep neural network to perform two self-supervised tasks: (a) minimizing drift in node representations and (b) generating long-term interaction patterns from short-term ones. Failure in these tasks for a node signals its deviation from the norm. Notably, the neural network and tasks are carefully designed so that all required operations can be performed in constant time (w.r.t. the graph size) in response to each new edge in the input stream. In dynamic anomaly detection across four real-world datasets, SLADE outperforms nine competing methods, even those leveraging label supervision.

* 15 pages, 6 figures

Feature Distribution on Graph Topology Mediates the Effect of Graph Convolution: Homophily Perspective

Feb 07, 2024

Soo Yong Lee, Sunwoo Kim, Fanchen Bu, Jaemin Yoo, Jiliang Tang, Kijung Shin

Abstract:How would randomly shuffling feature vectors among nodes from the same class affect graph neural networks (GNNs)? The feature shuffle, intuitively, perturbs the dependence between graph topology and features (A-X dependence) for GNNs to learn from. Surprisingly, we observe a consistent and significant improvement in GNN performance following the feature shuffle. Having overlooked the impact of A-X dependence on GNNs, the prior literature does not provide a satisfactory understanding of the phenomenon. Thus, we raise two research questions. First, how should A-X dependence be measured, while controlling for potential confounds? Second, how does A-X dependence affect GNNs? In response, we (i) propose a principled measure for A-X dependence, (ii) design a random graph model that controls A-X dependence, (iii) establish a theory on how A-X dependence relates to graph convolution, and (iv) present empirical analysis on real-world graphs that aligns with the theory. We conclude that A-X dependence mediates the effect of graph convolution, such that smaller dependence improves GNN-based node classification.
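The feature shuffle described in this abstract can be illustrated with a minimal sketch. It assumes node features are given as a list of vectors and class labels as a parallel list; the function name `shuffle_features_within_class` and the toy data are hypothetical and are not taken from the paper's code.

```python
import random
from collections import defaultdict


def shuffle_features_within_class(features, labels, seed=0):
    """Return a copy of `features` in which vectors are permuted
    only among nodes that share the same class label."""
    rng = random.Random(seed)

    # Group node indices by class label.
    nodes_by_class = defaultdict(list)
    for node, label in enumerate(labels):
        nodes_by_class[label].append(node)

    shuffled = list(features)
    for nodes in nodes_by_class.values():
        permuted = nodes[:]
        rng.shuffle(permuted)
        # Node `target` receives the feature vector of node `source`,
        # where both nodes belong to the same class.
        for target, source in zip(nodes, permuted):
            shuffled[target] = features[source]
    return shuffled


# Toy data (hypothetical): four nodes, two classes.
features = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
labels = [0, 0, 1, 1]
shuffled = shuffle_features_within_class(features, labels, seed=1)
```

The graph structure is left untouched; only the assignment of feature vectors to nodes changes, and each class keeps exactly the same set of vectors.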

Channel Estimation for Reconfigurable Intelligent Surface Aided mmWave MU-MIMO Systems : Hybrid Receiver Architectures

Jan 13, 2024

Jeongjae Lee, Hyeongjin Chung, Sunwoo Kim, Songnam Hong

Abstract:Channel estimation is one of the key challenges for the deployment of reconfigurable intelligent surface (RIS)-aided communication systems. In this paper, we study the channel estimation problem of RIS-aided mmWave multi-user multiple-input multiple-output (MU-MIMO) systems, especially when a hybrid receiver architecture is adopted. For this system, we propose a simple yet efficient channel estimation method using the fact that cascaded channels (to be estimated) have a low-dimensional common column space. In the proposed method, the reflection vectors at the RIS and the RF combining matrices at the BS are designed such that the training observations are suitable for estimating the common column space and the user-specific coefficient matrices via a collaborative low-rank approximation. Via simulations, we demonstrate the effectiveness of the proposed channel estimation method compared with the state-of-the-art ones.

* IEEE ICC 2024 Sixth International Workshop on Next Generation Antenna Technologies for Future Wireless Networks: Extra Large-MIMO, Reconfigurable Intelligent Surfaces, and Cell-Free Massive MIMO

Classification of Edge-dependent Labels of Nodes in Hypergraphs

Jun 05, 2023

Minyoung Choe, Sunwoo Kim, Jaemin Yoo, Kijung Shin

Abstract:A hypergraph is a data structure composed of nodes and hyperedges, where each hyperedge is an any-sized subset of nodes. Due to the flexibility in hyperedge size, hypergraphs represent group interactions (e.g., co-authorship by more than two authors) more naturally and accurately than ordinary graphs. Interestingly, many real-world systems modeled as hypergraphs contain edge-dependent node labels, i.e., node labels that vary depending on hyperedges. For example, on co-authorship datasets, the same author (i.e., a node) can be the primary author in a paper (i.e., a hyperedge) but the corresponding author in another paper (i.e., another hyperedge). In this work, we introduce a classification of edge-dependent node labels as a new problem. This problem can be used as a benchmark task for hypergraph neural networks, which recently have attracted great attention, and also the usefulness of edge-dependent node labels has been verified in various applications. To tackle this problem, we propose WHATsNet, a novel hypergraph neural network that represents the same node differently depending on the hyperedges it participates in by reflecting its varying importance in the hyperedges. To this end, WHATsNet models the relations between nodes within each hyperedge, using their relative centrality as positional encodings. In our experiments, we demonstrate that WHATsNet significantly and consistently outperforms ten competitors on six real-world hypergraphs, and we also show successful applications of WHATsNet to (a) ranking aggregation, (b) node clustering, and (c) product return prediction.

* Accepted to KDD 2023
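
The hypergraph definition and the co-authorship example in this abstract can be sketched as a small data structure. This is an illustration only: the paper and author names, the `middle` label, and the helper `labels_of` are hypothetical, and none of this is WHATsNet itself.

```python
# Each hyperedge (a paper) is an any-sized subset of nodes (authors).
hyperedges = {
    "paper_1": {"alice", "bob", "carol"},
    "paper_2": {"alice", "dave"},
}

# Edge-dependent node labels: the label is attached to a
# (hyperedge, node) pair rather than to the node alone.
edge_dependent_labels = {
    ("paper_1", "alice"): "primary",
    ("paper_1", "bob"): "middle",
    ("paper_1", "carol"): "corresponding",
    ("paper_2", "alice"): "corresponding",
    ("paper_2", "dave"): "primary",
}


def labels_of(node):
    """Collect the label of `node` in every hyperedge that contains it."""
    return {
        edge: label
        for (edge, member), label in edge_dependent_labels.items()
        if member == node
    }


# The same author carries different labels in different papers.
alice_labels = labels_of("alice")
```

Keying the labels by (hyperedge, node) pairs is one simple way to make the prediction target explicit: a classifier for this problem outputs one label per pair, not one label per node.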

User-friendly Image Editing with Minimal Text Input: Leveraging Captioning and Injection Techniques

Jun 05, 2023

Sunwoo Kim, Wooseok Jang, Hyunsu Kim, Junho Kim, Yunjey Choi, Seungryong Kim, Gayeong Lee

Abstract:Recent text-driven image editing in diffusion models has shown remarkable success. However, the existing methods assume that the user's description sufficiently grounds the contexts in the source image, such as objects, background, style, and their relations. This assumption is unsuitable for real-world applications because users have to manually engineer text prompts to find optimal descriptions for different images. From the users' standpoint, prompt engineering is a labor-intensive process, and users prefer to provide a target word for editing instead of a full sentence. To address this problem, we first demonstrate the importance of a detailed text description of the source image, by dividing prompts into three categories based on the level of semantic details. Then, we propose simple yet effective methods by combining prompt generation frameworks, thereby making the prompt engineering process more user-friendly. Extensive qualitative and quantitative experiments demonstrate the importance of prompts in text-driven image editing and our method is comparable to ground-truth prompts.

DiffMatch: Diffusion Model for Dense Matching

May 30, 2023

Jisu Nam, Gyuseong Lee, Sunwoo Kim, Hyeonsu Kim, Hyoungwon Cho, Seyeon Kim, Seungryong Kim

Abstract:The objective for establishing dense correspondence between paired images consists of two terms: a data term and a prior term. While conventional techniques focused on defining hand-designed prior terms, which are difficult to formulate, recent approaches have focused on learning the data term with deep neural networks without explicitly modeling the prior, assuming that the model itself has the capacity to learn an optimal prior from a large-scale dataset. Although the performance improvement was obvious, they often fail to address inherent ambiguities of matching, such as textureless regions, repetitive patterns, and large displacements. To address this, we propose DiffMatch, a novel conditional diffusion-based framework designed to explicitly model both the data and prior terms. Unlike previous approaches, this is accomplished by leveraging a conditional denoising diffusion model. DiffMatch consists of two main components: a conditional denoising diffusion module and a cost injection module. We stabilize the training process and reduce memory usage with a stage-wise training strategy. Furthermore, to boost performance, we introduce an inference technique that finds a better path to the accurate matching field. Our experimental results demonstrate significant performance improvements of our method over existing approaches, and the ablation studies validate our design choices along with the effectiveness of each component. Project page is available at https://ku-cvlab.github.io/DiffMatch/.

* Project page is available at https://ku-cvlab.github.io/DiffMatch/

Semantic-Preserving Augmentation for Robust Image-Text Retrieval

Mar 10, 2023

Sunwoo Kim, Kyuhong Shim, Luong Trung Nguyen, Byonghyo Shim

Abstract:Image-text retrieval is a task to search for the proper textual descriptions of the visual world and vice versa. One challenge of this task is the vulnerability to input image and text corruptions. Such corruptions are often unobserved during training, and degrade the retrieval model's decision quality substantially. In this paper, we propose a novel image-text retrieval technique, referred to as robust visual semantic embedding (RVSE), which consists of novel image-based and text-based augmentation techniques called semantic-preserving augmentation for image (SPAugI) and text (SPAugT). Since SPAugI and SPAugT change the original data in a way that its semantic information is preserved, we enforce the feature extractors to generate semantic-aware embedding vectors regardless of the corruption, improving the model robustness significantly. From extensive experiments using benchmark datasets, we show that RVSE outperforms conventional retrieval schemes in terms of image-text retrieval performance.

* Accepted to ICASSP 2023
