Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hyunsoo Kim

When Model Knowledge meets Diffusion Model: Diffusion-assisted Data-free Image Synthesis with Alignment of Domain and Class

Jun 18, 2025

Yujin Kim, Hyunsoo Kim, Hyunwoo J. Kim, Suhyun Kim

Abstract:Open-source pre-trained models hold great potential for diverse applications, but their utility declines when their training data is unavailable. Data-Free Image Synthesis (DFIS) aims to generate images that approximate the learned data distribution of a pre-trained model without accessing the original data. However, existing DFIS meth ods produce samples that deviate from the training data distribution due to the lack of prior knowl edge about natural images. To overcome this limitation, we propose DDIS, the first Diffusion-assisted Data-free Image Synthesis method that leverages a text-to-image diffusion model as a powerful image prior, improving synthetic image quality. DDIS extracts knowledge about the learned distribution from the given model and uses it to guide the diffusion model, enabling the generation of images that accurately align with the training data distribution. To achieve this, we introduce Domain Alignment Guidance (DAG) that aligns the synthetic data domain with the training data domain during the diffusion sampling process. Furthermore, we optimize a single Class Alignment Token (CAT) embedding to effectively capture class-specific attributes in the training dataset. Experiments on PACS and Ima geNet demonstrate that DDIS outperforms prior DFIS methods by generating samples that better reflect the training data distribution, achieving SOTA performance in data-free applications.

* Published at ICML 2025

Via

Access Paper or Ask Questions

Difference Inversion: Interpolate and Isolate the Difference with Token Consistency for Image Analogy Generation

Jun 09, 2025

Hyunsoo Kim, Donghyun Kim, Suhyun Kim

Abstract:How can we generate an image B' that satisfies A:A'::B:B', given the input images A,A' and B? Recent works have tackled this challenge through approaches like visual in-context learning or visual instruction. However, these methods are typically limited to specific models (e.g. InstructPix2Pix. Inpainting models) rather than general diffusion models (e.g. Stable Diffusion, SDXL). This dependency may lead to inherited biases or lower editing capabilities. In this paper, we propose Difference Inversion, a method that isolates only the difference from A and A' and applies it to B to generate a plausible B'. To address model dependency, it is crucial to structure prompts in the form of a "Full Prompt" suitable for input to stable diffusion models, rather than using an "Instruction Prompt". To this end, we accurately extract the Difference between A and A' and combine it with the prompt of B, enabling a plug-and-play application of the difference. To extract a precise difference, we first identify it through 1) Delta Interpolation. Additionally, to ensure accurate training, we propose the 2) Token Consistency Loss and 3) Zero Initialization of Token Embeddings. Our extensive experiments demonstrate that Difference Inversion outperforms existing baselines both quantitatively and qualitatively, indicating its ability to generate more feasible B' in a model-agnostic manner.

* Published at CVPR 2025

Via

Access Paper or Ask Questions

Ultra-reliable urban air mobility networks

Oct 23, 2024

Hyunsoo Kim

Figure 1 for Ultra-reliable urban air mobility networks

Figure 2 for Ultra-reliable urban air mobility networks

Figure 3 for Ultra-reliable urban air mobility networks

Figure 4 for Ultra-reliable urban air mobility networks

Abstract:Recently, urban air mobility (UAM) has attracted attention as an emerging technology that will bring innovation to urban transportation and aviation systems. Since the UAM systems pursue fully autonomous flight without a pilot, wireless communication is a key function not only for flight control signals, but also for navigation and safety information. The essential information is called a command and control (C2) message, and the UAM networks must be configured so that the UAM can receive the C2 message by securing a continuous link stability without any interruptions. Nevertheless, a lot of prior works have focused only on improving the average performance without solving the low-reliability in the cell edges and coverage holes of urban areas. In this dissertation, we identify the factors that hinder the communication link reliability in considering three-dimensional (3D) urban environments, and propose a antenna configuration, resource utilization, and transmission strategy to enable UAM receiving C2 messages regardless of time and space. First, through stochastic geometry modeling, we analyze the signal blockage effects caused by the urban buildings. The blockage probability is calculated according to the shape, height, and density of the buildings, and the coverage probability of the received signal is derived by reflecting the blockage events. Furthermore, the low-reliability area is identified by analyzing the coverage performance according to the positions of the UAMs. To overcome the low-reliability region, we propose three methods for UAM network operation: i) optimization of antennas elevation tilting, ii) frequency reuse with multi-layered narrow beam, and iii) assistive transmissions by the master UAM.

* PhD thesis, 64 pages, 24 figures, 3 tables

Via

Access Paper or Ask Questions

MARS: Matching Attribute-aware Representations for Text-based Sequential Recommendation

Sep 04, 2024

Hyunsoo Kim, Junyoung Kim, Minjin Choi, Sunkyung Lee, Jongwuk Lee

Abstract:Sequential recommendation aims to predict the next item a user is likely to prefer based on their sequential interaction history. Recently, text-based sequential recommendation has emerged as a promising paradigm that uses pre-trained language models to exploit textual item features to enhance performance and facilitate knowledge transfer to unseen datasets. However, existing text-based recommender models still struggle with two key challenges: (i) representing users and items with multiple attributes, and (ii) matching items with complex user interests. To address these challenges, we propose a novel model, Matching Attribute-aware Representations for Text-based Sequential Recommendation (MARS). MARS extracts detailed user and item representations through attribute-aware text encoding, capturing diverse user intents with multiple attribute-aware representations. It then computes user-item scores via attribute-wise interaction matching, effectively capturing attribute-level user preferences. Our extensive experiments demonstrate that MARS significantly outperforms existing sequential models, achieving improvements of up to 24.43% and 29.26% in Recall@10 and NDCG@10 across five benchmark datasets. Code is available at https://github.com/junieberry/MARS

* CIKM 2024

Via

Access Paper or Ask Questions

Node Embedding for Homophilous Graphs with ARGEW: Augmentation of Random walks by Graph Edge Weights

Aug 11, 2023

Jun Hee Kim, Jaeman Son, Hyunsoo Kim, Eunjo Lee

Abstract:Representing nodes in a network as dense vectors node embeddings is important for understanding a given network and solving many downstream tasks. In particular, for weighted homophilous graphs where similar nodes are connected with larger edge weights, we desire node embeddings where node pairs with strong weights have closer embeddings. Although random walk based node embedding methods like node2vec and node2vec+ do work for weighted networks via including edge weights in the walk transition probabilities, our experiments show that the embedding result does not adequately reflect edge weights. In this paper, we propose ARGEW (Augmentation of Random walks by Graph Edge Weights), a novel augmentation method for random walks that expands the corpus in such a way that nodes with larger edge weights end up with closer embeddings. ARGEW can work with any random walk based node embedding method, because it is independent of the random sampling strategy itself and works on top of the already-performed walks. With several real-world networks, we demonstrate that with ARGEW, compared to not using it, the desired pattern that node pairs with larger edge weights have closer embeddings is much clearer. We also examine ARGEW's performance in node classification: node2vec with ARGEW outperforms pure node2vec and is not sensitive to hyperparameters (i.e. consistently good). In fact, it achieves similarly good results as supervised GCN, even without any node feature or label information during training. Finally, we explain why ARGEW works consistently well by exploring the coappearance distributions using a synthetic graph with clear structural roles.

Via

Access Paper or Ask Questions

Gradient-based Bit Encoding Optimization for Noise-Robust Binary Memristive Crossbar

Jan 05, 2022

Youngeun Kim, Hyunsoo Kim, Seijoon Kim, Sang Joon Kim, Priyadarshini Panda

Figure 1 for Gradient-based Bit Encoding Optimization for Noise-Robust Binary Memristive Crossbar

Figure 2 for Gradient-based Bit Encoding Optimization for Noise-Robust Binary Memristive Crossbar

Figure 3 for Gradient-based Bit Encoding Optimization for Noise-Robust Binary Memristive Crossbar

Figure 4 for Gradient-based Bit Encoding Optimization for Noise-Robust Binary Memristive Crossbar

Abstract:Binary memristive crossbars have gained huge attention as an energy-efficient deep learning hardware accelerator. Nonetheless, they suffer from various noises due to the analog nature of the crossbars. To overcome such limitations, most previous works train weight parameters with noise data obtained from a crossbar. These methods are, however, ineffective because it is difficult to collect noise data in large-volume manufacturing environment where each crossbar has a large device/circuit level variation. Moreover, we argue that there is still room for improvement even though these methods somewhat improve accuracy. This paper explores a new perspective on mitigating crossbar noise in a more generalized way by manipulating input binary bit encoding rather than training the weight of networks with respect to noise data. We first mathematically show that the noise decreases as the number of binary bit encoding pulses increases when representing the same amount of information. In addition, we propose Gradient-based Bit Encoding Optimization (GBO) which optimizes a different number of pulses at each layer, based on our in-depth analysis that each layer has a different level of noise sensitivity. The proposed heterogeneous layer-wise bit encoding scheme achieves high noise robustness with low computational cost. Our experimental results on public benchmark datasets show that GBO improves the classification accuracy by ~5-40% in severe noise scenarios.

* Accepted to DATE2022

Via

Access Paper or Ask Questions

Learning to Discover Cross-Domain Relations with Generative Adversarial Networks

May 15, 2017

Taeksoo Kim, Moonsu Cha, Hyunsoo Kim, Jung Kwon Lee, Jiwon Kim

Figure 1 for Learning to Discover Cross-Domain Relations with Generative Adversarial Networks

Figure 2 for Learning to Discover Cross-Domain Relations with Generative Adversarial Networks

Figure 3 for Learning to Discover Cross-Domain Relations with Generative Adversarial Networks

Figure 4 for Learning to Discover Cross-Domain Relations with Generative Adversarial Networks

Abstract:While humans easily recognize relations between data from different domains without any supervision, learning to automatically discover them is in general very challenging and needs many ground-truth pairs that illustrate the relations. To avoid costly pairing, we address the task of discovering cross-domain relations given unpaired data. We propose a method based on generative adversarial networks that learns to discover relations between different domains (DiscoGAN). Using the discovered relations, our proposed network successfully transfers style from one domain to another while preserving key attributes such as orientation and face identity. Source code for official implementation is publicly available https://github.com/SKTBrain/DiscoGAN

* Accepted to International Conference on Machine Learning (ICML) 2017

Via

Access Paper or Ask Questions