Seunghwan Kim

Communication-Free Collective Navigation for a Swarm of UAVs via LiDAR-Based Deep Reinforcement Learning

Jan 20, 2026

Myong-Yol Choi, Hankyoul Ko, Hanse Cho, Changseung Kim, Seunghwan Kim, Jaemin Seo, Hyondong Oh

Abstract: This paper presents a deep reinforcement learning (DRL) based controller for collective navigation of unmanned aerial vehicle (UAV) swarms in communication-denied environments, enabling robust operation in complex, obstacle-rich environments. Inspired by biological swarms where informed individuals guide groups without explicit communication, we employ an implicit leader-follower framework. In this paradigm, only the leader possesses goal information, while follower UAVs learn robust policies using only onboard LiDAR sensing, without requiring any inter-agent communication or leader identification. Our system utilizes LiDAR point clustering and an extended Kalman filter for stable neighbor tracking, providing reliable perception independent of external positioning systems. The core of our approach is a DRL controller, trained in GPU-accelerated NVIDIA Isaac Sim, that enables followers to learn complex emergent behaviors, balancing flocking and obstacle avoidance, using only local perception. This allows the swarm to implicitly follow the leader while robustly addressing perceptual challenges such as occlusion and limited field-of-view. The robustness and sim-to-real transfer of our approach are confirmed through extensive simulations and challenging real-world experiments with a swarm of five UAVs, which successfully demonstrated collective navigation across diverse indoor and outdoor environments without any communication or external localization.

Neural Clustering for Prefractured Mesh Generation in Real-time Object Destruction

Feb 07, 2025

Seunghwan Kim, Sunha Park, Seungkyu Lee

Abstract: The prefracture method is a practical way to implement real-time object destruction, which is otherwise hardly achievable within performance constraints, but it can produce unrealistic results due to its heuristic nature. To mitigate this, we treat the clustering step of prefractured mesh generation as unordered segmentation of point cloud data and propose leveraging a deep neural network trained on a physics-based dataset. Our novel paradigm successfully predicts the structural weaknesses of objects, whose prediction has so far been limited, and produces ready-to-use results with remarkable quality.

Enhancing Exploration Efficiency using Uncertainty-Aware Information Prediction

Dec 17, 2024

Seunghwan Kim, Heejung Shin, Gaeun Yim, Changseung Kim, Hyondong Oh

Abstract: Autonomous exploration is a crucial aspect of robotics, enabling robots to explore unknown environments and generate maps without prior knowledge. This paper proposes a method to enhance exploration efficiency by integrating neural network-based occupancy grid map prediction with an uncertainty-aware Bayesian neural network. The uncertainty of the neural network-based occupancy grid map prediction is probabilistically integrated into the mutual information used for exploration. To demonstrate the effectiveness of the proposed method, we conducted comparative simulations against various information metrics within a frontier exploration framework in a realistic simulator environment. The proposed method showed superior performance in terms of exploration efficiency.

* 7 pages
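
The abstract above does not spell out how the prediction uncertainty enters the mutual information, so the following is only a generic sketch, not the authors' formulation. It assumes that a Bayesian network yields several sampled occupancy predictions per grid cell, and it applies the standard split of predictive entropy into an expected (data) part and an epistemic (model) part. The function names and the sample layout are hypothetical. Note that the epistemic term here is the mutual information between the prediction and the model parameters, which is a different quantity from the sensor-to-map mutual information used to score exploration targets.

```python
import numpy as np

def binary_entropy(p):
    """Entropy in bits of Bernoulli(p), elementwise; p is clipped to avoid log(0)."""
    p = np.clip(p, 1e-12, 1.0 - 1e-12)
    return -(p * np.log2(p) + (1.0 - p) * np.log2(1.0 - p))

def decompose_uncertainty(samples):
    """Split per-cell predictive uncertainty of an occupancy map.

    samples: array of shape (n_samples, rows, cols) holding occupancy
    probabilities from repeated stochastic forward passes (assumed layout).
    Returns (total, expected, epistemic), each of shape (rows, cols), in bits.
    """
    samples = np.asarray(samples, dtype=float)
    total = binary_entropy(samples.mean(axis=0))     # entropy of the mean prediction
    expected = binary_entropy(samples).mean(axis=0)  # mean entropy of each sample
    epistemic = total - expected                     # disagreement between samples
    return total, expected, epistemic

# Two samples that agree carry no epistemic uncertainty; two that disagree do.
agree = np.full((2, 1, 1), 0.5)
disagree = np.array([[[0.1]], [[0.9]]])
```

In this sketch a cell where all samples say 0.5 is uncertain but not epistemically so, whereas a cell where samples split between 0.1 and 0.9 has the same total entropy with a large epistemic share.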

Design and Identification of Keypoint Patches in Unstructured Environments

Oct 01, 2024

Taewook Park, Seunghwan Kim, Hyondong Oh

Abstract: Reliable perception of targets is crucial for the stable operation of autonomous robots. A widely preferred method is keypoint identification in an image, as it allows direct mapping from raw images to 2D coordinates, facilitating integration with other algorithms such as localization and path planning. In this study, we closely examine the design and identification of keypoint patches in cluttered environments, where factors such as blur and shadows can hinder detection. We propose four simple yet distinct designs that account for variations in scale, rotation, and camera projection using a limited number of pixels. Additionally, we customize the SuperPoint network to ensure robust detection under various types of image degradation. The effectiveness of our approach is demonstrated through real-world video tests, highlighting its potential for vision-based autonomous systems.

* 12 pages, 8 figures, 7 tables

Beta-Sigma VAE: Separating beta and decoder variance in Gaussian variational autoencoder

Sep 14, 2024

Seunghwan Kim, Seungkyu Lee

Abstract: The variational autoencoder (VAE) is an established generative model but is notorious for its blurriness. In this work, we investigate the blurry output problem of the VAE and resolve it, exploiting the variance of the Gaussian decoder and $\beta$ of beta-VAE. Specifically, we reveal that the indistinguishability of decoder variance and $\beta$ hinders appropriate analysis of the model by random likelihood value, and limits performance improvement by omitting the gain from $\beta$. To address the problem, we propose Beta-Sigma VAE (BS-VAE), which explicitly separates $\beta$ and decoder variance $\sigma^2_x$ in the model. Our method demonstrates not only superior performance in natural image synthesis but also controllable parameters and predictable analysis compared to the conventional VAE. In our experimental evaluation, we employ rate-distortion curve analysis and proxy metrics on computer vision datasets. The code is available at https://github.com/overnap/BS-VAE

* Accepted for ICPR 2024
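
The indistinguishability mentioned in the abstract above can be illustrated with the usual fixed-variance Gaussian VAE objective. The sketch below is not the BS-VAE implementation from the linked repository; it is a minimal NumPy illustration that assumes a squared-error term scaled by 1/(2σ²), a β-weighted KL term against a standard normal prior, and omits the constant log-variance term of the Gaussian likelihood. The function names are hypothetical. Under these assumptions, multiplying the objective by σ² leaves a quantity that depends on β and σ² only through their product, so two settings with the same product cannot be told apart by the optimizer.

```python
import numpy as np

def gaussian_kl(mu, logvar):
    """KL( N(mu, diag(exp(logvar))) || N(0, I) ), summed over latent dimensions."""
    return 0.5 * np.sum(np.exp(logvar) + mu**2 - 1.0 - logvar)

def weighted_objective(x, x_hat, mu, logvar, beta, sigma2):
    """Fixed-variance Gaussian reconstruction term plus beta-weighted KL.

    The additive constant (D/2) * log(2 * pi * sigma2) is dropped because it
    does not affect the gradients when sigma2 is a fixed hyperparameter.
    """
    recon = np.sum((x - x_hat) ** 2) / (2.0 * sigma2)
    return recon + beta * gaussian_kl(mu, logvar)

# Illustrative values only.
x = np.array([1.0, 2.0, 3.0])
x_hat = np.array([0.5, 2.5, 2.0])
mu = np.array([0.3, -0.2])
logvar = np.array([-0.1, 0.2])

# Both settings have beta * sigma2 == 1, so the rescaled objectives coincide.
a = 0.25 * weighted_objective(x, x_hat, mu, logvar, beta=4.0, sigma2=0.25)
b = 1.00 * weighted_objective(x, x_hat, mu, logvar, beta=1.0, sigma2=1.0)
```

Because `a` and `b` are equal, the two configurations share the same minimizers even though they imply different likelihood values, which is the kind of ambiguity the abstract describes.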

Diversity and stylization of the contemporary user-generated visual arts in the complexity-entropy plane

Aug 21, 2024

Seunghwan Kim, Byunghwee Lee, Wonjae Lee

Abstract: The advent of computational and numerical methods in recent times has provided new avenues for analyzing art historiographical narratives and tracing the evolution of art styles therein. Here, we investigate an evolutionary process underpinning the emergence and stylization of contemporary user-generated visual art styles using the complexity-entropy (C-H) plane, which quantifies local structures in paintings. Informatizing 149,780 images curated on the DeviantArt and Behance platforms from 2010 to 2020, we analyze the relationship between local information of the C-H space and multi-level image features generated by a deep neural network and a feature extraction algorithm. The results reveal significant statistical relationships between the C-H information of visual artistic styles and the dissimilarities of the multi-level image features over time within groups of artworks. By disclosing a particular C-H region where the diversity of image representations is noticeably manifested, our analyses reveal an empirical condition of emerging styles that are both novel in the C-H plane and characterized by greater stylistic diversity. Our research shows that visual art analyses combined with physics-inspired methodologies and machine learning can provide macroscopic insights into quantitatively mapping relevant characteristics of an evolutionary process underpinning the creative stylization of uncharted visual arts of given groups and time.

* 18 pages, 3 figures, 1 table, SI (4 figures, 3 tables)
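
The abstract above does not state how the C-H coordinates are computed. A common construction, assumed here and not confirmed by the source, takes the ordinal patterns of small sliding patches of a grayscale image, uses the normalized permutation entropy of the pattern distribution as H, and multiplies H by a normalized Jensen-Shannon divergence from the uniform distribution to obtain C. The sketch below follows that assumption with 2x2 patches; the function names are hypothetical, and color images would first need conversion to a single channel.

```python
import itertools
import numpy as np

def shannon(p):
    """Shannon entropy (natural log) of a probability vector, ignoring zeros."""
    p = p[p > 0]
    return float(-np.sum(p * np.log(p)))

def jensen_shannon(a, b):
    """Jensen-Shannon divergence between two probability vectors."""
    return shannon((a + b) / 2) - shannon(a) / 2 - shannon(b) / 2

def complexity_entropy(image, dx=2, dy=2):
    """Return (H, C) for a 2D grayscale array using dy-by-dx ordinal patterns."""
    img = np.asarray(image, dtype=float)
    n = dx * dy
    index = {perm: i for i, perm in enumerate(itertools.permutations(range(n)))}
    counts = np.zeros(len(index))
    rows, cols = img.shape
    for r in range(rows - dy + 1):
        for c in range(cols - dx + 1):
            patch = img[r:r + dy, c:c + dx].ravel()
            # Rank order of the pixels in the patch; ties keep scan order.
            order = tuple(int(i) for i in np.argsort(patch, kind="stable"))
            counts[index[order]] += 1
    p = counts / counts.sum()
    uniform = np.full(len(p), 1.0 / len(p))
    h = shannon(p) / np.log(len(p))
    # The divergence is largest for a single-pattern distribution; use it to normalize.
    delta = np.zeros(len(p))
    delta[0] = 1.0
    q = jensen_shannon(p, uniform) / jensen_shannon(delta, uniform)
    return h, q * h

gradient = np.arange(64).reshape(8, 8)            # one ordinal pattern everywhere
noise = np.random.default_rng(0).random((64, 64)) # close to uniform patterns
h_grad, c_grad = complexity_entropy(gradient)
h_noise, c_noise = complexity_entropy(noise)
```

A perfectly regular gradient sits at the low-entropy corner of the plane and white noise sits near the high-entropy, low-complexity corner; structured images are expected to fall between these extremes.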

Autoregressive Language Models For Estimating the Entropy of Epic EHR Audit Logs

Nov 26, 2023

Benjamin C. Warner, Thomas Kannampallil, Seunghwan Kim

Abstract: Electronic health record (EHR) audit logs are a highly granular stream of events that capture clinician activities, and are a significant area of interest for research in characterizing clinician workflow on the EHR. Existing techniques to measure the complexity of workflow through EHR audit logs (audit logs) involve time- or frequency-based cross-sectional aggregations that are unable to capture the full complexity of an EHR session. We briefly evaluate the use of a transformer-based tabular language model (tabular LM) in measuring the entropy or disorderedness of action sequences within workflow and release the evaluated models publicly.

* Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2023, December 10th, 2023, New Orleans, United States, 10 pages
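
The idea of scoring the disorderedness of an action sequence with an autoregressive model can be sketched without a transformer. The example below substitutes a smoothed bigram model for the tabular LM described in the abstract above, purely for illustration; the action names, the add-one smoothing, and the function names are assumptions. The score is the average negative log2-probability per action, that is, the cross-entropy of the sequence under the model, which is the usual way an autoregressive model yields an entropy estimate.

```python
import math
from collections import Counter

def fit_bigram(sequences, vocab):
    """Fit an add-one smoothed bigram model and return a function P(cur | prev)."""
    pair_counts = Counter()
    context_counts = Counter()
    for seq in sequences:
        for prev, cur in zip(seq, seq[1:]):
            pair_counts[(prev, cur)] += 1
            context_counts[prev] += 1
    vocab_size = len(vocab)

    def prob(prev, cur):
        return (pair_counts[(prev, cur)] + 1) / (context_counts[prev] + vocab_size)

    return prob

def sequence_cross_entropy(seq, prob):
    """Average bits per action of seq under the model (lower means more orderly)."""
    if len(seq) < 2:
        raise ValueError("need at least two actions to score a sequence")
    bits = [-math.log2(prob(prev, cur)) for prev, cur in zip(seq, seq[1:])]
    return sum(bits) / len(bits)

# Hypothetical audit-log actions; real logs have far larger vocabularies.
vocab = {"open_chart", "view_labs", "close_chart"}
training = [["open_chart", "view_labs", "close_chart"]] * 20
prob = fit_bigram(training, vocab)

routine = sequence_cross_entropy(["open_chart", "view_labs", "close_chart"], prob)
erratic = sequence_cross_entropy(["close_chart", "open_chart", "close_chart"], prob)
```

A session that follows the patterns seen in training receives a low score, while one that jumps between actions in unfamiliar order receives a high score; a transformer plays the same role with a much longer context than the single previous action used here.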
