Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Minkyu Kim

Higher-order Neural Additive Models: An Interpretable Machine Learning Model with Feature Interactions

Sep 30, 2022
Minkyu Kim, Hyun-Soo Choi, Jinho Kim

Figure 1 for Higher-order Neural Additive Models: An Interpretable Machine Learning Model with Feature Interactions

Figure 2 for Higher-order Neural Additive Models: An Interpretable Machine Learning Model with Feature Interactions

Figure 3 for Higher-order Neural Additive Models: An Interpretable Machine Learning Model with Feature Interactions

Figure 4 for Higher-order Neural Additive Models: An Interpretable Machine Learning Model with Feature Interactions

Black-box models, such as deep neural networks, exhibit superior predictive performances, but understanding their behavior is notoriously difficult. Many explainable artificial intelligence methods have been proposed to reveal the decision-making processes of black box models. However, their applications in high-stakes domains remain limited. Recently proposed neural additive models (NAM) have achieved state-of-the-art interpretable machine learning. NAM can provide straightforward interpretations with slight performance sacrifices compared with multi-layer perceptron. However, NAM can only model 1$^{\text{st}}$-order feature interactions; thus, it cannot capture the co-relationships between input features. To overcome this problem, we propose a novel interpretable machine learning method called higher-order neural additive models (HONAM) and a feature interaction method for high interpretability. HONAM can model arbitrary orders of feature interactions. Therefore, it can provide the high predictive performance and interpretability that high-stakes domains need. In addition, we propose a novel hidden unit to effectively learn sharp-shape functions. We conducted experiments using various real-world datasets to examine the effectiveness of HONAM. Furthermore, we demonstrate that HONAM can achieve fair AI with a slight performance sacrifice. The source code for HONAM is publicly available.

Via

Access Paper or Ask Questions

Explicit Feature Interaction-aware Graph Neural Networks

Apr 07, 2022
Minkyu Kim, Hyun-Soo Choi, Jinho Kim

Figure 1 for Explicit Feature Interaction-aware Graph Neural Networks

Figure 2 for Explicit Feature Interaction-aware Graph Neural Networks

Figure 3 for Explicit Feature Interaction-aware Graph Neural Networks

Figure 4 for Explicit Feature Interaction-aware Graph Neural Networks

Graph neural networks are powerful methods to handle graph-structured data. However, existing graph neural networks only learn higher-order feature interactions implicitly. Thus, they cannot capture information that occurred in low-order feature interactions. To overcome this problem, we propose Explicit Feature Interaction-aware Graph Neural Network (EFI-GNN), which explicitly learns arbitrary-order feature interactions. EFI-GNN can jointly learn with any other graph neural network. We demonstrate that the joint learning method always enhances performance on the various node classification tasks. Furthermore, since EFI-GNN is inherently a linear model, we can interpret the prediction result of EFI-GNN. With the computation rule, we can obtain an any-order feature's effect on the decision. By that, we visualize the effects of the first-order and second-order features as a form of a heatmap.

* 9 pages, 9 figures, pre-print

Via

Access Paper or Ask Questions

KC-TSS: An Algorithm for Heterogeneous Robot Teams Performing Resilient Target Search

Mar 02, 2022
Minkyu Kim, Ryan Gupta, Luis Sentis

Figure 1 for KC-TSS: An Algorithm for Heterogeneous Robot Teams Performing Resilient Target Search

Figure 2 for KC-TSS: An Algorithm for Heterogeneous Robot Teams Performing Resilient Target Search

Figure 3 for KC-TSS: An Algorithm for Heterogeneous Robot Teams Performing Resilient Target Search

Figure 4 for KC-TSS: An Algorithm for Heterogeneous Robot Teams Performing Resilient Target Search

This paper proposes KC-TSS: K-Clustered-Traveling Salesman Based Search, a failure resilient path planning algorithm for heterogeneous robot teams performing target search in human environments. We separate the sample path generation problem into Heterogeneous Clustering and multiple Traveling Salesman Problems. This allows us to provide high-quality candidate paths (i.e. minimal backtracking, overlap) to an Information-Theoretic utility function for each agent. First, we generate waypoint candidates from map knowledge and a target prediction model. All of these candidates are clustered according to the number of agents and their ability to cover space, or coverage competency. Each agent solves a Traveling Salesman Problem (TSP) instance over their assigned cluster and then candidates are fed to a utility function for path selection. We perform extensive Gazebo simulations and preliminary deployment of real robots in indoor search and simulated rescue scenarios with static targets. We compare our proposed method against a state-of-the-art algorithm and show that ours is able to outperform it in mission time. Our method provides resilience in the event of single or multi teammate failure by recomputing global team plans online.

Via

Access Paper or Ask Questions

SmoothMix: Training Confidence-calibrated Smoothed Classifiers for Certified Robustness

Nov 17, 2021
Jongheon Jeong, Sejun Park, Minkyu Kim, Heung-Chang Lee, Doguk Kim, Jinwoo Shin

Figure 1 for SmoothMix: Training Confidence-calibrated Smoothed Classifiers for Certified Robustness

Figure 2 for SmoothMix: Training Confidence-calibrated Smoothed Classifiers for Certified Robustness

Figure 3 for SmoothMix: Training Confidence-calibrated Smoothed Classifiers for Certified Robustness

Figure 4 for SmoothMix: Training Confidence-calibrated Smoothed Classifiers for Certified Robustness

Randomized smoothing is currently a state-of-the-art method to construct a certifiably robust classifier from neural networks against $\ell_2$-adversarial perturbations. Under the paradigm, the robustness of a classifier is aligned with the prediction confidence, i.e., the higher confidence from a smoothed classifier implies the better robustness. This motivates us to rethink the fundamental trade-off between accuracy and robustness in terms of calibrating confidences of a smoothed classifier. In this paper, we propose a simple training scheme, coined SmoothMix, to control the robustness of smoothed classifiers via self-mixup: it trains on convex combinations of samples along the direction of adversarial perturbation for each input. The proposed procedure effectively identifies over-confident, near off-class samples as a cause of limited robustness in case of smoothed classifiers, and offers an intuitive way to adaptively set a new decision boundary between these samples for better robustness. Our experimental results demonstrate that the proposed method can significantly improve the certified $\ell_2$-robustness of smoothed classifiers compared to existing state-of-the-art robust training methods.

* 24 pages; NeurIPS 2021; Code is available at https://github.com/jh-jeong/smoothmix

Via

Access Paper or Ask Questions

Global-Local Item Embedding for Temporal Set Prediction

Sep 05, 2021
Seungjae Jung, Young-Jin Park, Jisu Jeong, Kyung-Min Kim, Hiun Kim, Minkyu Kim, Hanock Kwak

Figure 1 for Global-Local Item Embedding for Temporal Set Prediction

Figure 2 for Global-Local Item Embedding for Temporal Set Prediction

Figure 3 for Global-Local Item Embedding for Temporal Set Prediction

Figure 4 for Global-Local Item Embedding for Temporal Set Prediction

Temporal set prediction is becoming increasingly important as many companies employ recommender systems in their online businesses, e.g., personalized purchase prediction of shopping baskets. While most previous techniques have focused on leveraging a user's history, the study of combining it with others' histories remains untapped potential. This paper proposes Global-Local Item Embedding (GLOIE) that learns to utilize the temporal properties of sets across whole users as well as within a user by coining the names as global and local information to distinguish the two temporal patterns. GLOIE uses Variational Autoencoder (VAE) and dynamic graph-based model to capture global and local information and then applies attention to integrate resulting item embeddings. Additionally, we propose to use Tweedie output for the decoder of VAE as it can easily model zero-inflated and long-tailed distribution, which is more suitable for several real-world data distributions than Gaussian or multinomial counterparts. When evaluated on three public benchmarks, our algorithm consistently outperforms previous state-of-the-art methods in most ranking metrics.

* 8 pages, 3 figures. To appear in RecSys 2021 LBR

Via

Access Paper or Ask Questions

Information-Theoretic Based Target Search with Multiple Agents

Jul 27, 2021
Minkyu Kim, Ryan Gupta, Luis Sentis

Figure 1 for Information-Theoretic Based Target Search with Multiple Agents

Figure 2 for Information-Theoretic Based Target Search with Multiple Agents

Figure 3 for Information-Theoretic Based Target Search with Multiple Agents

Figure 4 for Information-Theoretic Based Target Search with Multiple Agents

This paper proposes an online path planning and motion generation algorithm for heterogeneous robot teams performing target search in a real-world environment. Path selection for each robot is optimized using an information-theoretic formulation and is computed sequentially for each agent. First, we generate candidate trajectories sampled from both global waypoints derived from vertical cell decomposition and local frontier points. From this set, we choose the path with maximum information gain. We demonstrate that the hierarchical sequential decision-making structure provided by the algorithm is scalable to multiple agents in a simulation setup. We also validate our framework in a real-world apartment setting using a two robot team comprised of the Unitree A1 quadruped and the Toyota HSR mobile manipulator searching for a person. The agents leverage an efficient leader-follower communication structure where only critical information is shared.

* 6 pages, 6 figures

Via

Access Paper or Ask Questions

One4all User Representation for Recommender Systems in E-commerce

May 24, 2021
Kyuyong Shin, Hanock Kwak, Kyung-Min Kim, Minkyu Kim, Young-Jin Park, Jisu Jeong, Seungjae Jung

Figure 1 for One4all User Representation for Recommender Systems in E-commerce

Figure 2 for One4all User Representation for Recommender Systems in E-commerce

Figure 3 for One4all User Representation for Recommender Systems in E-commerce

Figure 4 for One4all User Representation for Recommender Systems in E-commerce

General-purpose representation learning through large-scale pre-training has shown promising results in the various machine learning fields. For an e-commerce domain, the objective of general-purpose, i.e., one for all, representations would be efficient applications for extensive downstream tasks such as user profiling, targeting, and recommendation tasks. In this paper, we systematically compare the generalizability of two learning strategies, i.e., transfer learning through the proposed model, ShopperBERT, vs. learning from scratch. ShopperBERT learns nine pretext tasks with 79.2M parameters from 0.8B user behaviors collected over two years to produce user embeddings. As a result, the MLPs that employ our embedding method outperform more complex models trained from scratch for five out of six tasks. Specifically, the pre-trained embeddings have superiority over the task-specific supervised features and the strong baselines, which learn the auxiliary dataset for the cold-start problem. We also show the computational efficiency and embedding visualization of the pre-trained features.

Via

Access Paper or Ask Questions

Active Object Tracking using Context Estimation: Handling Occlusions and Detecting Missing Targets

Dec 14, 2019
Minkyu Kim, Luis Sentis

Figure 1 for Active Object Tracking using Context Estimation: Handling Occlusions and Detecting Missing Targets

Figure 2 for Active Object Tracking using Context Estimation: Handling Occlusions and Detecting Missing Targets

Figure 3 for Active Object Tracking using Context Estimation: Handling Occlusions and Detecting Missing Targets

Figure 4 for Active Object Tracking using Context Estimation: Handling Occlusions and Detecting Missing Targets

When performing visual servoing or object tracking tasks, active sensor planning is essential to keep targets in sight or to relocate them when missing. In particular, when dealing with a known target missing from the sensor's field of view, we propose using prior knowledge related to contextual information to estimate its possible location. To this end, this study proposes a Dynamic Bayesian Network that uses contextual information to effectively search for targets. Monte Carlo particle filtering is employed to approximate the posterior probability of the target's state, from which uncertainty is defined. We define the robot's utility function via information-theoretic formalism as seeking the optimal action which reduces uncertainty of a task, prompting robot agents to investigate the location where the target most likely might exist. Using a context state model, we design the agent's high-level decision framework using a Partially-Observable Markov Decision Process. Based on the estimated belief state of the context via sequential observations, the robot's navigation actions are determined to conduct exploratory and detection tasks. By using this multi-modal context model, our agent can effectively handle basic dynamic events, such as obstruction of targets or their absence from the field of view. We implement and demonstrate these capabilities on a mobile robot in real-time.

* 11 pages, 8 figures

Via

Access Paper or Ask Questions

Tripartite Heterogeneous Graph Propagation for Large-scale Social Recommendation

Jul 24, 2019
Kyung-Min Kim, Donghyun Kwak, Hanock Kwak, Young-Jin Park, Sangkwon Sim, Jae-Han Cho, Minkyu Kim, Jihun Kwon, Nako Sung, Jung-Woo Ha

Figure 1 for Tripartite Heterogeneous Graph Propagation for Large-scale Social Recommendation

Figure 2 for Tripartite Heterogeneous Graph Propagation for Large-scale Social Recommendation

Figure 3 for Tripartite Heterogeneous Graph Propagation for Large-scale Social Recommendation

Figure 4 for Tripartite Heterogeneous Graph Propagation for Large-scale Social Recommendation

Graph Neural Networks (GNNs) have been emerging as a promising method for relational representation including recommender systems. However, various challenging issues of social graphs hinder the practical usage of GNNs for social recommendation, such as their complex noisy connections and high heterogeneity. The oversmoothing of GNNs is an obstacle of GNN-based social recommendation as well. Here we propose a new graph embedding method Heterogeneous Graph Propagation (HGP) to tackle these issues. HGP uses a group-user-item tripartite graph as input to reduce the number of edges and the complexity of paths in a social graph. To solve the oversmoothing issue, HGP embeds nodes under a personalized PageRank based propagation scheme, separately for group-user graph and user-item graph. Node embeddings from each graph are integrated using an attention mechanism. We evaluate our HGP on a large-scale real-world dataset consisting of 1,645,279 nodes and 4,711,208 edges. The experimental results show that HGP outperforms several baselines in terms of AUC and F1-score metrics.

* 6 pages, accepted for RecSys 2019 LBR Track

Via

Access Paper or Ask Questions

Toward Achieving Formal Guarantees for Human-Aware Controllers in Human-Robot Interactions

Mar 04, 2019
Rachel Schlossman, Minkyu Kim, Ufuk Topcu, Luis Sentis

Figure 1 for Toward Achieving Formal Guarantees for Human-Aware Controllers in Human-Robot Interactions

Figure 2 for Toward Achieving Formal Guarantees for Human-Aware Controllers in Human-Robot Interactions

Figure 3 for Toward Achieving Formal Guarantees for Human-Aware Controllers in Human-Robot Interactions

Figure 4 for Toward Achieving Formal Guarantees for Human-Aware Controllers in Human-Robot Interactions

With the primary objective of human-robot interaction being to support humans' goals, there exists a need to formally synthesize robot controllers that can provide the desired service. Synthesis techniques have the benefit of providing formal guarantees for specification satisfaction. There is potential to apply these techniques for devising robot controllers whose specifications are coupled with human needs. This paper explores the use of formal methods to construct human-aware robot controllers to support the productivity requirements of humans. We tackle these types of scenarios via human workload-informed models and reactive synthesis. This strategy allows us to synthesize controllers that fulfill formal specifications that are expressed as linear temporal logic formulas. We present a case study in which we reason about a work delivery and pickup task such that the robot increases worker productivity, but not stress induced by high work backlog. We demonstrate our controller using the Toyota HSR, a mobile manipulator robot. The results demonstrate the realization of a robust robot controller that is guaranteed to properly reason and react in collaborative tasks with human partners.

Via

Access Paper or Ask Questions