Junghoon Seo

Semi-Implicit Hybrid Gradient Methods with Application to Adversarial Robustness

Feb 21, 2022

Beomsu Kim, Junghoon Seo

Abstract:Adversarial examples, crafted by adding imperceptible perturbations to natural inputs, can easily fool deep neural networks (DNNs). One of the most successful methods for training adversarially robust DNNs is solving a nonconvex-nonconcave minimax problem with an adversarial training (AT) algorithm. However, among the many AT algorithms, only Dynamic AT (DAT) and You Only Propagate Once (YOPO) guarantee convergence to a stationary point. In this work, we generalize the stochastic primal-dual hybrid gradient algorithm to develop semi-implicit hybrid gradient methods (SI-HGs) for finding stationary points of nonconvex-nonconcave minimax problems. SI-HGs have the convergence rate $O(1/K)$, which improves upon the rate $O(1/K^{1/2})$ of DAT and YOPO. We devise a practical variant of SI-HGs, and show that it outperforms other AT algorithms in terms of convergence speed and robustness.

* International Conference on Artificial Intelligence and Statistics (AISTATS) 2022
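
For orientation, the minimax problem that adversarial training targets is commonly written as below. The notation is assumed here for illustration and is not taken from the paper: $\theta$ for the network parameters, $f_{\theta}$ for the network, $\delta$ for the perturbation, $\epsilon$ for the perturbation budget, $\ell$ for the loss, and $\mathcal{D}$ for the data distribution.

```latex
\min_{\theta} \; \mathbb{E}_{(x, y) \sim \mathcal{D}}
\left[ \max_{\|\delta\| \le \epsilon} \ell\big(f_{\theta}(x + \delta),\, y\big) \right]
```

Read as iteration counts and ignoring constants, a rate of $O(1/K)$ needs on the order of $1/\tau$ iterations to push the stationarity measure below a tolerance $\tau$, whereas a rate of $O(1/K^{1/2})$ needs on the order of $1/\tau^{2}$.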

Contrastive Multiview Coding with Electro-optics for SAR Semantic Segmentation

Aug 31, 2021

Keumgang Cha, Junghoon Seo, Yeji Choi

Abstract:In the training of deep learning models, how the model parameters are initialized greatly affects the model performance, sample efficiency, and convergence speed. Representation learning for model initialization has recently been actively studied in the remote sensing field. In particular, the appearance characteristics of imagery obtained using a synthetic aperture radar (SAR) sensor are quite different from those of general electro-optical (EO) images, and thus representation learning is even more important in the remote sensing domain. Motivated by contrastive multiview coding, we propose multi-modal representation learning for SAR semantic segmentation. Unlike previous studies, our method jointly uses EO imagery, SAR imagery, and a label mask. Several experiments show that our approach is superior to the existing methods in model performance, sample efficiency, and convergence speed.

* To appear in IEEE GRSL. DOI to be updated

Training Domain-invariant Object Detector Faster with Feature Replay and Slow Learner

May 31, 2021

Chaehyeon Lee, Junghoon Seo, Heechul Jung

Abstract:In deep learning-based object detection in the remote sensing domain, nuisance factors, which affect observed variables while not affecting predictor variables, often matter because they cause domain changes. Previously, nuisance disentangled feature transformation (NDFT) was proposed to build a domain-invariant feature extractor with knowledge of nuisance factors. However, NDFT requires an enormous amount of training time, which makes it impractical. In this paper, we introduce A-NDFT, an improvement to NDFT. A-NDFT utilizes two acceleration techniques, feature replay and slow learner. Consequently, on the large-scale UAVDT benchmark, our framework reduces the training time of NDFT from 31 hours to 3 hours while still maintaining the performance. The code will be made publicly available online.

* 2021 CVPR Workshop

On the Power of Deep but Naive Partial Label Learning

Oct 22, 2020

Junghoon Seo, Joon Suk Huh

Abstract:Partial label learning (PLL) is a class of weakly supervised learning where each training instance consists of a data point and a set of candidate labels containing a unique ground truth label. To tackle this problem, a majority of current state-of-the-art methods employ either label disambiguation or averaging strategies. So far, PLL methods without such techniques have been considered impractical. In this paper, we challenge this view by revealing the hidden power of the oldest and naivest PLL method when it is instantiated with deep neural networks. Specifically, we show that, with deep neural networks, the naive model can achieve competitive performance against the other state-of-the-art methods, suggesting it as a strong baseline for PLL. We also address the question of how and why such a naive model works well with deep neural networks. Our empirical results indicate that deep neural networks trained on partially labeled examples generalize very well even in the over-parametrized regime and without label disambiguation or regularization. We point out that existing learning theories on PLL are vacuous in the over-parametrized regime. Hence they cannot explain why the deep naive method works. We propose an alternative theory on how deep learning generalizes in PLL problems.
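
A minimal sketch of what a "naive" partial-label loss could look like, assuming the naive method treats every candidate label as if it were the true label and averages the cross-entropy over the candidate set. This reading, and the helper names `softmax` and `naive_pll_loss`, are illustrative assumptions; the abstract does not spell out the exact loss.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of class scores."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def naive_pll_loss(logits, candidates):
    """Average cross-entropy over all candidate labels (assumed naive loss).

    logits     -- model scores, one per class
    candidates -- indices of the candidate labels for this instance
    """
    probs = softmax(logits)
    return -sum(math.log(probs[c]) for c in candidates) / len(candidates)

# Three classes; the candidate set {0, 2} contains the unknown true label.
loss_uniform = naive_pll_loss([0.0, 0.0, 0.0], candidates=[0, 2])
loss_confident = naive_pll_loss([5.0, 0.0, 5.0], candidates=[0, 2])
print(loss_uniform, loss_confident)
```

Under this assumed loss, spreading probability mass over the candidate set lowers the loss, and no disambiguation step is involved.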

Revisiting Classical Bagging with Modern Transfer Learning for On-the-fly Disaster Damage Detector

Oct 04, 2019

Junghoon Seo, Seungwon Lee, Beomsu Kim, Taegyun Jeon

Abstract:Automatic post-disaster damage detection using aerial imagery is crucial for quick assessment of damage caused by disaster and development of a recovery plan. The main problem preventing us from creating an applicable model in practice is that damaged (positive) examples we are trying to detect are much harder to obtain than undamaged (negative) examples, especially in a short time. In this paper, we revisit the classical bootstrap aggregating approach in the context of modern transfer learning for data-efficient disaster damage detection. Unlike previous classical ensemble learning articles, our work points out the effectiveness of simple bagging in deep transfer learning that has been underestimated in the context of imbalanced classification. Benchmark results on the AIST Building Change Detection dataset show that our approach significantly outperforms existing methodologies, including the recently proposed disentanglement learning.

* Accepted at the 2019 NeurIPS Workshop on Artificial Intelligence for Humanitarian Assistance and Disaster Response (AI+HADR 2019)
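
The bootstrap aggregating idea mentioned in the abstract can be sketched on a toy imbalanced problem. This is an illustration only: a 1-D nearest-centroid classifier stands in for the fine-tuned deep networks used in the paper, and the data, helper names, and ensemble size are made up.

```python
import random
from statistics import mean

def fit_centroid_classifier(samples):
    """Fit a 1-D nearest-centroid classifier on (x, label) pairs."""
    pos = [x for x, y in samples if y == 1]
    neg = [x for x, y in samples if y == 0]
    # A bootstrap sample of imbalanced data can miss a class entirely;
    # fall back to a constant predictor in that case.
    if not pos:
        return lambda x: 0
    if not neg:
        return lambda x: 1
    c_pos, c_neg = mean(pos), mean(neg)
    return lambda x: 1 if abs(x - c_pos) < abs(x - c_neg) else 0

def bagging_fit(samples, n_models, rng):
    """Train n_models classifiers, each on a bootstrap resample."""
    models = []
    for _ in range(n_models):
        # Sample with replacement, same size as the original set.
        boot = [rng.choice(samples) for _ in samples]
        models.append(fit_centroid_classifier(boot))
    return models

def bagging_predict(models, x):
    """Majority vote over the ensemble."""
    votes = sum(m(x) for m in models)
    return 1 if 2 * votes > len(models) else 0

rng = random.Random(0)
# 20 undamaged (negative) examples versus only 3 damaged (positive) ones.
data = [(0.1 * i, 0) for i in range(20)] + [(5.0, 1), (5.5, 1), (6.0, 1)]
models = bagging_fit(data, n_models=25, rng=rng)
print(bagging_predict(models, 0.5), bagging_predict(models, 5.5))
```

Each ensemble member sees a different resample of the scarce positive examples, and the vote averages out the variance that any single resample introduces.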

Deep Closed-Form Subspace Clustering

Aug 26, 2019

Junghoon Seo, Jamyoung Koo, Taegyun Jeon

Abstract:We propose Deep Closed-Form Subspace Clustering (DCFSC), a new, embarrassingly simple model for subspace clustering that learns a non-linear mapping. Compared with the previous deep subspace clustering (DSC) techniques, our DCFSC does not have any parameters at all for the self-expressive layer. Instead, DCFSC utilizes an implicit data-driven self-expressive layer derived from a closed-form shallow auto-encoder. Moreover, DCFSC has no complicated optimization scheme, unlike the other subspace clustering methods. With its extreme simplicity, DCFSC has significant memory-related benefits over the existing DSC method, especially on large datasets. Several experiments showed that our DCFSC model has enough potential to be a new reference model for subspace clustering on large-scale high-dimensional datasets.

* Accepted at the 2019 ICCV Workshop on Robust Subspace Learning and Applications in Computer Vision (RSL-CV 2019)

NL-LinkNet: Toward Lighter but More Accurate Road Extraction with Non-Local Operations

Aug 22, 2019

Yooseung Wang, Junghoon Seo, Taegyun Jeon

Abstract:Road extraction from very high resolution satellite images is one of the most important topics in the field of remote sensing. For the road segmentation problem, spatial properties of the data can usually be captured using Convolutional Neural Networks. However, this approach only considers a few local neighborhoods at a time and has difficulty capturing long-range dependencies. To overcome this problem, we propose Non-Local LinkNet with non-local blocks that can grasp relations between global features. It enables each spatial feature point to refer to all other contextual information and results in more accurate road segmentation. In detail, our method achieved a 65.00% mIOU score on the DeepGlobe 2018 Road Extraction Challenge dataset. Our best model outperformed D-LinkNet, the 1st-ranked solution, by a significant gap of 0.88% mIOU with far fewer parameters. We also present empirical analyses on proper usage of non-local blocks for the baseline model.

* Under review
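
To make "each spatial feature point refers to all other contextual information" concrete, here is a stripped-down non-local operation. It is a sketch under stated assumptions: it uses plain dot-product affinities with a softmax and omits the learned projections and residual connection that a full non-local block normally has. Which variant NL-LinkNet uses is not stated in the abstract, and `non_local` is a hypothetical helper name.

```python
import math

def non_local(features):
    """Apply a simplified non-local operation.

    features -- list of N feature vectors, one per spatial position.
    Returns a list of N vectors; each output is a softmax-weighted
    average of ALL input vectors, weighted by dot-product affinity.
    """
    outputs = []
    for xi in features:
        # Affinity of position i with every position j (including itself).
        scores = [sum(a * b for a, b in zip(xi, xj)) for xj in features]
        m = max(scores)
        exps = [math.exp(s - m) for s in scores]
        z = sum(exps)
        weights = [e / z for e in exps]
        dim = len(xi)
        outputs.append([
            sum(w * xj[d] for w, xj in zip(weights, features))
            for d in range(dim)
        ])
    return outputs

# Four positions with 2-D features; distant positions still interact.
feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.5]]
mixed = non_local(feats)
print(mixed)
```

Unlike a convolution, the cost grows with the square of the number of positions, which is one reason the placement of such blocks in a network matters.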

Bridging Adversarial Robustness and Gradient Interpretability

Apr 19, 2019

Beomsu Kim, Junghoon Seo, Taegyun Jeon

Abstract:Adversarial training is a training scheme designed to counter adversarial attacks by augmenting the training dataset with adversarial examples. Surprisingly, several studies have observed that loss gradients from adversarially trained DNNs are visually more interpretable than those from standard DNNs. Although this phenomenon is interesting, only a few works have offered an explanation. In this paper, we attempted to bridge this gap between adversarial robustness and gradient interpretability. To this end, we identified that loss gradients from adversarially trained DNNs align better with human perception because adversarial training restricts gradients closer to the image manifold. We then demonstrated that adversarial training causes loss gradients to be quantitatively meaningful. Finally, we showed that under the adversarial training framework, there exists an empirical trade-off between test accuracy and loss gradient interpretability and proposed two potential approaches to resolving this trade-off.

* Accepted at the 2019 ICLR Workshop on Safe Machine Learning: Specification, Robustness, and Assurance (SafeML 2019)
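
As a concrete picture of "augmenting the training dataset with adversarial examples", the sketch below crafts one adversarial example for a linear logistic model. It uses the fast gradient sign method (FGSM), a standard attack swapped in for illustration; the abstract does not say which attack the paper uses, and the weights, input, and budget are made up.

```python
import math

def logistic_loss(w, x, y):
    """Logistic loss log(1 + exp(-y * w.x)) for a label y in {-1, +1}."""
    margin = y * sum(wi * xi for wi, xi in zip(w, x))
    return math.log1p(math.exp(-margin))

def sign(v):
    return (v > 0) - (v < 0)

def fgsm(w, x, y, eps):
    """One FGSM step: move x by eps along the sign of the input gradient.

    The gradient of the loss w.r.t. x is -y * sigmoid(-margin) * w.
    The sigmoid factor is positive, so the sign reduces to sign(-y * w).
    """
    return [xi + eps * sign(-y * wi) for xi, wi in zip(x, w)]

w = [2.0, -1.0]   # hypothetical model weights
x = [1.0, 1.0]    # clean input
y = 1             # true label
x_adv = fgsm(w, x, y, eps=0.1)

clean_loss = logistic_loss(w, x, y)
adv_loss = logistic_loss(w, x_adv, y)
print(x_adv, clean_loss, adv_loss)
```

Adversarial training would then add `(x_adv, y)` to the training batch, so the model is fitted on perturbed inputs as well as clean ones.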

Why are Saliency Maps Noisy? Cause of and Solution to Noisy Saliency Maps

Feb 20, 2019

Beomsu Kim, Junghoon Seo, SeungHyun Jeon, Jamyoung Koo, Jeongyeol Choe, Taegyun Jeon

Abstract:Saliency Map, the gradient of the score function with respect to the input, is the most basic technique for interpreting deep neural network decisions. However, saliency maps are often visually noisy. Although several hypotheses were proposed to account for this phenomenon, there are few works that provide rigorous analyses of noisy saliency maps. In this paper, we identify that noise occurs in saliency maps when irrelevant features pass through ReLU activation functions. Then we propose Rectified Gradient, a method that solves this problem through layer-wise thresholding during backpropagation. Experiments with neural networks trained on CIFAR-10 and ImageNet showed the effectiveness of our method and its superiority to other attribution methods.
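
The definition of a saliency map as the gradient of the score with respect to the input can be worked through by hand on a tiny one-hidden-layer ReLU network. This sketch covers the plain saliency map only, not Rectified Gradient, whose thresholding rule the abstract does not specify; the weights and helper names are illustrative.

```python
def relu_net_score(W1, w2, x):
    """Score of a one-hidden-layer ReLU network: w2 . relu(W1 x)."""
    hidden = [max(0.0, sum(w * xi for w, xi in zip(row, x))) for row in W1]
    return sum(v * h for v, h in zip(w2, hidden))

def saliency(W1, w2, x):
    """Gradient of the score w.r.t. x, computed by manual backpropagation.

    A ReLU passes the gradient through only where its pre-activation is
    positive, so each hidden unit k contributes w2[k] * W1[k] when active
    and nothing otherwise.
    """
    pre = [sum(w * xi for w, xi in zip(row, x)) for row in W1]
    gate = [1.0 if p > 0 else 0.0 for p in pre]
    return [
        sum(w2[k] * gate[k] * W1[k][d] for k in range(len(W1)))
        for d in range(len(x))
    ]

W1 = [[1.0, -2.0], [0.5, 1.0]]   # hidden-layer weights (made up)
w2 = [2.0, -1.0]                 # output weights (made up)

both_active = saliency(W1, w2, [3.0, 1.0])   # pre-activations 1.0 and 2.5
one_active = saliency(W1, w2, [1.0, 1.0])    # pre-activations -1.0 and 1.5
print(both_active, one_active)
```

The second input shows the gating effect: once the first hidden unit is inactive, its weights disappear from the saliency map altogether.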

Domain Adaptive Generation of Aircraft on Satellite Imagery via Simulated and Unsupervised Learning

Jun 08, 2018

Junghoon Seo, Seunghyun Jeon, Taegyun Jeon

Abstract:Object detection and classification for aircraft are the most important tasks in satellite image analysis. The success of modern detection and classification methods has been based on machine learning and deep learning. One of the key requirements for those learning processes is a huge amount of training data. However, aircraft data are insufficient because the targets are engaged in military action and operation. Considering the characteristics of satellite imagery, this paper attempts to provide a framework of the simulated and unsupervised methodology without any additional supervision or physical assumptions. Finally, the qualitative and quantitative analysis revealed a potential to replenish insufficient data for machine learning platforms for satellite image analysis.

* Presented at the International Workshop on Machine Learning for Artificial Intelligence Platforms held at the 2017 Asian Conference on Machine Learning (MLAIP@ACML)
