Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jinwoo Shin

Korea Advanced Institute of Science and Technology

Learning from Failure: Training Debiased Classifier from Biased Classifier

Jul 06, 2020

Junhyun Nam, Hyuntak Cha, Sungsoo Ahn, Jaeho Lee, Jinwoo Shin

Figure 1 for Learning from Failure: Training Debiased Classifier from Biased Classifier

Figure 2 for Learning from Failure: Training Debiased Classifier from Biased Classifier

Figure 3 for Learning from Failure: Training Debiased Classifier from Biased Classifier

Figure 4 for Learning from Failure: Training Debiased Classifier from Biased Classifier

Abstract:Neural networks often learn to make predictions that overly rely on spurious correlation existing in the dataset, which causes the model to be biased. While previous work tackles this issue with domain-specific knowledge or explicit supervision on the spuriously correlated attributes, we instead tackle a more challenging setting where such information is unavailable. To this end, we first observe that neural networks learn to rely on the spurious correlation only when it is ''easier'' to learn than the desired knowledge, and such reliance is most prominent during the early phase of training. Based on the observations, we propose a failure-based debiasing scheme by training a pair of neural networks simultaneously. Our main idea is twofold; (a) we intentionally train the first network to be biased by repeatedly amplifying its ''prejudice'', and (b) we debias the training of the second network by focusing on samples that go against the prejudice of the biased network in (a). Extensive experiments demonstrate that our method significantly improves the training of network against various types of biases in both synthetic and real-world datasets. Surprisingly, our framework even occasionally outperforms the debiasing methods requiring explicit supervision of the spuriously correlated attributes.

Via

Access Paper or Ask Questions

Learning What to Defer for Maximum Independent Sets

Jun 29, 2020

Sungsoo Ahn, Younggyo Seo, Jinwoo Shin

Figure 1 for Learning What to Defer for Maximum Independent Sets

Figure 2 for Learning What to Defer for Maximum Independent Sets

Figure 3 for Learning What to Defer for Maximum Independent Sets

Figure 4 for Learning What to Defer for Maximum Independent Sets

Abstract:Designing efficient algorithms for combinatorial optimization appears ubiquitously in various scientific fields. Recently, deep reinforcement learning (DRL) frameworks have gained considerable attention as a new approach: they can automate the design of a solver while relying less on sophisticated domain knowledge of the target problem. However, the existing DRL solvers determine the solution using a number of stages proportional to the number of elements in the solution, which severely limits their applicability to large-scale graphs. In this paper, we seek to resolve this issue by proposing a novel DRL scheme, coined learning what to defer (LwD), where the agent adaptively shrinks or stretch the number of stages by learning to distribute the element-wise decisions of the solution at each stage. We apply the proposed framework to the maximum independent set (MIS) problem, and demonstrate its significant improvement over the current state-of-the-art DRL scheme. We also show that LwD can outperform the conventional MIS solvers on large-scale graphs having millions of vertices, under a limited time budget.

* Added section for acknowledgment

Via

Access Paper or Ask Questions

Learning to Generate Noise for Robustness against Multiple Perturbations

Jun 22, 2020

Divyam Madaan, Jinwoo Shin, Sung Ju Hwang

Figure 1 for Learning to Generate Noise for Robustness against Multiple Perturbations

Figure 2 for Learning to Generate Noise for Robustness against Multiple Perturbations

Figure 3 for Learning to Generate Noise for Robustness against Multiple Perturbations

Figure 4 for Learning to Generate Noise for Robustness against Multiple Perturbations

Abstract:Adversarial learning has emerged as one of the successful techniques to circumvent the susceptibility of existing methods against adversarial perturbations. However, the majority of existing defense methods are tailored to defend against a single category of adversarial perturbation (e.g. $\ell_\infty$-attack). In safety-critical applications, this makes these methods extraneous as the attacker can adopt diverse adversaries to deceive the system. To tackle this challenge of robustness against multiple perturbations, we propose a novel meta-learning framework that explicitly learns to generate noise to improve the model's robustness against multiple types of attacks. Specifically, we propose Meta Noise Generator (MNG) that outputs optimal noise to stochastically perturb a given sample, such that it helps lower the error on diverse adversarial perturbations. However, training on multiple perturbations simultaneously significantly increases the computational overhead during training. To address this issue, we train our MNG while randomly sampling an attack at each epoch, which incurs negligible overhead over standard adversarial training. We validate the robustness of our framework on various datasets and against a wide variety of unseen perturbations, demonstrating that it significantly outperforms the relevant baselines across multiple perturbations with marginal computational cost compared to the multiple perturbations training.

Via

Access Paper or Ask Questions

QOPT: Optimistic Value Function Decentralization for Cooperative Multi-Agent Reinforcement Learning

Jun 22, 2020

Kyunghwan Son, Sungsoo Ahn, Roben Delos Reyes, Jinwoo Shin, Yung Yi

Figure 1 for QOPT: Optimistic Value Function Decentralization for Cooperative Multi-Agent Reinforcement Learning

Figure 2 for QOPT: Optimistic Value Function Decentralization for Cooperative Multi-Agent Reinforcement Learning

Figure 3 for QOPT: Optimistic Value Function Decentralization for Cooperative Multi-Agent Reinforcement Learning

Figure 4 for QOPT: Optimistic Value Function Decentralization for Cooperative Multi-Agent Reinforcement Learning

Abstract:We propose a novel value-based algorithm for cooperative multi-agent reinforcement learning, under the paradigm of centralized training with decentralized execution. The proposed algorithm, coined QOPT, is based on the "optimistic" training scheme using two action-value estimators with separate roles: (i) true action-value estimation and (ii) decentralization of optimal action. By construction, our framework allows the latter action-value estimator to achieve (ii) while representing a richer class of joint action-value estimators than that of the state-of-the-art algorithm, i.e., QMIX. Our experiments demonstrate that QOPT newly achieves state-of-the-art performance in the StarCraft Multi-Agent Challenge environment. In particular, ours significantly outperform the baselines for the case where non-cooperative behaviors are penalized more aggressively.

Via

Access Paper or Ask Questions

Minimum Width for Universal Approximation

Jun 16, 2020

Sejun Park, Chulhee Yun, Jaeho Lee, Jinwoo Shin

Figure 1 for Minimum Width for Universal Approximation

Figure 2 for Minimum Width for Universal Approximation

Figure 3 for Minimum Width for Universal Approximation

Figure 4 for Minimum Width for Universal Approximation

Abstract:The universal approximation property of width-bounded networks has been studied as a dual of classical universal approximation results on depth-bounded networks. However, the critical width enabling the universal approximation has not been exactly characterized in terms of the input dimension $d_x$ and the output dimension $d_y$. In this work, we provide the first definitive result in this direction for networks using the ReLU activation functions: The minimum width required for the universal approximation of the $L^p$ functions is exactly $\max\{d_x+1,d_y\}$. We also prove that the same conclusion does not hold for the uniform approximation with ReLU, but does hold with an additional threshold activation function. Our proof technique can be also used to derive a tighter upper bound on the minimum width required for the universal approximation using networks with general activation functions.

Via

Access Paper or Ask Questions

Learning Bounds for Risk-sensitive Learning

Jun 15, 2020

Jaeho Lee, Sejun Park, Jinwoo Shin

Figure 1 for Learning Bounds for Risk-sensitive Learning

Figure 2 for Learning Bounds for Risk-sensitive Learning

Figure 3 for Learning Bounds for Risk-sensitive Learning

Abstract:In risk-sensitive learning, one aims to find a hypothesis that minimizes a risk-averse (or risk-seeking) measure of loss, instead of the standard expected loss. In this paper, we propose to study the generalization properties of risk-sensitive learning schemes whose optimand is described via optimized certainty equivalents (OCE): our general scheme can handle various known risks, e.g., the entropic risk, mean-variance, and conditional value-at-risk, as special cases. We provide two learning bounds on the performance of empirical OCE minimizer. The first result gives an OCE guarantee based on the Rademacher average of the hypothesis space, which generalizes and improves existing results on the expected loss and the conditional value-at-risk. The second result, based on a novel variance-based characterization of OCE, gives an expected loss guarantee with a suppressed dependence on the smoothness of the selected OCE. Finally, we demonstrate the practical implications of the proposed bounds via exploratory experiments on neural networks.

* 20 pages

Via

Access Paper or Ask Questions

Consistency Regularization for Certified Robustness of Smoothed Classifiers

Jun 07, 2020

Jongheon Jeong, Jinwoo Shin

Figure 1 for Consistency Regularization for Certified Robustness of Smoothed Classifiers

Figure 2 for Consistency Regularization for Certified Robustness of Smoothed Classifiers

Figure 3 for Consistency Regularization for Certified Robustness of Smoothed Classifiers

Figure 4 for Consistency Regularization for Certified Robustness of Smoothed Classifiers

Abstract:A recent technique of randomized smoothing has shown that the worst-case (adversarial) $\ell_2$-robustness can be transformed into the average-case Gaussian-robustness by "smoothing" a classifier, i.e., by considering the averaged prediction over Gaussian noise. In this paradigm, one should rethink the notion of adversarial robustness in terms of generalization ability of a classifier under noisy observations. We found that the trade-off between accuracy and certified robustness of smoothed classifiers can be greatly controlled by simply regularizing the prediction consistency over noise. This relationship allows us to design a robust training objective without approximating a non-existing smoothed classifier, e.g., via soft smoothing. Our experiments under various deep neural network architectures and datasets demonstrate that the "certified" $\ell_2$-robustness can be dramatically improved with the proposed regularization, even achieving better or comparable results to the state-of-the-art approaches with significantly less training costs and hyperparameters.

* 17 pages

Via

Access Paper or Ask Questions

Context-aware Dynamics Model for Generalization in Model-Based Reinforcement Learning

May 14, 2020

Kimin Lee, Younggyo Seo, Seunghyun Lee, Honglak Lee, Jinwoo Shin

Figure 1 for Context-aware Dynamics Model for Generalization in Model-Based Reinforcement Learning

Figure 2 for Context-aware Dynamics Model for Generalization in Model-Based Reinforcement Learning

Figure 3 for Context-aware Dynamics Model for Generalization in Model-Based Reinforcement Learning

Figure 4 for Context-aware Dynamics Model for Generalization in Model-Based Reinforcement Learning

Abstract:Model-based reinforcement learning (RL) enjoys several benefits, such as data-efficiency and planning, by learning a model of the environment's dynamics. However, learning a global model that can generalize across different dynamics is a challenging task. To tackle this problem, we decompose the task of learning a global dynamics model into two stages: (a) learning a context latent vector that captures the local dynamics, then (b) predicting the next state conditioned on it. In order to encode dynamics-specific information into the context latent vector, we introduce a novel loss function that encourages the context latent vector to be useful for predicting both forward and backward dynamics. The proposed method achieves superior generalization ability across various simulated robotics and control tasks, compared to existing RL schemes.

* First two authors contributed equally, website: https://sites.google.com/view/cadm code: https://github.com/younggyoseo/CaDM

Via

Access Paper or Ask Questions

Regularizing Class-wise Predictions via Self-knowledge Distillation

Apr 07, 2020

Sukmin Yun, Jongjin Park, Kimin Lee, Jinwoo Shin

Figure 1 for Regularizing Class-wise Predictions via Self-knowledge Distillation

Figure 2 for Regularizing Class-wise Predictions via Self-knowledge Distillation

Figure 3 for Regularizing Class-wise Predictions via Self-knowledge Distillation

Figure 4 for Regularizing Class-wise Predictions via Self-knowledge Distillation

Abstract:Deep neural networks with millions of parameters may suffer from poor generalization due to overfitting. To mitigate the issue, we propose a new regularization method that penalizes the predictive distribution between similar samples. In particular, we distill the predictive distribution between different samples of the same label during training. This results in regularizing the dark knowledge (i.e., the knowledge on wrong predictions) of a single network (i.e., a self-knowledge distillation) by forcing it to produce more meaningful and consistent predictions in a class-wise manner. Consequently, it mitigates overconfident predictions and reduces intra-class variations. Our experimental results on various image classification tasks demonstrate that the simple yet powerful method can significantly improve not only the generalization ability but also the calibration performance of modern convolutional neural networks.

* Accepted to CVPR 2020. Code is available at https://github.com/alinlab/cs-kd

Via

Access Paper or Ask Questions

M2m: Imbalanced Classification via Major-to-minor Translation

Apr 01, 2020

Jaehyung Kim, Jongheon Jeong, Jinwoo Shin

Figure 1 for M2m: Imbalanced Classification via Major-to-minor Translation

Figure 2 for M2m: Imbalanced Classification via Major-to-minor Translation

Figure 3 for M2m: Imbalanced Classification via Major-to-minor Translation

Figure 4 for M2m: Imbalanced Classification via Major-to-minor Translation

Abstract:In most real-world scenarios, labeled training datasets are highly class-imbalanced, where deep neural networks suffer from generalizing to a balanced testing criterion. In this paper, we explore a novel yet simple way to alleviate this issue by augmenting less-frequent classes via translating samples (e.g., images) from more-frequent classes. This simple approach enables a classifier to learn more generalizable features of minority classes, by transferring and leveraging the diversity of the majority information. Our experimental results on a variety of class-imbalanced datasets show that the proposed method improves the generalization on minority classes significantly compared to other existing re-sampling or re-weighting methods. The performance of our method even surpasses those of previous state-of-the-art methods for the imbalanced classification.

* 12 pages; Accepted to CVPR 2020

Via

Access Paper or Ask Questions