Cheolhyoung Lee

Unsupervised Learning of Initialization in Deep Neural Networks via Maximum Mean Discrepancy

Feb 08, 2023
Cheolhyoung Lee, Kyunghyun Cho

A Non-monotonic Self-terminating Language Model

Oct 03, 2022
Eugene Choi, Cheolhyoung Lee, Kyunghyun Cho

Mixout: Effective Regularization to Finetune Large-scale Pretrained Language Models

Sep 25, 2019
Cheolhyoung Lee, Kyunghyun Cho, Wanmo Kang

Directional Analysis of Stochastic Gradient Descent via von Mises-Fisher Distributions in Deep learning

Sep 29, 2018
Cheolhyoung Lee, Kyunghyun Cho, Wanmo Kang
