Picture for Yongyi Mao

Yongyi Mao

On the Generalization of Models Trained with SGD: Information-Theoretic Bounds and Implications

Add code
Oct 07, 2021
Figure 1 for On the Generalization of Models Trained with SGD: Information-Theoretic Bounds and Implications
Figure 2 for On the Generalization of Models Trained with SGD: Information-Theoretic Bounds and Implications
Figure 3 for On the Generalization of Models Trained with SGD: Information-Theoretic Bounds and Implications
Figure 4 for On the Generalization of Models Trained with SGD: Information-Theoretic Bounds and Implications
Viaarxiv icon

Robust Regularization with Adversarial Labelling of Perturbed Samples

Add code
May 28, 2021
Figure 1 for Robust Regularization with Adversarial Labelling of Perturbed Samples
Figure 2 for Robust Regularization with Adversarial Labelling of Perturbed Samples
Figure 3 for Robust Regularization with Adversarial Labelling of Perturbed Samples
Figure 4 for Robust Regularization with Adversarial Labelling of Perturbed Samples
Viaarxiv icon

On the Dynamics of Training Attention Models

Add code
Nov 19, 2020
Figure 1 for On the Dynamics of Training Attention Models
Figure 2 for On the Dynamics of Training Attention Models
Figure 3 for On the Dynamics of Training Attention Models
Figure 4 for On the Dynamics of Training Attention Models
Viaarxiv icon

Regularizing Neural Networks via Adversarial Model Perturbation

Add code
Oct 10, 2020
Figure 1 for Regularizing Neural Networks via Adversarial Model Perturbation
Figure 2 for Regularizing Neural Networks via Adversarial Model Perturbation
Figure 3 for Regularizing Neural Networks via Adversarial Model Perturbation
Figure 4 for Regularizing Neural Networks via Adversarial Model Perturbation
Viaarxiv icon

Neural Dialogue State Tracking with Temporally Expressive Networks

Add code
Oct 03, 2020
Figure 1 for Neural Dialogue State Tracking with Temporally Expressive Networks
Figure 2 for Neural Dialogue State Tracking with Temporally Expressive Networks
Figure 3 for Neural Dialogue State Tracking with Temporally Expressive Networks
Figure 4 for Neural Dialogue State Tracking with Temporally Expressive Networks
Viaarxiv icon

Parallel Interactive Networks for Multi-Domain Dialogue State Generation

Add code
Oct 03, 2020
Figure 1 for Parallel Interactive Networks for Multi-Domain Dialogue State Generation
Figure 2 for Parallel Interactive Networks for Multi-Domain Dialogue State Generation
Figure 3 for Parallel Interactive Networks for Multi-Domain Dialogue State Generation
Figure 4 for Parallel Interactive Networks for Multi-Domain Dialogue State Generation
Viaarxiv icon

On SkipGram Word Embedding Models with Negative Sampling: Unified Framework and Impact of Noise Distributions

Add code
Sep 02, 2020
Figure 1 for On SkipGram Word Embedding Models with Negative Sampling: Unified Framework and Impact of Noise Distributions
Figure 2 for On SkipGram Word Embedding Models with Negative Sampling: Unified Framework and Impact of Noise Distributions
Figure 3 for On SkipGram Word Embedding Models with Negative Sampling: Unified Framework and Impact of Noise Distributions
Figure 4 for On SkipGram Word Embedding Models with Negative Sampling: Unified Framework and Impact of Noise Distributions
Viaarxiv icon

Recurrent Interaction Network for Jointly Extracting Entities and Classifying Relations

Add code
May 01, 2020
Figure 1 for Recurrent Interaction Network for Jointly Extracting Entities and Classifying Relations
Figure 2 for Recurrent Interaction Network for Jointly Extracting Entities and Classifying Relations
Figure 3 for Recurrent Interaction Network for Jointly Extracting Entities and Classifying Relations
Figure 4 for Recurrent Interaction Network for Jointly Extracting Entities and Classifying Relations
Viaarxiv icon

Aggregated Learning: A Vector-Quantization Approach to Learning Neural Network Classifiers

Add code
Jan 12, 2020
Figure 1 for Aggregated Learning: A Vector-Quantization Approach to Learning Neural Network Classifiers
Figure 2 for Aggregated Learning: A Vector-Quantization Approach to Learning Neural Network Classifiers
Figure 3 for Aggregated Learning: A Vector-Quantization Approach to Learning Neural Network Classifiers
Figure 4 for Aggregated Learning: A Vector-Quantization Approach to Learning Neural Network Classifiers
Viaarxiv icon

Uncover the Ground-Truth Relations in Distant Supervision: A Neural Expectation-Maximization Framework

Add code
Sep 12, 2019
Figure 1 for Uncover the Ground-Truth Relations in Distant Supervision: A Neural Expectation-Maximization Framework
Figure 2 for Uncover the Ground-Truth Relations in Distant Supervision: A Neural Expectation-Maximization Framework
Figure 3 for Uncover the Ground-Truth Relations in Distant Supervision: A Neural Expectation-Maximization Framework
Figure 4 for Uncover the Ground-Truth Relations in Distant Supervision: A Neural Expectation-Maximization Framework
Viaarxiv icon