Picture for Seungyeon Kim

Seungyeon Kim

Faster Cascades via Speculative Decoding

Add code
May 29, 2024
Viaarxiv icon

Supervision Complexity and its Role in Knowledge Distillation

Add code
Jan 28, 2023
Figure 1 for Supervision Complexity and its Role in Knowledge Distillation
Figure 2 for Supervision Complexity and its Role in Knowledge Distillation
Figure 3 for Supervision Complexity and its Role in Knowledge Distillation
Figure 4 for Supervision Complexity and its Role in Knowledge Distillation
Viaarxiv icon

EmbedDistill: A Geometric Knowledge Distillation for Information Retrieval

Add code
Jan 27, 2023
Figure 1 for EmbedDistill: A Geometric Knowledge Distillation for Information Retrieval
Figure 2 for EmbedDistill: A Geometric Knowledge Distillation for Information Retrieval
Figure 3 for EmbedDistill: A Geometric Knowledge Distillation for Information Retrieval
Figure 4 for EmbedDistill: A Geometric Knowledge Distillation for Information Retrieval
Viaarxiv icon

Teacher Guided Training: An Efficient Framework for Knowledge Transfer

Add code
Aug 14, 2022
Figure 1 for Teacher Guided Training: An Efficient Framework for Knowledge Transfer
Figure 2 for Teacher Guided Training: An Efficient Framework for Knowledge Transfer
Figure 3 for Teacher Guided Training: An Efficient Framework for Knowledge Transfer
Figure 4 for Teacher Guided Training: An Efficient Framework for Knowledge Transfer
Viaarxiv icon

Balancing Robustness and Sensitivity using Feature Contrastive Learning

Add code
May 19, 2021
Figure 1 for Balancing Robustness and Sensitivity using Feature Contrastive Learning
Figure 2 for Balancing Robustness and Sensitivity using Feature Contrastive Learning
Figure 3 for Balancing Robustness and Sensitivity using Feature Contrastive Learning
Figure 4 for Balancing Robustness and Sensitivity using Feature Contrastive Learning
Viaarxiv icon

On the Reproducibility of Neural Network Predictions

Add code
Feb 05, 2021
Figure 1 for On the Reproducibility of Neural Network Predictions
Figure 2 for On the Reproducibility of Neural Network Predictions
Figure 3 for On the Reproducibility of Neural Network Predictions
Figure 4 for On the Reproducibility of Neural Network Predictions
Viaarxiv icon

Semantic Label Smoothing for Sequence to Sequence Problems

Add code
Oct 15, 2020
Figure 1 for Semantic Label Smoothing for Sequence to Sequence Problems
Figure 2 for Semantic Label Smoothing for Sequence to Sequence Problems
Figure 3 for Semantic Label Smoothing for Sequence to Sequence Problems
Figure 4 for Semantic Label Smoothing for Sequence to Sequence Problems
Viaarxiv icon

Evaluations and Methods for Explanation through Robustness Analysis

Add code
May 31, 2020
Figure 1 for Evaluations and Methods for Explanation through Robustness Analysis
Figure 2 for Evaluations and Methods for Explanation through Robustness Analysis
Figure 3 for Evaluations and Methods for Explanation through Robustness Analysis
Figure 4 for Evaluations and Methods for Explanation through Robustness Analysis
Viaarxiv icon

Why distillation helps: a statistical perspective

Add code
May 21, 2020
Figure 1 for Why distillation helps: a statistical perspective
Figure 2 for Why distillation helps: a statistical perspective
Figure 3 for Why distillation helps: a statistical perspective
Figure 4 for Why distillation helps: a statistical perspective
Viaarxiv icon

Why ADAM Beats SGD for Attention Models

Add code
Dec 06, 2019
Figure 1 for Why ADAM Beats SGD for Attention Models
Figure 2 for Why ADAM Beats SGD for Attention Models
Figure 3 for Why ADAM Beats SGD for Attention Models
Figure 4 for Why ADAM Beats SGD for Attention Models
Viaarxiv icon