Picture for Aleksandr Shevchenko

Aleksandr Shevchenko

Attention with Trained Embeddings Provably Selects Important Tokens

Add code
May 22, 2025
Viaarxiv icon

Scaling Matters in Deep Structured-Prediction Models

Add code
Feb 28, 2019
Figure 1 for Scaling Matters in Deep Structured-Prediction Models
Figure 2 for Scaling Matters in Deep Structured-Prediction Models
Figure 3 for Scaling Matters in Deep Structured-Prediction Models
Figure 4 for Scaling Matters in Deep Structured-Prediction Models
Viaarxiv icon