Alert button
Picture for Shun Kiyono

Shun Kiyono

Alert button

Spike No More: Stabilizing the Pre-training of Large Language Models

Add code
Bookmark button
Alert button
Dec 28, 2023
Sho Takase, Shun Kiyono, Sosuke Kobayashi, Jun Suzuki

Viaarxiv icon

On Layer Normalizations and Residual Connections in Transformers

Add code
Bookmark button
Alert button
Jun 01, 2022
Sho Takase, Shun Kiyono, Sosuke Kobayashi, Jun Suzuki

Figure 1 for On Layer Normalizations and Residual Connections in Transformers
Figure 2 for On Layer Normalizations and Residual Connections in Transformers
Figure 3 for On Layer Normalizations and Residual Connections in Transformers
Figure 4 for On Layer Normalizations and Residual Connections in Transformers
Viaarxiv icon

Diverse Lottery Tickets Boost Ensemble from a Single Pretrained Model

Add code
Bookmark button
Alert button
May 24, 2022
Sosuke Kobayashi, Shun Kiyono, Jun Suzuki, Kentaro Inui

Figure 1 for Diverse Lottery Tickets Boost Ensemble from a Single Pretrained Model
Figure 2 for Diverse Lottery Tickets Boost Ensemble from a Single Pretrained Model
Figure 3 for Diverse Lottery Tickets Boost Ensemble from a Single Pretrained Model
Figure 4 for Diverse Lottery Tickets Boost Ensemble from a Single Pretrained Model
Viaarxiv icon

SHAPE: Shifted Absolute Position Embedding for Transformers

Add code
Bookmark button
Alert button
Sep 13, 2021
Shun Kiyono, Sosuke Kobayashi, Jun Suzuki, Kentaro Inui

Figure 1 for SHAPE: Shifted Absolute Position Embedding for Transformers
Figure 2 for SHAPE: Shifted Absolute Position Embedding for Transformers
Figure 3 for SHAPE: Shifted Absolute Position Embedding for Transformers
Figure 4 for SHAPE: Shifted Absolute Position Embedding for Transformers
Viaarxiv icon

Pseudo Zero Pronoun Resolution Improves Zero Anaphora Resolution

Add code
Bookmark button
Alert button
Apr 15, 2021
Ryuto Konno, Shun Kiyono, Yuichiroh Matsubayashi, Hiroki Ouchi, Kentaro Inui

Figure 1 for Pseudo Zero Pronoun Resolution Improves Zero Anaphora Resolution
Figure 2 for Pseudo Zero Pronoun Resolution Improves Zero Anaphora Resolution
Figure 3 for Pseudo Zero Pronoun Resolution Improves Zero Anaphora Resolution
Figure 4 for Pseudo Zero Pronoun Resolution Improves Zero Anaphora Resolution
Viaarxiv icon

Lessons on Parameter Sharing across Layers in Transformers

Add code
Bookmark button
Alert button
Apr 13, 2021
Sho Takase, Shun Kiyono

Figure 1 for Lessons on Parameter Sharing across Layers in Transformers
Figure 2 for Lessons on Parameter Sharing across Layers in Transformers
Figure 3 for Lessons on Parameter Sharing across Layers in Transformers
Figure 4 for Lessons on Parameter Sharing across Layers in Transformers
Viaarxiv icon

Rethinking Perturbations in Encoder-Decoders for Fast Training

Add code
Bookmark button
Alert button
Apr 05, 2021
Sho Takase, Shun Kiyono

Figure 1 for Rethinking Perturbations in Encoder-Decoders for Fast Training
Figure 2 for Rethinking Perturbations in Encoder-Decoders for Fast Training
Figure 3 for Rethinking Perturbations in Encoder-Decoders for Fast Training
Figure 4 for Rethinking Perturbations in Encoder-Decoders for Fast Training
Viaarxiv icon

An Empirical Study of Contextual Data Augmentation for Japanese Zero Anaphora Resolution

Add code
Bookmark button
Alert button
Nov 04, 2020
Ryuto Konno, Yuichiroh Matsubayashi, Shun Kiyono, Hiroki Ouchi, Ryo Takahashi, Kentaro Inui

Figure 1 for An Empirical Study of Contextual Data Augmentation for Japanese Zero Anaphora Resolution
Figure 2 for An Empirical Study of Contextual Data Augmentation for Japanese Zero Anaphora Resolution
Figure 3 for An Empirical Study of Contextual Data Augmentation for Japanese Zero Anaphora Resolution
Figure 4 for An Empirical Study of Contextual Data Augmentation for Japanese Zero Anaphora Resolution
Viaarxiv icon

A Self-Refinement Strategy for Noise Reduction in Grammatical Error Correction

Add code
Bookmark button
Alert button
Oct 07, 2020
Masato Mita, Shun Kiyono, Masahiro Kaneko, Jun Suzuki, Kentaro Inui

Figure 1 for A Self-Refinement Strategy for Noise Reduction in Grammatical Error Correction
Figure 2 for A Self-Refinement Strategy for Noise Reduction in Grammatical Error Correction
Figure 3 for A Self-Refinement Strategy for Noise Reduction in Grammatical Error Correction
Figure 4 for A Self-Refinement Strategy for Noise Reduction in Grammatical Error Correction
Viaarxiv icon