Jaesong Lee

I3D: Transformer architectures with input-dependent dynamic depth for speech recognition

Mar 14, 2023
Yifan Peng, Jaesong Lee, Shinji Watanabe

Via arXiv

Better Intermediates Improve CTC Inference

Apr 01, 2022
Tatsuya Komatsu, Yusuke Fujita, Jaesong Lee, Lukas Lee, Shinji Watanabe, Yusuke Kida

Via arXiv

Memory-Efficient Training of RNN-Transducer with Sampled Softmax

Mar 31, 2022
Jaesong Lee, Lukas Lee, Shinji Watanabe

Via arXiv

A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation

Oct 11, 2021
Yosuke Higuchi, Nanxin Chen, Yuya Fujita, Hirofumi Inaguma, Tatsuya Komatsu, Jaesong Lee, Jumon Nozaki, Tianzi Wang, Shinji Watanabe

Via arXiv

Layer Pruning on Demand with Intermediate CTC

Jun 17, 2021
Jaesong Lee, Jingu Kang, Shinji Watanabe

Via arXiv

Intermediate Loss Regularization for CTC-based Speech Recognition

Feb 05, 2021
Jaesong Lee, Shinji Watanabe

Via arXiv