Dogan Can

Segmental Attention Decoding With Long Form Acoustic Encodings

Dec 16, 2025
Via arXiv

Optimizing Contextual Speech Recognition Using Vector Quantization for Efficient Retrieval

Nov 04, 2024
Via arXiv

Variable Attention Masking for Configurable Transformer Transducer Speech Recognition

Nov 02, 2022
Via arXiv

Online Automatic Speech Recognition with Listen, Attend and Spell Model

Aug 12, 2020
Via arXiv