Alert button
Picture for Peng Shen

Peng Shen

Alert button

Generative linguistic representation for spoken language identification

Add code
Bookmark button
Alert button
Dec 18, 2023
Peng Shen, Xuguang Lu, Hisashi Kawai

Viaarxiv icon

Speaker Mask Transformer for Multi-talker Overlapped Speech Recognition

Add code
Bookmark button
Alert button
Dec 18, 2023
Peng Shen, Xugang Lu, Hisashi Kawai

Viaarxiv icon

Neural domain alignment for spoken language recognition based on optimal transport

Add code
Bookmark button
Alert button
Oct 20, 2023
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai

Viaarxiv icon

Hierarchical Cross-Modality Knowledge Transfer with Sinkhorn Attention for CTC-based ASR

Add code
Bookmark button
Alert button
Sep 28, 2023
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai

Viaarxiv icon

Cross-modal Alignment with Optimal Transport for CTC-based ASR

Add code
Bookmark button
Alert button
Sep 24, 2023
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai

Viaarxiv icon

Pronunciation-aware unique character encoding for RNN Transducer-based Mandarin speech recognition

Add code
Bookmark button
Alert button
Jul 29, 2022
Peng Shen, Xugang Lu, Hisashi Kawai

Figure 1 for Pronunciation-aware unique character encoding for RNN Transducer-based Mandarin speech recognition
Figure 2 for Pronunciation-aware unique character encoding for RNN Transducer-based Mandarin speech recognition
Figure 3 for Pronunciation-aware unique character encoding for RNN Transducer-based Mandarin speech recognition
Figure 4 for Pronunciation-aware unique character encoding for RNN Transducer-based Mandarin speech recognition
Viaarxiv icon

Transducer-based language embedding for spoken language identification

Add code
Bookmark button
Alert button
Apr 08, 2022
Peng Shen, Xugang Lu, Hisashi Kawai

Figure 1 for Transducer-based language embedding for spoken language identification
Figure 2 for Transducer-based language embedding for spoken language identification
Figure 3 for Transducer-based language embedding for spoken language identification
Viaarxiv icon

Partial Coupling of Optimal Transport for Spoken Language Identification

Add code
Bookmark button
Alert button
Mar 31, 2022
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai

Figure 1 for Partial Coupling of Optimal Transport for Spoken Language Identification
Figure 2 for Partial Coupling of Optimal Transport for Spoken Language Identification
Figure 3 for Partial Coupling of Optimal Transport for Spoken Language Identification
Figure 4 for Partial Coupling of Optimal Transport for Spoken Language Identification
Viaarxiv icon

Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification

Add code
Bookmark button
Alert button
Apr 07, 2021
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai

Figure 1 for Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification
Figure 2 for Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification
Figure 3 for Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification
Figure 4 for Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification
Viaarxiv icon

Integrating a joint Bayesian generative model in a discriminative learning framework for speaker verification

Add code
Bookmark button
Alert button
Jan 09, 2021
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai

Figure 1 for Integrating a joint Bayesian generative model in a discriminative learning framework for speaker verification
Figure 2 for Integrating a joint Bayesian generative model in a discriminative learning framework for speaker verification
Figure 3 for Integrating a joint Bayesian generative model in a discriminative learning framework for speaker verification
Figure 4 for Integrating a joint Bayesian generative model in a discriminative learning framework for speaker verification
Viaarxiv icon