Picture for Saierdaer Yusuyin

Saierdaer Yusuyin

Advancing LLM-based phoneme-to-grapheme for multilingual speech recognition

Add code
Mar 31, 2026
Viaarxiv icon

CTC-TTS: LLM-based dual-streaming text-to-speech with CTC alignment

Add code
Feb 23, 2026
Viaarxiv icon

LLM-based phoneme-to-grapheme for phoneme-based speech recognition

Add code
Jun 05, 2025
Figure 1 for LLM-based phoneme-to-grapheme for phoneme-based speech recognition
Figure 2 for LLM-based phoneme-to-grapheme for phoneme-based speech recognition
Figure 3 for LLM-based phoneme-to-grapheme for phoneme-based speech recognition
Figure 4 for LLM-based phoneme-to-grapheme for phoneme-based speech recognition
Viaarxiv icon

Whistle: Data-Efficient Multilingual and Crosslingual Speech Recognition via Weakly Phonetic Supervision

Add code
Jun 04, 2024
Figure 1 for Whistle: Data-Efficient Multilingual and Crosslingual Speech Recognition via Weakly Phonetic Supervision
Figure 2 for Whistle: Data-Efficient Multilingual and Crosslingual Speech Recognition via Weakly Phonetic Supervision
Figure 3 for Whistle: Data-Efficient Multilingual and Crosslingual Speech Recognition via Weakly Phonetic Supervision
Figure 4 for Whistle: Data-Efficient Multilingual and Crosslingual Speech Recognition via Weakly Phonetic Supervision
Viaarxiv icon