Shukjae Choi

Boosting Unknown-number Speaker Separation with Transformer Decoder-based Attractor

Jan 23, 2024
Via arXiv

VoxtLM: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks

Sep 18, 2023
Via arXiv

Neural Speech Enhancement with Very Low Algorithmic Latency and Complexity via Integrated Full- and Sub-Band Modeling

Apr 18, 2023
Via arXiv

Joint unsupervised and supervised learning for context-aware language identification

Apr 14, 2023
Via arXiv

TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation

Nov 22, 2022
Via arXiv

TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation

Sep 08, 2022
Via arXiv