Alert button
Picture for Byeong-Yeol Kim

Byeong-Yeol Kim

Alert button

Boosting Unknown-number Speaker Separation with Transformer Decoder-based Attractor

Add code
Bookmark button
Alert button
Jan 23, 2024
Younglo Lee, Shukjae Choi, Byeong-Yeol Kim, Zhong-Qiu Wang, Shinji Watanabe

Viaarxiv icon

Neural Speech Enhancement with Very Low Algorithmic Latency and Complexity via Integrated Full- and Sub-Band Modeling

Add code
Bookmark button
Alert button
Apr 18, 2023
Zhong-Qiu Wang, Samuele Cornell, Shukjae Choi, Younglo Lee, Byeong-Yeol Kim, Shinji Watanabe

Figure 1 for Neural Speech Enhancement with Very Low Algorithmic Latency and Complexity via Integrated Full- and Sub-Band Modeling
Figure 2 for Neural Speech Enhancement with Very Low Algorithmic Latency and Complexity via Integrated Full- and Sub-Band Modeling
Figure 3 for Neural Speech Enhancement with Very Low Algorithmic Latency and Complexity via Integrated Full- and Sub-Band Modeling
Figure 4 for Neural Speech Enhancement with Very Low Algorithmic Latency and Complexity via Integrated Full- and Sub-Band Modeling
Viaarxiv icon

Joint unsupervised and supervised learning for context-aware language identification

Add code
Bookmark button
Alert button
Apr 14, 2023
Jinseok Park, Hyung Yong Kim, Jihwan Park, Byeong-Yeol Kim, Shukjae Choi, Yunkyu Lim

Figure 1 for Joint unsupervised and supervised learning for context-aware language identification
Figure 2 for Joint unsupervised and supervised learning for context-aware language identification
Figure 3 for Joint unsupervised and supervised learning for context-aware language identification
Figure 4 for Joint unsupervised and supervised learning for context-aware language identification
Viaarxiv icon

That's What I Said: Fully-Controllable Talking Face Generation

Add code
Bookmark button
Alert button
Apr 06, 2023
Youngjoon Jang, Kyeongha Rho, Jong-Bin Woo, Hyeongkeun Lee, Jihwan Park, Youshin Lim, Byeong-Yeol Kim, Joon Son Chung

Figure 1 for That's What I Said: Fully-Controllable Talking Face Generation
Figure 2 for That's What I Said: Fully-Controllable Talking Face Generation
Figure 3 for That's What I Said: Fully-Controllable Talking Face Generation
Figure 4 for That's What I Said: Fully-Controllable Talking Face Generation
Viaarxiv icon

CrossSpeech: Speaker-independent Acoustic Representation for Cross-lingual Speech Synthesis

Add code
Bookmark button
Alert button
Feb 28, 2023
Ji-Hoon Kim, Hong-Sun Yang, Yoon-Cheol Ju, Il-Hwan Kim, Byeong-Yeol Kim

Figure 1 for CrossSpeech: Speaker-independent Acoustic Representation for Cross-lingual Speech Synthesis
Figure 2 for CrossSpeech: Speaker-independent Acoustic Representation for Cross-lingual Speech Synthesis
Figure 3 for CrossSpeech: Speaker-independent Acoustic Representation for Cross-lingual Speech Synthesis
Figure 4 for CrossSpeech: Speaker-independent Acoustic Representation for Cross-lingual Speech Synthesis
Viaarxiv icon

TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation

Add code
Bookmark button
Alert button
Nov 22, 2022
Zhong-Qiu Wang, Samuele Cornell, Shukjae Choi, Younglo Lee, Byeong-Yeol Kim, Shinji Watanabe

Figure 1 for TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation
Figure 2 for TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation
Figure 3 for TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation
Figure 4 for TF-GridNet: Integrating Full- and Sub-Band Modeling for Speech Separation
Viaarxiv icon

Metric Learning for User-defined Keyword Spotting

Add code
Bookmark button
Alert button
Nov 01, 2022
Jaemin Jung, Youkyum Kim, Jihwan Park, Youshin Lim, Byeong-Yeol Kim, Youngjoon Jang, Joon Son Chung

Figure 1 for Metric Learning for User-defined Keyword Spotting
Figure 2 for Metric Learning for User-defined Keyword Spotting
Figure 3 for Metric Learning for User-defined Keyword Spotting
Figure 4 for Metric Learning for User-defined Keyword Spotting
Viaarxiv icon

TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation

Add code
Bookmark button
Alert button
Sep 08, 2022
Zhong-Qiu Wang, Samuele Cornell, Shukjae Choi, Younglo Lee, Byeong-Yeol Kim, Shinji Watanabe

Figure 1 for TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation
Figure 2 for TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation
Figure 3 for TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation
Figure 4 for TF-GridNet: Making Time-Frequency Domain Models Great Again for Monaural Speaker Separation
Viaarxiv icon