
Hsin-Min Wang


Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement

Sep 20, 2023
Shafique Ahmed, Chia-Wei Chen, Wenze Ren, Chin-Jou Li, Ernie Chu, Jun-Cheng Chen, Amir Hussain, Hsin-Min Wang, Yu Tsao, Jen-Cheng Hou

Via arXiv

Utilizing Whisper to Enhance Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids

Sep 18, 2023
Ryandhimas E. Zezario, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao

Via arXiv

Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model

Aug 18, 2023
Ryandhimas E. Zezario, Bo-Ren Brian Bai, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao

Via arXiv

Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features

Jun 11, 2023
Hsin-Hao Chen, Yung-Lun Chien, Ming-Chi Yen, Shu-Wei Tsai, Yu Tsao, Tai-Shih Chi, Hsin-Min Wang

Via arXiv

Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion

Jun 11, 2023
Yung-Lun Chien, Hsin-Hao Chen, Ming-Chi Yen, Shu-Wei Tsai, Hsin-Min Wang, Yu Tsao, Tai-Shih Chi

Via arXiv

CasNet: Investigating Channel Robustness for Speech Separation

Oct 27, 2022
Fan-Lin Wang, Yao-Fei Cheng, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang

Via arXiv

A Teacher-student Framework for Unsupervised Speech Enhancement Using Noise Remixing Training and Two-stage Inference

Oct 27, 2022
Li-Wei Chen, Yao-Fei Cheng, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang

Via arXiv

Mandarin Singing Voice Synthesis with Denoising Diffusion Probabilistic Wasserstein GAN

Sep 21, 2022
Yin-Ping Cho, Yu Tsao, Hsin-Min Wang, Yi-Wen Liu

Via arXiv

NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling

Jun 18, 2022
Chi-Chang Lee, Cheng-Hung Hu, Yu-Chen Lin, Chu-Song Chen, Hsin-Min Wang, Yu Tsao

Via arXiv

A Study of Using Cepstrogram for Countermeasure Against Replay Attacks

Apr 09, 2022
Shih-Kuang Lee, Yu Tsao, Hsin-Min Wang

Via arXiv