Alert button
Picture for Yu Tsao

Yu Tsao

Alert button

AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models

Sep 19, 2023
Yuan Tseng, Layne Berry, Yi-Ting Chen, I-Hsiang Chiu, Hsuan-Hao Lin, Max Liu, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Po-Yao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao, Shinji Watanabe, Abdelrahman Mohamed, Chi-Luen Feng, Hung-yi Lee

Figure 1 for AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models
Figure 2 for AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models
Figure 3 for AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models
Viaarxiv icon

Utilizing Whisper to Enhance Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids

Sep 18, 2023
Ryandhimas E. Zezario, Fei Chen, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao

Figure 1 for Utilizing Whisper to Enhance Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids
Figure 2 for Utilizing Whisper to Enhance Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids
Figure 3 for Utilizing Whisper to Enhance Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids
Figure 4 for Utilizing Whisper to Enhance Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids
Viaarxiv icon

Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement

Sep 03, 2023
Yu-Wen Chen, Julia Hirschberg, Yu Tsao

Figure 1 for Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement
Figure 2 for Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement
Figure 3 for Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement
Figure 4 for Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement
Viaarxiv icon

Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model

Aug 18, 2023
Ryandhimas E. Zezario, Bo-Ren Brian Bai, Chiou-Shann Fuh, Hsin-Min Wang, Yu Tsao

Figure 1 for Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model
Figure 2 for Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model
Figure 3 for Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model
Figure 4 for Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model
Viaarxiv icon

Audio-Visual Speech Enhancement Using Self-supervised Learning to Improve Speech Intelligibility in Cochlear Implant Simulations

Jul 15, 2023
Richard Lee Lai, Jen-Cheng Hou, Mandar Gogate, Kia Dashtipour, Amir Hussain, Yu Tsao

Figure 1 for Audio-Visual Speech Enhancement Using Self-supervised Learning to Improve Speech Intelligibility in Cochlear Implant Simulations
Figure 2 for Audio-Visual Speech Enhancement Using Self-supervised Learning to Improve Speech Intelligibility in Cochlear Implant Simulations
Figure 3 for Audio-Visual Speech Enhancement Using Self-supervised Learning to Improve Speech Intelligibility in Cochlear Implant Simulations
Figure 4 for Audio-Visual Speech Enhancement Using Self-supervised Learning to Improve Speech Intelligibility in Cochlear Implant Simulations
Viaarxiv icon

Study on the Correlation between Objective Evaluations and Subjective Speech Quality and Intelligibility

Jul 10, 2023
Hsin-Tien Chiang, Kuo-Hsuan Hung, Szu-Wei Fu, Heng-Cheng Kuo, Ming-Hsueh Tsai, Yu Tsao

Figure 1 for Study on the Correlation between Objective Evaluations and Subjective Speech Quality and Intelligibility
Figure 2 for Study on the Correlation between Objective Evaluations and Subjective Speech Quality and Intelligibility
Figure 3 for Study on the Correlation between Objective Evaluations and Subjective Speech Quality and Intelligibility
Figure 4 for Study on the Correlation between Objective Evaluations and Subjective Speech Quality and Intelligibility
Viaarxiv icon

IANS: Intelligibility-aware Null-steering Beamforming for Dual-Microphone Arrays

Jul 09, 2023
Wen-Yuan Ting, Syu-Siang Wang, Yu Tsao, Borching Su

Figure 1 for IANS: Intelligibility-aware Null-steering Beamforming for Dual-Microphone Arrays
Figure 2 for IANS: Intelligibility-aware Null-steering Beamforming for Dual-Microphone Arrays
Viaarxiv icon

Deep denoising autoencoder-based non-invasive blood flow detection for arteriovenous fistula

Jun 12, 2023
Li-Chin Chen, Yi-Heng Lin, Li-Ning Peng, Feng-Ming Wang, Yu-Hsin Chen, Po-Hsun Huang, Shang-Feng Yang, Yu Tsao

Figure 1 for Deep denoising autoencoder-based non-invasive blood flow detection for arteriovenous fistula
Figure 2 for Deep denoising autoencoder-based non-invasive blood flow detection for arteriovenous fistula
Figure 3 for Deep denoising autoencoder-based non-invasive blood flow detection for arteriovenous fistula
Figure 4 for Deep denoising autoencoder-based non-invasive blood flow detection for arteriovenous fistula
Viaarxiv icon

Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features

Jun 11, 2023
Hsin-Hao Chen, Yung-Lun Chien, Ming-Chi Yen, Shu-Wei Tsai, Yu Tsao, Tai-shih Chi, Hsin-Min Wang

Figure 1 for Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features
Figure 2 for Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features
Figure 3 for Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features
Viaarxiv icon

Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion

Jun 11, 2023
Yung-Lun Chien, Hsin-Hao Chen, Ming-Chi Yen, Shu-Wei Tsai, Hsin-Min Wang, Yu Tsao, Tai-Shih Chi

Figure 1 for Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion
Figure 2 for Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion
Figure 3 for Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion
Figure 4 for Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion
Viaarxiv icon