Alert button
Picture for Yu Tsao

Yu Tsao

Alert button

Multi-objective Non-intrusive Hearing-aid Speech Assessment Model

Add code
Bookmark button
Alert button
Nov 15, 2023
Hsin-Tien Chiang, Szu-Wei Fu, Hsin-Min Wang, Yu Tsao, John H. L. Hansen

Figure 1 for Multi-objective Non-intrusive Hearing-aid Speech Assessment Model
Figure 2 for Multi-objective Non-intrusive Hearing-aid Speech Assessment Model
Figure 3 for Multi-objective Non-intrusive Hearing-aid Speech Assessment Model
Figure 4 for Multi-objective Non-intrusive Hearing-aid Speech Assessment Model
Viaarxiv icon

AV-Lip-Sync+: Leveraging AV-HuBERT to Exploit Multimodal Inconsistency for Video Deepfake Detection

Add code
Bookmark button
Alert button
Nov 05, 2023
Sahibzada Adil Shahzad, Ammarah Hashmi, Yan-Tsung Peng, Yu Tsao, Hsin-Min Wang

Viaarxiv icon

Neural domain alignment for spoken language recognition based on optimal transport

Add code
Bookmark button
Alert button
Oct 20, 2023
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai

Viaarxiv icon

AVTENet: Audio-Visual Transformer-based Ensemble Network Exploiting Multiple Experts for Video Deepfake Detection

Add code
Bookmark button
Alert button
Oct 19, 2023
Ammarah Hashmi, Sahibzada Adil Shahzad, Chia-Wen Lin, Yu Tsao, Hsin-Min Wang

Viaarxiv icon

The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains

Add code
Bookmark button
Alert button
Oct 07, 2023
Erica Cooper, Wen-Chin Huang, Yu Tsao, Hsin-Min Wang, Tomoki Toda, Junichi Yamagishi

Figure 1 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Figure 2 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Figure 3 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Figure 4 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Viaarxiv icon

Hierarchical Cross-Modality Knowledge Transfer with Sinkhorn Attention for CTC-based ASR

Add code
Bookmark button
Alert button
Sep 28, 2023
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai

Viaarxiv icon

Cross-modal Alignment with Optimal Transport for CTC-based ASR

Add code
Bookmark button
Alert button
Sep 24, 2023
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai

Viaarxiv icon

A Study on Incorporating Whisper for Robust Speech Assessment

Add code
Bookmark button
Alert button
Sep 22, 2023
Ryandhimas E. Zezario, Yu-Wen Chen, Yu Tsao, Szu-Wei Fu, Hsin-Min Wang, Chiou-Shann Fuh

Figure 1 for A Study on Incorporating Whisper for Robust Speech Assessment
Figure 2 for A Study on Incorporating Whisper for Robust Speech Assessment
Figure 3 for A Study on Incorporating Whisper for Robust Speech Assessment
Figure 4 for A Study on Incorporating Whisper for Robust Speech Assessment
Viaarxiv icon

Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement

Add code
Bookmark button
Alert button
Sep 20, 2023
Shafique Ahmed, Chia-Wei Chen, Wenze Ren, Chin-Jou Li, Ernie Chu, Jun-Cheng Chen, Amir Hussain, Hsin-Min Wang, Yu Tsao, Jen-Cheng Hou

Figure 1 for Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement
Figure 2 for Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement
Figure 3 for Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement
Figure 4 for Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement
Viaarxiv icon