Alert button
Picture for Jyh-Shing Roger Jang

Jyh-Shing Roger Jang

Alert button

EMO-SUPERB: An In-depth Look at Speech Emotion Recognition

Add code
Bookmark button
Alert button
Feb 22, 2024
Haibin Wu, Huang-Cheng Chou, Kai-Wei Chang, Lucas Goncalves, Jiawei Du, Jyh-Shing Roger Jang, Chi-Chun Lee, Hung-Yi Lee

Viaarxiv icon

Applications of Large Language Models in Data Processing: Innovative Approaches to Segmenting and Renewing Information

Add code
Bookmark button
Alert button
Nov 27, 2023
Yu-Chen Lin, Akhilesh Kumar, Wen-Liang Zhang, Norman Chang, Muhammad Zakir, Rucha Apte, Chao Wang, Jyh-Shing Roger Jang

Viaarxiv icon

Adapting pretrained speech model for Mandarin lyrics transcription and alignment

Add code
Bookmark button
Alert button
Nov 21, 2023
Jun-You Wang, Chon-In Leong, Yu-Chen Lin, Li Su, Jyh-Shing Roger Jang

Viaarxiv icon

WC-SBERT: Zero-Shot Text Classification via SBERT with Self-Training for Wikipedia Categories

Add code
Bookmark button
Alert button
Jul 28, 2023
Te-Yu Chi, Yu-Meng Tang, Chia-Wen Lu, Qiu-Xia Zhang, Jyh-Shing Roger Jang

Figure 1 for WC-SBERT: Zero-Shot Text Classification via SBERT with Self-Training for Wikipedia Categories
Figure 2 for WC-SBERT: Zero-Shot Text Classification via SBERT with Self-Training for Wikipedia Categories
Figure 3 for WC-SBERT: Zero-Shot Text Classification via SBERT with Self-Training for Wikipedia Categories
Figure 4 for WC-SBERT: Zero-Shot Text Classification via SBERT with Self-Training for Wikipedia Categories
Viaarxiv icon

Personalized Audio Quality Preference Prediction

Add code
Bookmark button
Alert button
Feb 16, 2023
Chung-Che Wang, Yu-Chun Lin, Yu-Teng Hsu, Jyh-Shing Roger Jang

Figure 1 for Personalized Audio Quality Preference Prediction
Figure 2 for Personalized Audio Quality Preference Prediction
Figure 3 for Personalized Audio Quality Preference Prediction
Figure 4 for Personalized Audio Quality Preference Prediction
Viaarxiv icon

Multimodal Transformer Distillation for Audio-Visual Synchronization

Add code
Bookmark button
Alert button
Oct 27, 2022
Xuanjun Chen, Haibin Wu, Chung-Che Wang, Hung-yi Lee, Jyh-Shing Roger Jang

Figure 1 for Multimodal Transformer Distillation for Audio-Visual Synchronization
Figure 2 for Multimodal Transformer Distillation for Audio-Visual Synchronization
Figure 3 for Multimodal Transformer Distillation for Audio-Visual Synchronization
Figure 4 for Multimodal Transformer Distillation for Audio-Visual Synchronization
Viaarxiv icon

Push-Pull: Characterizing the Adversarial Robustness for Audio-Visual Active Speaker Detection

Add code
Bookmark button
Alert button
Oct 03, 2022
Xuanjun Chen, Haibin Wu, Helen Meng, Hung-yi Lee, Jyh-Shing Roger Jang

Figure 1 for Push-Pull: Characterizing the Adversarial Robustness for Audio-Visual Active Speaker Detection
Figure 2 for Push-Pull: Characterizing the Adversarial Robustness for Audio-Visual Active Speaker Detection
Figure 3 for Push-Pull: Characterizing the Adversarial Robustness for Audio-Visual Active Speaker Detection
Figure 4 for Push-Pull: Characterizing the Adversarial Robustness for Audio-Visual Active Speaker Detection
Viaarxiv icon

Training strategy for a lightweight countermeasure model for automatic speaker verification

Add code
Bookmark button
Alert button
Apr 08, 2022
Yen-Lun Liao, Xuanjun Chen, Chung-Che Wang, Jyh-Shing Roger Jang

Figure 1 for Training strategy for a lightweight countermeasure model for automatic speaker verification
Figure 2 for Training strategy for a lightweight countermeasure model for automatic speaker verification
Figure 3 for Training strategy for a lightweight countermeasure model for automatic speaker verification
Figure 4 for Training strategy for a lightweight countermeasure model for automatic speaker verification
Viaarxiv icon

towards automatic transcription of polyphonic electric guitar music:a new dataset and a multi-loss transformer model

Add code
Bookmark button
Alert button
Feb 20, 2022
Yu-Hua Chen, Wen-Yi Hsiao, Tsu-Kuang Hsieh, Jyh-Shing Roger Jang, Yi-Hsuan Yang

Figure 1 for towards automatic transcription of polyphonic electric guitar music:a new dataset and a multi-loss transformer model
Figure 2 for towards automatic transcription of polyphonic electric guitar music:a new dataset and a multi-loss transformer model
Figure 3 for towards automatic transcription of polyphonic electric guitar music:a new dataset and a multi-loss transformer model
Figure 4 for towards automatic transcription of polyphonic electric guitar music:a new dataset and a multi-loss transformer model
Viaarxiv icon

Singer separation for karaoke content generation

Add code
Bookmark button
Alert button
Oct 13, 2021
Hsuan-Yu Chen, Xuanjun Chen, Jyh-Shing Roger Jang

Figure 1 for Singer separation for karaoke content generation
Figure 2 for Singer separation for karaoke content generation
Figure 3 for Singer separation for karaoke content generation
Figure 4 for Singer separation for karaoke content generation
Viaarxiv icon