Tan Lee

An ASR-free Fluency Scoring Approach with Self-Supervised Learning

Mar 13, 2023

Leveraging phone-level linguistic-acoustic similarity for utterance-level pronunciation scoring

Mar 13, 2023

Covariance Regularization for Probabilistic Linear Discriminant Analysis

Dec 06, 2022

Label-free Knowledge Distillation with Contrastive Loss for Light-weight Speaker Recognition

Dec 06, 2022

Model Compression for DNN-Based Text-Independent Speaker Verification Using Weight Quantization

Nov 14, 2022

Convolution-Based Channel-Frequency Attention for Text-Independent Speaker Verification

Oct 31, 2022

iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre

Jun 29, 2022

iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and Recognition

Jun 27, 2022

Transport-Oriented Feature Aggregation for Speaker Embedding Learning

Jun 26, 2022

Learnable Frequency Filters for Speech Feature Extraction in Speaker Verification

Jun 15, 2022