Peter Bell

TTSDS -- Text-to-Speech Distribution Score

Jul 17, 2024
Via arXiv

Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques

Jun 12, 2024
Via arXiv

Phonetic Error Analysis of Raw Waveform Acoustic Models with Parametric and Non-Parametric CNNs

Jun 02, 2024
Via arXiv

Explainable Attribute-Based Speaker Verification

May 30, 2024
Via arXiv

1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem

May 30, 2024
Via arXiv

Crossmodal ASR Error Correction with Discrete Speech Units

May 26, 2024
Via arXiv

LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots

Apr 22, 2024
Via arXiv

Quantifying the perceptual value of lexical and non-lexical channels in speech

Jul 07, 2023
Via arXiv

Can We Trust Explainable AI Methods on ASR? An Evaluation on Phoneme Recognition

May 29, 2023
Via arXiv

Transfer Learning for Personality Perception via Speech Emotion Recognition

May 25, 2023
Via arXiv