Catherine Lai

Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling

Sep 25, 2024
Via arXiv

Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition

Sep 17, 2024
Via arXiv

Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques

Jun 12, 2024
Via arXiv

1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem

May 30, 2024
Via arXiv

Crossmodal ASR Error Correction with Discrete Speech Units

May 26, 2024
Via arXiv

Layer-Wise Analysis of Self-Supervised Acoustic Word Embeddings: A Study on Speech Emotion Recognition

Feb 04, 2024
Via arXiv

Quantifying the perceptual value of lexical and non-lexical channels in speech

Jul 07, 2023
Via arXiv

Transfer Learning for Personality Perception via Speech Emotion Recognition

May 25, 2023
Via arXiv

ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition

May 25, 2023
Via arXiv

Cross-Attention is Not Enough: Incongruity-Aware Multimodal Sentiment Analysis and Emotion Recognition

May 23, 2023
Via arXiv