Homayoon Beigi

Department of Computer Science, Columbia University; Recognition Technologies, Inc., South Salem, New York, United States

Quality-Controlled Multimodal Emotion Recognition in Conversations with Identity-Based Transfer Learning and MAMBA Fusion

Nov 18, 2025

Spontaneous Informal Speech Dataset for Punctuation Restoration

Sep 17, 2024

Carnatic Raga Identification System using Rigorous Time-Delay Neural Network

May 25, 2024

Robust Open-Set Spoken Language Identification and the CU MultiLang Dataset

Aug 29, 2023

Efficient Ensemble Architecture for Multimodal Acoustic and Textual Embeddings in Punctuation Restoration using Time-Delay Neural Networks

Feb 26, 2023

A Transaction Represented with Weighted Finite-State Transducers

Feb 01, 2023

Modernizing Open-Set Speech Language Identification

May 20, 2022

Bi-LSTM Scoring Based Similarity Measurement with Agglomerative Hierarchical Clustering (AHC) for Speaker Diarization

May 19, 2022

Automatic Spoken Language Identification using a Time-Delay Neural Network

May 19, 2022

Multi-Modal Emotion Detection with Transfer Learning

Nov 13, 2020