Alert button

"speech": models, code, and papers
Alert button

CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic Network

Add code
Bookmark button
Alert button
Mar 10, 2023
Sreyan Ghosh, Manan Suri, Purva Chiniya, Utkarsh Tyagi, Sonal Kumar, Dinesh Manocha

Figure 1 for CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic Network
Figure 2 for CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic Network
Figure 3 for CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic Network
Figure 4 for CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic Network
Viaarxiv icon

Real-time speech enhancement with dynamic attention span

Add code
Bookmark button
Alert button
Feb 21, 2023
Chengyu Zheng, Yuan Zhou, Xiulian Peng, Yuan Zhang, Yan Lu

Figure 1 for Real-time speech enhancement with dynamic attention span
Figure 2 for Real-time speech enhancement with dynamic attention span
Figure 3 for Real-time speech enhancement with dynamic attention span
Figure 4 for Real-time speech enhancement with dynamic attention span
Viaarxiv icon

Developmental Bootstrapping of AIs

Aug 08, 2023
Mark Stefik, Robert Price

Figure 1 for Developmental Bootstrapping of AIs
Figure 2 for Developmental Bootstrapping of AIs
Figure 3 for Developmental Bootstrapping of AIs
Figure 4 for Developmental Bootstrapping of AIs
Viaarxiv icon

A Time-Frequency Generative Adversarial based method for Audio Packet Loss Concealment

Add code
Bookmark button
Alert button
Jul 28, 2023
Carlo Aironi, Samuele Cornell, Luca Serafini, Stefano Squartini

Figure 1 for A Time-Frequency Generative Adversarial based method for Audio Packet Loss Concealment
Figure 2 for A Time-Frequency Generative Adversarial based method for Audio Packet Loss Concealment
Figure 3 for A Time-Frequency Generative Adversarial based method for Audio Packet Loss Concealment
Figure 4 for A Time-Frequency Generative Adversarial based method for Audio Packet Loss Concealment
Viaarxiv icon

Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase

Add code
Bookmark button
Alert button
Jul 23, 2023
Yoshiki Masuyama, Natsuki Ueno, Nobutaka Ono

Figure 1 for Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase
Figure 2 for Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase
Figure 3 for Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase
Figure 4 for Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase
Viaarxiv icon

A Model for Every User and Budget: Label-Free and Personalized Mixed-Precision Quantization

Jul 24, 2023
Edward Fish, Umberto Michieli, Mete Ozay

Viaarxiv icon

Online Continual Learning in Keyword Spotting for Low-Resource Devices via Pooling High-Order Temporal Statistics

Jul 24, 2023
Umberto Michieli, Pablo Peso Parada, Mete Ozay

Figure 1 for Online Continual Learning in Keyword Spotting for Low-Resource Devices via Pooling High-Order Temporal Statistics
Figure 2 for Online Continual Learning in Keyword Spotting for Low-Resource Devices via Pooling High-Order Temporal Statistics
Figure 3 for Online Continual Learning in Keyword Spotting for Low-Resource Devices via Pooling High-Order Temporal Statistics
Figure 4 for Online Continual Learning in Keyword Spotting for Low-Resource Devices via Pooling High-Order Temporal Statistics
Viaarxiv icon

WhisperX: Time-Accurate Speech Transcription of Long-Form Audio

Add code
Bookmark button
Alert button
Mar 01, 2023
Max Bain, Jaesung Huh, Tengda Han, Andrew Zisserman

Figure 1 for WhisperX: Time-Accurate Speech Transcription of Long-Form Audio
Figure 2 for WhisperX: Time-Accurate Speech Transcription of Long-Form Audio
Figure 3 for WhisperX: Time-Accurate Speech Transcription of Long-Form Audio
Figure 4 for WhisperX: Time-Accurate Speech Transcription of Long-Form Audio
Viaarxiv icon

Factors Affecting the Performance of Automated Speaker Verification in Alzheimer's Disease Clinical Trials

Add code
Bookmark button
Alert button
Jun 20, 2023
Malikeh Ehghaghi, Marija Stanojevic, Ali Akram, Jekaterina Novikova

Figure 1 for Factors Affecting the Performance of Automated Speaker Verification in Alzheimer's Disease Clinical Trials
Figure 2 for Factors Affecting the Performance of Automated Speaker Verification in Alzheimer's Disease Clinical Trials
Figure 3 for Factors Affecting the Performance of Automated Speaker Verification in Alzheimer's Disease Clinical Trials
Figure 4 for Factors Affecting the Performance of Automated Speaker Verification in Alzheimer's Disease Clinical Trials
Viaarxiv icon

Generative Emotional AI for Speech Emotion Recognition: The Case for Synthetic Emotional Speech Augmentation

Add code
Bookmark button
Alert button
Jan 10, 2023
Abdullah Shahid, Siddique Latif, Junaid Qadir

Figure 1 for Generative Emotional AI for Speech Emotion Recognition: The Case for Synthetic Emotional Speech Augmentation
Figure 2 for Generative Emotional AI for Speech Emotion Recognition: The Case for Synthetic Emotional Speech Augmentation
Figure 3 for Generative Emotional AI for Speech Emotion Recognition: The Case for Synthetic Emotional Speech Augmentation
Figure 4 for Generative Emotional AI for Speech Emotion Recognition: The Case for Synthetic Emotional Speech Augmentation
Viaarxiv icon