Alert button

"speech": models, code, and papers
Alert button

LEACE: Perfect linear concept erasure in closed form

Add code
Bookmark button
Alert button
Jun 23, 2023
Nora Belrose, David Schneider-Joseph, Shauli Ravfogel, Ryan Cotterell, Edward Raff, Stella Biderman

Figure 1 for LEACE: Perfect linear concept erasure in closed form
Figure 2 for LEACE: Perfect linear concept erasure in closed form
Figure 3 for LEACE: Perfect linear concept erasure in closed form
Figure 4 for LEACE: Perfect linear concept erasure in closed form
Viaarxiv icon

DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation

Feb 21, 2023
Shuo Wang, Xiangyu Kong, Xiulian Peng, Hesam Movassagh, Vinod Prakash, Yan Lu

Figure 1 for DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation
Figure 2 for DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation
Figure 3 for DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation
Figure 4 for DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation
Viaarxiv icon

PAMP: A unified framework boosting low resource automatic speech recognition

Add code
Bookmark button
Alert button
Feb 05, 2023
Zeping Min, Qian Ge, Zhong Li, Weinan E

Figure 1 for PAMP: A unified framework boosting low resource automatic speech recognition
Figure 2 for PAMP: A unified framework boosting low resource automatic speech recognition
Figure 3 for PAMP: A unified framework boosting low resource automatic speech recognition
Figure 4 for PAMP: A unified framework boosting low resource automatic speech recognition
Viaarxiv icon

Voice Conversion With Just Nearest Neighbors

Add code
Bookmark button
Alert button
May 30, 2023
Matthew Baas, Benjamin van Niekerk, Herman Kamper

Figure 1 for Voice Conversion With Just Nearest Neighbors
Figure 2 for Voice Conversion With Just Nearest Neighbors
Figure 3 for Voice Conversion With Just Nearest Neighbors
Viaarxiv icon

A Stutter Seldom Comes Alone -- Cross-Corpus Stuttering Detection as a Multi-label Problem

May 30, 2023
Sebastian P. Bayerl, Dominik Wagner, Ilja Baumann, Florian Hönig, Tobias Bocklet, Elmar Nöth, Korbinian Riedhammer

Figure 1 for A Stutter Seldom Comes Alone -- Cross-Corpus Stuttering Detection as a Multi-label Problem
Figure 2 for A Stutter Seldom Comes Alone -- Cross-Corpus Stuttering Detection as a Multi-label Problem
Figure 3 for A Stutter Seldom Comes Alone -- Cross-Corpus Stuttering Detection as a Multi-label Problem
Figure 4 for A Stutter Seldom Comes Alone -- Cross-Corpus Stuttering Detection as a Multi-label Problem
Viaarxiv icon

Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models

Mar 15, 2023
Steven M. Hernandez, Ding Zhao, Shaojin Ding, Antoine Bruguier, Rohit Prabhavalkar, Tara N. Sainath, Yanzhang He, Ian McGraw

Figure 1 for Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models
Figure 2 for Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models
Figure 3 for Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models
Figure 4 for Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models
Viaarxiv icon

Designing and Evaluating Speech Emotion Recognition Systems: A reality check case study with IEMOCAP

Add code
Bookmark button
Alert button
Apr 03, 2023
Nikolaos Antoniou, Athanasios Katsamanis, Theodoros Giannakopoulos, Shrikanth Narayanan

Figure 1 for Designing and Evaluating Speech Emotion Recognition Systems: A reality check case study with IEMOCAP
Figure 2 for Designing and Evaluating Speech Emotion Recognition Systems: A reality check case study with IEMOCAP
Figure 3 for Designing and Evaluating Speech Emotion Recognition Systems: A reality check case study with IEMOCAP
Viaarxiv icon

UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion

Add code
Bookmark button
Alert button
Jan 10, 2023
Haogeng Liu, Tao Wang, Ruibo Fu, Jiangyan Yi, Zhengqi Wen, Jianhua Tao

Figure 1 for UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion
Figure 2 for UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion
Figure 3 for UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion
Figure 4 for UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion
Viaarxiv icon

Reducing the gap between streaming and non-streaming Transducer-based ASR by adaptive two-stage knowledge distillation

Jun 27, 2023
Haitao Tang, Yu Fu, Lei Sun, Jiabin Xue, Dan Liu, Yongchao Li, Zhiqiang Ma, Minghui Wu, Jia Pan, Genshun Wan, Ming'en Zhao

Figure 1 for Reducing the gap between streaming and non-streaming Transducer-based ASR by adaptive two-stage knowledge distillation
Figure 2 for Reducing the gap between streaming and non-streaming Transducer-based ASR by adaptive two-stage knowledge distillation
Figure 3 for Reducing the gap between streaming and non-streaming Transducer-based ASR by adaptive two-stage knowledge distillation
Figure 4 for Reducing the gap between streaming and non-streaming Transducer-based ASR by adaptive two-stage knowledge distillation
Viaarxiv icon

Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations

Add code
Bookmark button
Alert button
Mar 01, 2023
Siyuan Shen, Feng Liu, Aimin Zhou

Figure 1 for Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
Figure 2 for Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
Figure 3 for Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
Figure 4 for Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
Viaarxiv icon