"speech": models, code, and papers

SceneFake: An Initial Dataset and Benchmarks for Scene Fake Audio Detection

Nov 11, 2022
Jiangyan Yi, Chenglong Wang, Jianhua Tao, Zhengkun Tian, Cunhang Fan, Haoxin Ma, Ruibo Fu

Via arXiv

Automated Sex Classification of Children's Voices and Changes in Differentiating Factors with Age

Sep 27, 2022
Fuling Chen, Roberto Togneri, Murray Maybery, Diana Weiting Tan

Via arXiv

Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity

Nov 02, 2021
Peter Wu, Jiatong Shi, Yifan Zhong, Shinji Watanabe, Alan W Black

Via arXiv

DAL: Feature Learning from Overt Speech to Decode Imagined Speech-based EEG Signals with Convolutional Autoencoder

Jul 15, 2021
Dae-Hyeok Lee, Sung-Jin Kim, Seong-Whan Lee

Via arXiv

Unsupervised Instance Discriminative Learning for Depression Detection from Speech Signals

Jun 27, 2022
Jinhan Wang, Vijay Ravi, Jonathan Flint, Abeer Alwan

Via arXiv

QASR: QCRI Aljazeera Speech Resource -- A Large Scale Annotated Arabic Speech Corpus

Jun 24, 2021
Hamdy Mubarak, Amir Hussein, Shammur Absar Chowdhury, Ahmed Ali

Via arXiv

Efficient yet Competitive Speech Translation: FBK@IWSLT2022

May 05, 2022
Marco Gaido, Sara Papi, Dennis Fucci, Giuseppe Fiameni, Matteo Negri, Marco Turchi

Via arXiv

Finstreder: Simple and fast Spoken Language Understanding with Finite State Transducers using modern Speech-to-Text models

Jun 29, 2022
Daniel Bermuth, Alexander Poeppel, Wolfgang Reif

Via arXiv

RemixIT: Continual self-training of speech enhancement models via bootstrapped remixing

Feb 22, 2022
Efthymios Tzinis, Yossi Adi, Vamsi Krishna Ithapu, Buye Xu, Paris Smaragdis, Anurag Kumar

Via arXiv

DUAL: Textless Spoken Question Answering with Speech Discrete Unit Adaptive Learning

Mar 09, 2022
Guan-Ting Lin, Yung-Sung Chuang, Ho-Lam Chung, Shu-wen Yang, Hsuan-Jui Chen, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Lin-shan Lee

Via arXiv