Alert button

"speech": models, code, and papers
Alert button

Phonological Level wav2vec2-based Mispronunciation Detection and Diagnosis Method

Nov 13, 2023
Mostafa Shahin, Julien Epps, Beena Ahmed

Viaarxiv icon

Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer

Add code
Bookmark button
Alert button
Sep 14, 2023
Yongqi Wang, Jionghao Bai, Rongjie Huang, Ruiqi Li, Zhiqing Hong, Zhou Zhao

Figure 1 for Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer
Figure 2 for Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer
Figure 3 for Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer
Figure 4 for Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer
Viaarxiv icon

AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data

Add code
Bookmark button
Alert button
Sep 25, 2023
Jianwei Yu, Hangting Chen, Yanyao Bian, Xiang Li, Yi Luo, Jinchuan Tian, Mengyang Liu, Jiayi Jiang, Shuai Wang

Figure 1 for AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data
Figure 2 for AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data
Figure 3 for AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data
Figure 4 for AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data
Viaarxiv icon

Improving Startup Success with Text Analysis

Dec 11, 2023
Emily Gavrilenko, Foaad Khosmood, Mahdi Rastad, Sadra Amiri Moghaddam

Viaarxiv icon

Modeling of Speech-dependent Own Voice Transfer Characteristics for Hearables with In-ear Microphones

Oct 10, 2023
Mattes Ohlenbusch, Christian Rollwage, Simon Doclo

Figure 1 for Modeling of Speech-dependent Own Voice Transfer Characteristics for Hearables with In-ear Microphones
Figure 2 for Modeling of Speech-dependent Own Voice Transfer Characteristics for Hearables with In-ear Microphones
Figure 3 for Modeling of Speech-dependent Own Voice Transfer Characteristics for Hearables with In-ear Microphones
Figure 4 for Modeling of Speech-dependent Own Voice Transfer Characteristics for Hearables with In-ear Microphones
Viaarxiv icon

A Comprehensive Survey on Multi-modal Conversational Emotion Recognition with Deep Learning

Dec 10, 2023
Yuntao Shou, Tao Meng, Wei Ai, Nan Yin, Keqin Li

Viaarxiv icon

R-Spin: Efficient Speaker and Noise-invariant Representation Learning with Acoustic Pieces

Nov 15, 2023
Heng-Jui Chang, James Glass

Viaarxiv icon

MixRep: Hidden Representation Mixup for Low-Resource Speech Recognition

Add code
Bookmark button
Alert button
Oct 27, 2023
Jiamin Xie, John H. L. Hansen

Viaarxiv icon

Psychoacoustic Challenges Of Speech Enhancement On VoIP Platforms

Oct 11, 2023
Joseph Konan, Ojas Bhargave, Shikhar Agnihotri, Shuo Han, Yunyang Zeng, Ankit Shah, Bhiksha Raj

Figure 1 for Psychoacoustic Challenges Of Speech Enhancement On VoIP Platforms
Figure 2 for Psychoacoustic Challenges Of Speech Enhancement On VoIP Platforms
Figure 3 for Psychoacoustic Challenges Of Speech Enhancement On VoIP Platforms
Viaarxiv icon

A privacy-preserving method using secret key for convolutional neural network-based speech classification

Oct 06, 2023
Shoko Niwa, Sayaka Shiota, Hitoshi Kiya

Figure 1 for A privacy-preserving method using secret key for convolutional neural network-based speech classification
Figure 2 for A privacy-preserving method using secret key for convolutional neural network-based speech classification
Figure 3 for A privacy-preserving method using secret key for convolutional neural network-based speech classification
Figure 4 for A privacy-preserving method using secret key for convolutional neural network-based speech classification
Viaarxiv icon