"speech": models, code, and papers

Representation Learning Strategies to Model Pathological Speech: Effect of Multiple Spectral Resolutions

Sep 17, 2022
Gabriel Figueiredo Miller, Juan Camilo Vásquez-Correa, Juan Rafael Orozco-Arroyave, Elmar Nöth

Pac-HuBERT: Self-Supervised Music Source Separation via Primitive Auditory Clustering and Hidden-Unit BERT

Apr 04, 2023
Ke Chen, Gordon Wichern, François G. Germain, Jonathan Le Roux

GazeReader: Detecting Unknown Word Using Webcam for English as a Second Language (ESL) Learners

Mar 18, 2023
Jiexin Ding, Bowen Zhao, Yuqi Huang, Yuntao Wang, Yuanchun Shi

Contextually-rich human affect perception using multimodal scene information

Mar 13, 2023
Digbalay Bose, Rajat Hebbar, Krishna Somandepalli, Shrikanth Narayanan

Robust Data2vec: Noise-robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning

Oct 27, 2022
Qiu-Shi Zhu, Long Zhou, Jie Zhang, Shu-Jie Liu, Yu-Chen Hu, Li-Rong Dai

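Sejarah dan Perkembangan Teknik Natural Language Processing (NLP) Bahasa Indonesia: Tinjauan tentang sejarah, perkembangan teknologi, dan aplikasi NLP dalam bahasa Indonesia [History and Development of Natural Language Processing (NLP) Techniques for Indonesian: A Review of the History, Technological Development, and Applications of NLP in the Indonesian Language]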

Mar 28, 2023
Mukhlis Amien

Deep Spectro-temporal Artifacts for Detecting Synthesized Speech

Oct 11, 2022
Xiaohui Liu, Meng Liu, Lin Zhang, Linjuan Zhang, Chang Zeng, Kai Li, Nan Li, Kong Aik Lee, Longbiao Wang, Jianwu Dang

Implementation Of Tiny Machine Learning Models On Arduino 33 BLE For Gesture And Speech Recognition

Jul 23, 2022
Viswanatha V, Ramachandra A. C, Raghavendra Prasanna, Prem Chowdary Kakarla, Viveka Simha PJ, Nishant Mohan

Speech Augmentation Based Unsupervised Learning for Keyword Spotting

May 28, 2022
Jian Luo, Jianzong Wang, Ning Cheng, Haobin Tang, Jing Xiao

Parallel Synthesis for Autoregressive Speech Generation

Apr 25, 2022
Po-chun Hsu, Da-rong Liu, Andy T. Liu, Hung-yi Lee
