Alert button

"speech": models, code, and papers
Alert button

DST: Deformable Speech Transformer for Emotion Recognition

Add code
Bookmark button
Alert button
Feb 27, 2023
Weidong Chen, Xiaofen Xing, Xiangmin Xu, Jianxin Pang, Lan Du

Figure 1 for DST: Deformable Speech Transformer for Emotion Recognition
Figure 2 for DST: Deformable Speech Transformer for Emotion Recognition
Figure 3 for DST: Deformable Speech Transformer for Emotion Recognition
Figure 4 for DST: Deformable Speech Transformer for Emotion Recognition
Viaarxiv icon

Developmental Bootstrapping of AIs

Aug 17, 2023
Mark Stefik, Robert Price

Figure 1 for Developmental Bootstrapping of AIs
Figure 2 for Developmental Bootstrapping of AIs
Figure 3 for Developmental Bootstrapping of AIs
Figure 4 for Developmental Bootstrapping of AIs
Viaarxiv icon

Speech Enhancement for Virtual Meetings on Cellular Networks

Add code
Bookmark button
Alert button
Feb 16, 2023
Hojeong Lee, Minseon Gwak, Kawon Lee, Minjeong Kim, Joseph Konan, Ojas Bhargave

Figure 1 for Speech Enhancement for Virtual Meetings on Cellular Networks
Figure 2 for Speech Enhancement for Virtual Meetings on Cellular Networks
Figure 3 for Speech Enhancement for Virtual Meetings on Cellular Networks
Figure 4 for Speech Enhancement for Virtual Meetings on Cellular Networks
Viaarxiv icon

Deep Learning-based F0 Synthesis for Speaker Anonymization

Jun 29, 2023
Ünal Ege Gaznepoglu, Nils Peters

Figure 1 for Deep Learning-based F0 Synthesis for Speaker Anonymization
Figure 2 for Deep Learning-based F0 Synthesis for Speaker Anonymization
Figure 3 for Deep Learning-based F0 Synthesis for Speaker Anonymization
Figure 4 for Deep Learning-based F0 Synthesis for Speaker Anonymization
Viaarxiv icon

WESPER: Zero-shot and Realtime Whisper to Normal Voice Conversion for Whisper-based Speech Interactions

Add code
Bookmark button
Alert button
Mar 03, 2023
Jun Rekimoto

Figure 1 for WESPER: Zero-shot and Realtime Whisper to Normal Voice Conversion for Whisper-based Speech Interactions
Figure 2 for WESPER: Zero-shot and Realtime Whisper to Normal Voice Conversion for Whisper-based Speech Interactions
Figure 3 for WESPER: Zero-shot and Realtime Whisper to Normal Voice Conversion for Whisper-based Speech Interactions
Figure 4 for WESPER: Zero-shot and Realtime Whisper to Normal Voice Conversion for Whisper-based Speech Interactions
Viaarxiv icon

GesGPT: Speech Gesture Synthesis With Text Parsing from GPT

Mar 23, 2023
Nan Gao, Zeyu Zhao, Zhi Zeng, Shuwu Zhang, Dongdong Weng

Figure 1 for GesGPT: Speech Gesture Synthesis With Text Parsing from GPT
Figure 2 for GesGPT: Speech Gesture Synthesis With Text Parsing from GPT
Figure 3 for GesGPT: Speech Gesture Synthesis With Text Parsing from GPT
Figure 4 for GesGPT: Speech Gesture Synthesis With Text Parsing from GPT
Viaarxiv icon

Compensating Removed Frequency Components: Thwarting Voice Spectrum Reduction Attacks

Aug 18, 2023
Shu Wang, Kun Sun, Qi Li

Figure 1 for Compensating Removed Frequency Components: Thwarting Voice Spectrum Reduction Attacks
Figure 2 for Compensating Removed Frequency Components: Thwarting Voice Spectrum Reduction Attacks
Figure 3 for Compensating Removed Frequency Components: Thwarting Voice Spectrum Reduction Attacks
Figure 4 for Compensating Removed Frequency Components: Thwarting Voice Spectrum Reduction Attacks
Viaarxiv icon

Speaker Diarization of Scripted Audiovisual Content

Add code
Bookmark button
Alert button
Aug 04, 2023
Yogesh Virkar, Brian Thompson, Rohit Paturi, Sundararajan Srinivasan, Marcello Federico

Viaarxiv icon

A Critical Review of Physics-Informed Machine Learning Applications in Subsurface Energy Systems

Aug 06, 2023
Abdeldjalil Latrach, Mohamed Lamine Malki, Misael Morales, Mohamed Mehana, Minou Rabiei

Figure 1 for A Critical Review of Physics-Informed Machine Learning Applications in Subsurface Energy Systems
Figure 2 for A Critical Review of Physics-Informed Machine Learning Applications in Subsurface Energy Systems
Figure 3 for A Critical Review of Physics-Informed Machine Learning Applications in Subsurface Energy Systems
Figure 4 for A Critical Review of Physics-Informed Machine Learning Applications in Subsurface Energy Systems
Viaarxiv icon

Model-Agnostic Meta-Learning for Multilingual Hate Speech Detection

Add code
Bookmark button
Alert button
Mar 04, 2023
Md Rabiul Awal, Roy Ka-Wei Lee, Eshaan Tanwar, Tanmay Garg, Tanmoy Chakraborty

Figure 1 for Model-Agnostic Meta-Learning for Multilingual Hate Speech Detection
Figure 2 for Model-Agnostic Meta-Learning for Multilingual Hate Speech Detection
Figure 3 for Model-Agnostic Meta-Learning for Multilingual Hate Speech Detection
Figure 4 for Model-Agnostic Meta-Learning for Multilingual Hate Speech Detection
Viaarxiv icon