"speech": models, code, and papers

Subspace Hybrid Beamforming for Head-worn Microphone Arrays

Mar 15, 2023
Sina Hafezi, Alastair H. Moore, Pierre Guiraud, Patrick A. Naylor, Jacob Donley, Vladimir Tourbabin, Thomas Lunner

SpeechBlender: Speech Augmentation Framework for Mispronunciation Data Generation

Nov 02, 2022
Yassine El Kheir, Shammur Absar Chowdhury, Hamdy Mubarak, Shazia Afzal, Ahmed Ali

PITS: Variational Pitch Inference without Fundamental Frequency for End-to-End Pitch-controllable TTS

Feb 24, 2023
Junhyeok Lee, Wonbin Jung, Hyunjae Cho, Jaeyeon Kim

Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech Emotion Recognition

Nov 14, 2022
Jiaxin Ye, Xincheng Wen, Yujie Wei, Yong Xu, Kunhong Liu, Hongming Shan

RedApt: An Adaptor for wav2vec 2 Encoding Faster and Smaller Speech Translation without Quality Compromise

Oct 16, 2022
Jinming Zhao, Hao Yang, Gholamreza Haffari, Ehsan Shareghi

Deep Visual Forced Alignment: Learning to Align Transcription with Talking Face Video

Feb 27, 2023
Minsu Kim, Chae Won Kim, Yong Man Ro

Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning

Dec 10, 2022
Chen Chen, Yuchen Hu, Qiang Zhang, Heqing Zou, Beier Zhu, Eng Siong Chng

Partially Adaptive Multichannel Joint Reduction of Ego-noise and Environmental Noise

Mar 27, 2023
Huajian Fang, Niklas Wittmer, Johannes Twiefel, Stefan Wermter, Timo Gerkmann

ConceptBeam: Concept Driven Target Speech Extraction

Jul 25, 2022
Yasunori Ohishi, Marc Delcroix, Tsubasa Ochiai, Shoko Araki, Daiki Takeuchi, Daisuke Niizumi, Akisato Kimura, Noboru Harada, Kunio Kashino

Emotional Expression Detection in Spoken Language Employing Machine Learning Algorithms

Apr 20, 2023
Mehrab Hosain, Most. Yeasmin Arafat, Gazi Zahirul Islam, Jia Uddin, Md. Mobarak Hossain, Fatema Alam
