Alert button

"speech": models, code, and papers
Alert button

Audio-visual video face hallucination with frequency supervision and cross modality support by speech based lip reading loss

Nov 20, 2022
Shailza Sharma, Abhinav Dhall, Vinay Kumar, Vivek Singh Bawa

Figure 1 for Audio-visual video face hallucination with frequency supervision and cross modality support by speech based lip reading loss
Figure 2 for Audio-visual video face hallucination with frequency supervision and cross modality support by speech based lip reading loss
Figure 3 for Audio-visual video face hallucination with frequency supervision and cross modality support by speech based lip reading loss
Figure 4 for Audio-visual video face hallucination with frequency supervision and cross modality support by speech based lip reading loss
Viaarxiv icon

Improving the transferability of speech separation by meta-learning

Add code
Bookmark button
Alert button
Mar 11, 2022
Kuan-Po Huang, Yuan-Kuei Wu, Hung-yi Lee

Figure 1 for Improving the transferability of speech separation by meta-learning
Figure 2 for Improving the transferability of speech separation by meta-learning
Figure 3 for Improving the transferability of speech separation by meta-learning
Viaarxiv icon

Dual-path Attention is All You Need for Audio-Visual Speech Extraction

Jul 09, 2022
Zhongweiyang Xu, Xulin Fan, Mark Hasegawa-Johnson

Figure 1 for Dual-path Attention is All You Need for Audio-Visual Speech Extraction
Figure 2 for Dual-path Attention is All You Need for Audio-Visual Speech Extraction
Figure 3 for Dual-path Attention is All You Need for Audio-Visual Speech Extraction
Viaarxiv icon

Topic Modeling Based on Two-Step Flow Theory: Application to Tweets about Bitcoin

Mar 03, 2023
Aos Mulahuwaish, Matthew Loucks, Basheer Qolomany, Ala Al-Fuqaha

Figure 1 for Topic Modeling Based on Two-Step Flow Theory: Application to Tweets about Bitcoin
Figure 2 for Topic Modeling Based on Two-Step Flow Theory: Application to Tweets about Bitcoin
Figure 3 for Topic Modeling Based on Two-Step Flow Theory: Application to Tweets about Bitcoin
Figure 4 for Topic Modeling Based on Two-Step Flow Theory: Application to Tweets about Bitcoin
Viaarxiv icon

Low-Resource End-to-end Sanskrit TTS using Tacotron2, WaveGlow and Transfer Learning

Dec 07, 2022
Ankur Debnath, Shridevi S Patil, Gangotri Nadiger, Ramakrishnan Angarai Ganesan

Figure 1 for Low-Resource End-to-end Sanskrit TTS using Tacotron2, WaveGlow and Transfer Learning
Figure 2 for Low-Resource End-to-end Sanskrit TTS using Tacotron2, WaveGlow and Transfer Learning
Figure 3 for Low-Resource End-to-end Sanskrit TTS using Tacotron2, WaveGlow and Transfer Learning
Figure 4 for Low-Resource End-to-end Sanskrit TTS using Tacotron2, WaveGlow and Transfer Learning
Viaarxiv icon

Approaching an unknown communication system by latent space exploration and causal inference

Add code
Bookmark button
Alert button
Mar 20, 2023
Gašper Beguš, Andrej Leban, Shane Gero

Figure 1 for Approaching an unknown communication system by latent space exploration and causal inference
Figure 2 for Approaching an unknown communication system by latent space exploration and causal inference
Figure 3 for Approaching an unknown communication system by latent space exploration and causal inference
Figure 4 for Approaching an unknown communication system by latent space exploration and causal inference
Viaarxiv icon

Explainable Human-centered Traits from Head Motion and Facial Expression Dynamics

Feb 23, 2023
Surbhi Madan, Monika Gahalawat, Tanaya Guha, Roland Goecke, Ramanathan Subramanian

Figure 1 for Explainable Human-centered Traits from Head Motion and Facial Expression Dynamics
Figure 2 for Explainable Human-centered Traits from Head Motion and Facial Expression Dynamics
Figure 3 for Explainable Human-centered Traits from Head Motion and Facial Expression Dynamics
Figure 4 for Explainable Human-centered Traits from Head Motion and Facial Expression Dynamics
Viaarxiv icon

Generalization of Auto-Regressive Hidden Markov Models to Non-Linear Dynamics and Non-Euclidean Observation Space

Add code
Bookmark button
Alert button
Feb 23, 2023
Michele Ginesi, Paolo Fiorini

Figure 1 for Generalization of Auto-Regressive Hidden Markov Models to Non-Linear Dynamics and Non-Euclidean Observation Space
Figure 2 for Generalization of Auto-Regressive Hidden Markov Models to Non-Linear Dynamics and Non-Euclidean Observation Space
Figure 3 for Generalization of Auto-Regressive Hidden Markov Models to Non-Linear Dynamics and Non-Euclidean Observation Space
Figure 4 for Generalization of Auto-Regressive Hidden Markov Models to Non-Linear Dynamics and Non-Euclidean Observation Space
Viaarxiv icon

Incorporating Uncertainty from Speaker Embedding Estimation to Speaker Verification

Feb 23, 2023
Qiongqiong Wang, Kong Aik Lee, Tianchi Liu

Figure 1 for Incorporating Uncertainty from Speaker Embedding Estimation to Speaker Verification
Figure 2 for Incorporating Uncertainty from Speaker Embedding Estimation to Speaker Verification
Figure 3 for Incorporating Uncertainty from Speaker Embedding Estimation to Speaker Verification
Viaarxiv icon

Make More of Your Data: Minimal Effort Data Augmentation for Automatic Speech Recognition and Translation

Add code
Bookmark button
Alert button
Oct 27, 2022
Tsz Kin Lam, Shigehiko Schamoni, Stefan Riezler

Figure 1 for Make More of Your Data: Minimal Effort Data Augmentation for Automatic Speech Recognition and Translation
Figure 2 for Make More of Your Data: Minimal Effort Data Augmentation for Automatic Speech Recognition and Translation
Figure 3 for Make More of Your Data: Minimal Effort Data Augmentation for Automatic Speech Recognition and Translation
Figure 4 for Make More of Your Data: Minimal Effort Data Augmentation for Automatic Speech Recognition and Translation
Viaarxiv icon