Alert button

"speech": models, code, and papers
Alert button

Understanding Postpartum Parents' Experiences via Two Digital Platforms

Add code
Bookmark button
Alert button
Dec 22, 2022
Xuewen Yao, Miriam Mikhelson, Megan Micheletti, Eunsol Choi, S Craig Watkins, Edison Thomaz, Kaya De Barbaro

Figure 1 for Understanding Postpartum Parents' Experiences via Two Digital Platforms
Figure 2 for Understanding Postpartum Parents' Experiences via Two Digital Platforms
Figure 3 for Understanding Postpartum Parents' Experiences via Two Digital Platforms
Figure 4 for Understanding Postpartum Parents' Experiences via Two Digital Platforms
Viaarxiv icon

FaceFormer: Speech-Driven 3D Facial Animation with Transformers

Add code
Bookmark button
Alert button
Dec 28, 2021
Yingruo Fan, Zhaojiang Lin, Jun Saito, Wenping Wang, Taku Komura

Figure 1 for FaceFormer: Speech-Driven 3D Facial Animation with Transformers
Figure 2 for FaceFormer: Speech-Driven 3D Facial Animation with Transformers
Figure 3 for FaceFormer: Speech-Driven 3D Facial Animation with Transformers
Figure 4 for FaceFormer: Speech-Driven 3D Facial Animation with Transformers
Viaarxiv icon

Talking Head Generation with Audio and Speech Related Facial Action Units

Oct 19, 2021
Sen Chen, Zhilei Liu, Jiaxing Liu, Zhengxiang Yan, Longbiao Wang

Figure 1 for Talking Head Generation with Audio and Speech Related Facial Action Units
Figure 2 for Talking Head Generation with Audio and Speech Related Facial Action Units
Figure 3 for Talking Head Generation with Audio and Speech Related Facial Action Units
Figure 4 for Talking Head Generation with Audio and Speech Related Facial Action Units
Viaarxiv icon

WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition

Add code
Bookmark button
Alert button
Oct 12, 2021
Binbin Zhang, Hang Lv, Pengcheng Guo, Qijie Shao, Chao Yang, Lei Xie, Xin Xu, Hui Bu, Xiaoyu Chen, Chenchen Zeng, Di Wu, Zhendong Peng

Figure 1 for WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Figure 2 for WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Figure 3 for WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Figure 4 for WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Viaarxiv icon

Universal Paralinguistic Speech Representations Using Self-Supervised Conformers

Oct 09, 2021
Joel Shor, Aren Jansen, Wei Han, Daniel Park, Yu Zhang

Figure 1 for Universal Paralinguistic Speech Representations Using Self-Supervised Conformers
Figure 2 for Universal Paralinguistic Speech Representations Using Self-Supervised Conformers
Figure 3 for Universal Paralinguistic Speech Representations Using Self-Supervised Conformers
Figure 4 for Universal Paralinguistic Speech Representations Using Self-Supervised Conformers
Viaarxiv icon

Building African Voices

Add code
Bookmark button
Alert button
Jul 01, 2022
Perez Ogayo, Graham Neubig, Alan W Black

Figure 1 for Building African Voices
Figure 2 for Building African Voices
Figure 3 for Building African Voices
Viaarxiv icon

Memories are One-to-Many Mapping Alleviators in Talking Face Generation

Add code
Bookmark button
Alert button
Dec 09, 2022
Anni Tang, Tianyu He, Xu Tan, Jun Ling, Runnan Li, Sheng Zhao, Li Song, Jiang Bian

Figure 1 for Memories are One-to-Many Mapping Alleviators in Talking Face Generation
Figure 2 for Memories are One-to-Many Mapping Alleviators in Talking Face Generation
Figure 3 for Memories are One-to-Many Mapping Alleviators in Talking Face Generation
Figure 4 for Memories are One-to-Many Mapping Alleviators in Talking Face Generation
Viaarxiv icon

End-to-End Multi-Person Audio/Visual Automatic Speech Recognition

May 11, 2022
Otavio Braga, Takaki Makino, Olivier Siohan, Hank Liao

Figure 1 for End-to-End Multi-Person Audio/Visual Automatic Speech Recognition
Figure 2 for End-to-End Multi-Person Audio/Visual Automatic Speech Recognition
Figure 3 for End-to-End Multi-Person Audio/Visual Automatic Speech Recognition
Figure 4 for End-to-End Multi-Person Audio/Visual Automatic Speech Recognition
Viaarxiv icon

Towards Automatic Speech to Sign Language Generation

Add code
Bookmark button
Alert button
Jun 24, 2021
Parul Kapoor, Rudrabha Mukhopadhyay, Sindhu B Hegde, Vinay Namboodiri, C V Jawahar

Figure 1 for Towards Automatic Speech to Sign Language Generation
Figure 2 for Towards Automatic Speech to Sign Language Generation
Figure 3 for Towards Automatic Speech to Sign Language Generation
Figure 4 for Towards Automatic Speech to Sign Language Generation
Viaarxiv icon

Zero-shot Speech Translation

Jul 13, 2021
Tu Anh Dinh

Figure 1 for Zero-shot Speech Translation
Figure 2 for Zero-shot Speech Translation
Figure 3 for Zero-shot Speech Translation
Figure 4 for Zero-shot Speech Translation
Viaarxiv icon