Alert button

"speech": models, code, and papers
Alert button

Unsupervised Multimodal Word Discovery based on Double Articulation Analysis with Co-occurrence cues

Add code
Bookmark button
Alert button
Jan 18, 2022
Akira Taniguchi, Hiroaki Murakami, Ryo Ozaki, Tadahiro Taniguchi

Figure 1 for Unsupervised Multimodal Word Discovery based on Double Articulation Analysis with Co-occurrence cues
Figure 2 for Unsupervised Multimodal Word Discovery based on Double Articulation Analysis with Co-occurrence cues
Figure 3 for Unsupervised Multimodal Word Discovery based on Double Articulation Analysis with Co-occurrence cues
Figure 4 for Unsupervised Multimodal Word Discovery based on Double Articulation Analysis with Co-occurrence cues
Viaarxiv icon

A Review of Language and Speech Features for Cognitive-Linguistic Assessment

Jun 04, 2019
Rohit Voleti, Julie M. Liss, Visar Berisha

Figure 1 for A Review of Language and Speech Features for Cognitive-Linguistic Assessment
Figure 2 for A Review of Language and Speech Features for Cognitive-Linguistic Assessment
Figure 3 for A Review of Language and Speech Features for Cognitive-Linguistic Assessment
Figure 4 for A Review of Language and Speech Features for Cognitive-Linguistic Assessment
Viaarxiv icon

Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition

Oct 20, 2020
Yu Zhang, James Qin, Daniel S. Park, Wei Han, Chung-Cheng Chiu, Ruoming Pang, Quoc V. Le, Yonghui Wu

Figure 1 for Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition
Figure 2 for Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition
Figure 3 for Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition
Figure 4 for Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition
Viaarxiv icon

From Inference to Generation: End-to-end Fully Self-supervised Generation of Human Face from Speech

Add code
Bookmark button
Alert button
Apr 13, 2020
Hyeong-Seok Choi, Changdae Park, Kyogu Lee

Figure 1 for From Inference to Generation: End-to-end Fully Self-supervised Generation of Human Face from Speech
Figure 2 for From Inference to Generation: End-to-end Fully Self-supervised Generation of Human Face from Speech
Figure 3 for From Inference to Generation: End-to-end Fully Self-supervised Generation of Human Face from Speech
Figure 4 for From Inference to Generation: End-to-end Fully Self-supervised Generation of Human Face from Speech
Viaarxiv icon

Minimising Biasing Word Errors for Contextual ASR with the Tree-Constrained Pointer Generator

May 18, 2022
Guangzhi Sun, Chao Zhang, Philip C Woodland

Figure 1 for Minimising Biasing Word Errors for Contextual ASR with the Tree-Constrained Pointer Generator
Figure 2 for Minimising Biasing Word Errors for Contextual ASR with the Tree-Constrained Pointer Generator
Figure 3 for Minimising Biasing Word Errors for Contextual ASR with the Tree-Constrained Pointer Generator
Figure 4 for Minimising Biasing Word Errors for Contextual ASR with the Tree-Constrained Pointer Generator
Viaarxiv icon

Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval

Add code
Bookmark button
Alert button
Jun 05, 2022
Xudong Lin, Simran Tiwari, Shiyuan Huang, Manling Li, Mike Zheng Shou, Heng Ji, Shih-Fu Chang

Figure 1 for Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval
Figure 2 for Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval
Figure 3 for Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval
Figure 4 for Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval
Viaarxiv icon

Multimodal Semi-supervised Learning Framework for Punctuation Prediction in Conversational Speech

Add code
Bookmark button
Alert button
Aug 03, 2020
Monica Sunkara, Srikanth Ronanki, Dhanush Bekal, Sravan Bodapati, Katrin Kirchhoff

Figure 1 for Multimodal Semi-supervised Learning Framework for Punctuation Prediction in Conversational Speech
Figure 2 for Multimodal Semi-supervised Learning Framework for Punctuation Prediction in Conversational Speech
Figure 3 for Multimodal Semi-supervised Learning Framework for Punctuation Prediction in Conversational Speech
Figure 4 for Multimodal Semi-supervised Learning Framework for Punctuation Prediction in Conversational Speech
Viaarxiv icon

A Joint Model for Dropped Pronoun Recovery and Conversational Discourse Parsing in Chinese Conversational Speech

Add code
Bookmark button
Alert button
Jun 07, 2021
Jingxuan Yang, Kerui Xu, Jun Xu, Si Li, Sheng Gao, Jun Guo, Nianwen Xue, Ji-Rong Wen

Figure 1 for A Joint Model for Dropped Pronoun Recovery and Conversational Discourse Parsing in Chinese Conversational Speech
Figure 2 for A Joint Model for Dropped Pronoun Recovery and Conversational Discourse Parsing in Chinese Conversational Speech
Figure 3 for A Joint Model for Dropped Pronoun Recovery and Conversational Discourse Parsing in Chinese Conversational Speech
Figure 4 for A Joint Model for Dropped Pronoun Recovery and Conversational Discourse Parsing in Chinese Conversational Speech
Viaarxiv icon

A survey on recently proposed activation functions for Deep Learning

Apr 07, 2022
Murilo Gustineli

Figure 1 for A survey on recently proposed activation functions for Deep Learning
Figure 2 for A survey on recently proposed activation functions for Deep Learning
Viaarxiv icon

A Novel Decision Tree for Depression Recognition in Speech

Feb 22, 2020
Zhenyu Liu, Dongyu Wang, Lan Zhang, Bin Hu

Figure 1 for A Novel Decision Tree for Depression Recognition in Speech
Figure 2 for A Novel Decision Tree for Depression Recognition in Speech
Figure 3 for A Novel Decision Tree for Depression Recognition in Speech
Figure 4 for A Novel Decision Tree for Depression Recognition in Speech
Viaarxiv icon