Alert button

"speech recognition": models, code, and papers
Alert button

"Notic My Speech" -- Blending Speech Patterns With Multimedia

Jun 12, 2020
Dhruva Sahrawat, Yaman Kumar, Shashwat Aggarwal, Yifang Yin, Rajiv Ratn Shah, Roger Zimmermann

Figure 1 for "Notic My Speech" -- Blending Speech Patterns With Multimedia
Figure 2 for "Notic My Speech" -- Blending Speech Patterns With Multimedia
Figure 3 for "Notic My Speech" -- Blending Speech Patterns With Multimedia
Figure 4 for "Notic My Speech" -- Blending Speech Patterns With Multimedia
Viaarxiv icon

TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos

Nov 19, 2020
Manuel Sam Ribeiro, Jennifer Sanger, Jing-Xuan Zhang, Aciel Eshky, Alan Wrench, Korin Richmond, Steve Renals

Figure 1 for TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos
Figure 2 for TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos
Figure 3 for TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos
Figure 4 for TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos
Viaarxiv icon

Keyword Transformer: A Self-Attention Model for Keyword Spotting

Add code
Bookmark button
Alert button
Apr 15, 2021
Axel Berg, Mark O'Connor, Miguel Tairum Cruz

Figure 1 for Keyword Transformer: A Self-Attention Model for Keyword Spotting
Figure 2 for Keyword Transformer: A Self-Attention Model for Keyword Spotting
Figure 3 for Keyword Transformer: A Self-Attention Model for Keyword Spotting
Figure 4 for Keyword Transformer: A Self-Attention Model for Keyword Spotting
Viaarxiv icon

Hierarchical Summarization for Longform Spoken Dialog

Aug 21, 2021
Daniel Li, Thomas Chen, Albert Tung, Lydia Chilton

Figure 1 for Hierarchical Summarization for Longform Spoken Dialog
Figure 2 for Hierarchical Summarization for Longform Spoken Dialog
Figure 3 for Hierarchical Summarization for Longform Spoken Dialog
Figure 4 for Hierarchical Summarization for Longform Spoken Dialog
Viaarxiv icon

Mixtures of Deep Neural Experts for Automated Speech Scoring

Jun 23, 2021
Sara Papi, Edmondo Trentin, Roberto Gretter, Marco Matassoni, Daniele Falavigna

Figure 1 for Mixtures of Deep Neural Experts for Automated Speech Scoring
Figure 2 for Mixtures of Deep Neural Experts for Automated Speech Scoring
Figure 3 for Mixtures of Deep Neural Experts for Automated Speech Scoring
Viaarxiv icon

May I Ask Who's Calling? Named Entity Recognition on Call Center Transcripts for Privacy Law Compliance

Add code
Bookmark button
Alert button
Oct 29, 2020
Micaela Kaplan

Figure 1 for May I Ask Who's Calling? Named Entity Recognition on Call Center Transcripts for Privacy Law Compliance
Figure 2 for May I Ask Who's Calling? Named Entity Recognition on Call Center Transcripts for Privacy Law Compliance
Figure 3 for May I Ask Who's Calling? Named Entity Recognition on Call Center Transcripts for Privacy Law Compliance
Figure 4 for May I Ask Who's Calling? Named Entity Recognition on Call Center Transcripts for Privacy Law Compliance
Viaarxiv icon

Domain Adaptation Using Class Similarity for Robust Speech Recognition

Add code
Bookmark button
Alert button
Nov 05, 2020
Han Zhu, Jiangjiang Zhao, Yuling Ren, Li Wang, Pengyuan Zhang

Figure 1 for Domain Adaptation Using Class Similarity for Robust Speech Recognition
Figure 2 for Domain Adaptation Using Class Similarity for Robust Speech Recognition
Viaarxiv icon

Convolutional Neural Networks for Speech Controlled Prosthetic Hands

Oct 03, 2019
Mohsen Jafarzadeh, Yonas Tadesse

Figure 1 for Convolutional Neural Networks for Speech Controlled Prosthetic Hands
Figure 2 for Convolutional Neural Networks for Speech Controlled Prosthetic Hands
Figure 3 for Convolutional Neural Networks for Speech Controlled Prosthetic Hands
Figure 4 for Convolutional Neural Networks for Speech Controlled Prosthetic Hands
Viaarxiv icon

Self-supervised Graphs for Audio Representation Learning with Limited Labeled Data

Add code
Bookmark button
Alert button
Jan 31, 2022
Amir Shirian, Krishna Somandepalli, Tanaya Guha

Figure 1 for Self-supervised Graphs for Audio Representation Learning with Limited Labeled Data
Figure 2 for Self-supervised Graphs for Audio Representation Learning with Limited Labeled Data
Figure 3 for Self-supervised Graphs for Audio Representation Learning with Limited Labeled Data
Figure 4 for Self-supervised Graphs for Audio Representation Learning with Limited Labeled Data
Viaarxiv icon

MeshRIR: A Dataset of Room Impulse Responses on Meshed Grid Points For Evaluating Sound Field Analysis and Synthesis Methods

Add code
Bookmark button
Alert button
Jun 21, 2021
Shoichi Koyama, Tomoya Nishida, Keisuke Kimura, Takumi Abe, Natsuki Ueno, Jesper Brunnström

Figure 1 for MeshRIR: A Dataset of Room Impulse Responses on Meshed Grid Points For Evaluating Sound Field Analysis and Synthesis Methods
Figure 2 for MeshRIR: A Dataset of Room Impulse Responses on Meshed Grid Points For Evaluating Sound Field Analysis and Synthesis Methods
Figure 3 for MeshRIR: A Dataset of Room Impulse Responses on Meshed Grid Points For Evaluating Sound Field Analysis and Synthesis Methods
Figure 4 for MeshRIR: A Dataset of Room Impulse Responses on Meshed Grid Points For Evaluating Sound Field Analysis and Synthesis Methods
Viaarxiv icon