Alert button

"speech recognition": models, code, and papers
Alert button

Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding

May 23, 2023
Zheng Chen, Ziyan Jiang, Fan Yang

Figure 1 for Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding
Figure 2 for Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding
Figure 3 for Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding
Figure 4 for Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding
Viaarxiv icon

Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition

Apr 13, 2022
Shaojin Ding, Rajeev Rikhye, Qiao Liang, Yanzhang He, Quan Wang, Arun Narayanan, Tom O'Malley, Ian McGraw

Figure 1 for Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Figure 2 for Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Figure 3 for Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Figure 4 for Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Viaarxiv icon

Dynamic Chunk Convolution for Unified Streaming and Non-Streaming Conformer ASR

Apr 25, 2023
Xilai Li, Goeric Huybrechts, Srikanth Ronanki, Jeff Farris, Sravan Bodapati

Figure 1 for Dynamic Chunk Convolution for Unified Streaming and Non-Streaming Conformer ASR
Figure 2 for Dynamic Chunk Convolution for Unified Streaming and Non-Streaming Conformer ASR
Figure 3 for Dynamic Chunk Convolution for Unified Streaming and Non-Streaming Conformer ASR
Figure 4 for Dynamic Chunk Convolution for Unified Streaming and Non-Streaming Conformer ASR
Viaarxiv icon

Contextual Adapters for Personalized Speech Recognition in Neural Transducers

May 26, 2022
Kanthashree Mysore Sathyendra, Thejaswi Muniyappa, Feng-Ju Chang, Jing Liu, Jinru Su, Grant P. Strimel, Athanasios Mouchtaris, Siegfried Kunzmann

Figure 1 for Contextual Adapters for Personalized Speech Recognition in Neural Transducers
Figure 2 for Contextual Adapters for Personalized Speech Recognition in Neural Transducers
Figure 3 for Contextual Adapters for Personalized Speech Recognition in Neural Transducers
Figure 4 for Contextual Adapters for Personalized Speech Recognition in Neural Transducers
Viaarxiv icon

Confidence Score Based Conformer Speaker Adaptation for Speech Recognition

Jun 24, 2022
Jiajun Deng, Xurong Xie, Tianzi Wang, Mingyu Cui, Boyang Xue, Zengrui Jin, Mengzhe Geng, Guinan Li, Xunying Liu, Helen Meng

Figure 1 for Confidence Score Based Conformer Speaker Adaptation for Speech Recognition
Figure 2 for Confidence Score Based Conformer Speaker Adaptation for Speech Recognition
Figure 3 for Confidence Score Based Conformer Speaker Adaptation for Speech Recognition
Figure 4 for Confidence Score Based Conformer Speaker Adaptation for Speech Recognition
Viaarxiv icon

Improving Pseudo-label Training For End-to-end Speech Recognition Using Gradient Mask

Oct 08, 2021
Shaoshi Ling, Chen Shen, Meng Cai, Zejun Ma

Figure 1 for Improving Pseudo-label Training For End-to-end Speech Recognition Using Gradient Mask
Figure 2 for Improving Pseudo-label Training For End-to-end Speech Recognition Using Gradient Mask
Figure 3 for Improving Pseudo-label Training For End-to-end Speech Recognition Using Gradient Mask
Figure 4 for Improving Pseudo-label Training For End-to-end Speech Recognition Using Gradient Mask
Viaarxiv icon

Efficient Training of Neural Transducer for Speech Recognition

Apr 22, 2022
Wei Zhou, Wilfried Michel, Ralf Schlüter, Hermann Ney

Figure 1 for Efficient Training of Neural Transducer for Speech Recognition
Figure 2 for Efficient Training of Neural Transducer for Speech Recognition
Figure 3 for Efficient Training of Neural Transducer for Speech Recognition
Figure 4 for Efficient Training of Neural Transducer for Speech Recognition
Viaarxiv icon

Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition

Add code
Bookmark button
Alert button
Apr 08, 2022
Qianying Liu, Yuhang Yang, Zhuo Gong, Sheng Li, Chenchen Ding, Nobuaki Minematsu, Hao Huang, Fei Cheng, Sadao Kurohashi

Figure 1 for Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition
Figure 2 for Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition
Figure 3 for Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition
Figure 4 for Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition
Viaarxiv icon

Joint unsupervised and supervised learning for context-aware language identification

Apr 14, 2023
Jinseok Park, Hyung Yong Kim, Jihwan Park, Byeong-Yeol Kim, Shukjae Choi, Yunkyu Lim

Figure 1 for Joint unsupervised and supervised learning for context-aware language identification
Figure 2 for Joint unsupervised and supervised learning for context-aware language identification
Figure 3 for Joint unsupervised and supervised learning for context-aware language identification
Figure 4 for Joint unsupervised and supervised learning for context-aware language identification
Viaarxiv icon

deep learning of segment-level feature representation for speech emotion recognition in conversations

Feb 05, 2023
Jiachen Luo, Huy Phan, Joshua Reiss

Figure 1 for deep learning of segment-level feature representation for speech emotion recognition in conversations
Figure 2 for deep learning of segment-level feature representation for speech emotion recognition in conversations
Figure 3 for deep learning of segment-level feature representation for speech emotion recognition in conversations
Figure 4 for deep learning of segment-level feature representation for speech emotion recognition in conversations
Viaarxiv icon