Alert button

"speech recognition": models, code, and papers
Alert button

Imperio: Robust Over-the-Air Adversarial Examples for Automatic Speech Recognition Systems

Sep 09, 2019
Lea Schönherr, Steffen Zeiler, Thorsten Holz, Dorothea Kolossa

Figure 1 for Imperio: Robust Over-the-Air Adversarial Examples for Automatic Speech Recognition Systems
Figure 2 for Imperio: Robust Over-the-Air Adversarial Examples for Automatic Speech Recognition Systems
Figure 3 for Imperio: Robust Over-the-Air Adversarial Examples for Automatic Speech Recognition Systems
Figure 4 for Imperio: Robust Over-the-Air Adversarial Examples for Automatic Speech Recognition Systems
Viaarxiv icon

End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System

May 17, 2019
Emiru Tsunoo, Yosuke Kashiwagi, Satoshi Asakawa, Toshiyuki Kumakura

Figure 1 for End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System
Figure 2 for End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System
Figure 3 for End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System
Figure 4 for End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System
Viaarxiv icon

T-NGA: Temporal Network Grafting Algorithm for Learning to Process Spiking Audio Sensor Events

Feb 07, 2022
Shu Wang, Yuhuang Hu, Shih-Chii Liu

Figure 1 for T-NGA: Temporal Network Grafting Algorithm for Learning to Process Spiking Audio Sensor Events
Figure 2 for T-NGA: Temporal Network Grafting Algorithm for Learning to Process Spiking Audio Sensor Events
Figure 3 for T-NGA: Temporal Network Grafting Algorithm for Learning to Process Spiking Audio Sensor Events
Figure 4 for T-NGA: Temporal Network Grafting Algorithm for Learning to Process Spiking Audio Sensor Events
Viaarxiv icon

Residual Convolutional CTC Networks for Automatic Speech Recognition

Add code
Bookmark button
Alert button
Feb 24, 2017
Yisen Wang, Xuejiao Deng, Songbai Pu, Zhiheng Huang

Figure 1 for Residual Convolutional CTC Networks for Automatic Speech Recognition
Figure 2 for Residual Convolutional CTC Networks for Automatic Speech Recognition
Figure 3 for Residual Convolutional CTC Networks for Automatic Speech Recognition
Figure 4 for Residual Convolutional CTC Networks for Automatic Speech Recognition
Viaarxiv icon

ViDA-MAN: Visual Dialog with Digital Humans

Oct 26, 2021
Tong Shen, Jiawei Zuo, Fan Shi, Jin Zhang, Liqin Jiang, Meng Chen, Zhengchen Zhang, Wei Zhang, Xiaodong He, Tao Mei

Figure 1 for ViDA-MAN: Visual Dialog with Digital Humans
Figure 2 for ViDA-MAN: Visual Dialog with Digital Humans
Viaarxiv icon

Deep transfer learning for partial differential equations under conditional shift with DeepONet

Add code
Bookmark button
Alert button
Apr 20, 2022
Somdatta Goswami, Katiana Kontolati, Michael D. Shields, George Em Karniadakis

Figure 1 for Deep transfer learning for partial differential equations under conditional shift with DeepONet
Figure 2 for Deep transfer learning for partial differential equations under conditional shift with DeepONet
Figure 3 for Deep transfer learning for partial differential equations under conditional shift with DeepONet
Figure 4 for Deep transfer learning for partial differential equations under conditional shift with DeepONet
Viaarxiv icon

Streaming Multi-Talker ASR with Token-Level Serialized Output Training

Feb 05, 2022
Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka

Figure 1 for Streaming Multi-Talker ASR with Token-Level Serialized Output Training
Figure 2 for Streaming Multi-Talker ASR with Token-Level Serialized Output Training
Figure 3 for Streaming Multi-Talker ASR with Token-Level Serialized Output Training
Figure 4 for Streaming Multi-Talker ASR with Token-Level Serialized Output Training
Viaarxiv icon

Accent Recognition with Hybrid Phonetic Features

Add code
Bookmark button
Alert button
May 05, 2021
Zhan Zhang, Xi Chen, Yuehai Wang, Jianyi Yang

Figure 1 for Accent Recognition with Hybrid Phonetic Features
Figure 2 for Accent Recognition with Hybrid Phonetic Features
Figure 3 for Accent Recognition with Hybrid Phonetic Features
Figure 4 for Accent Recognition with Hybrid Phonetic Features
Viaarxiv icon

Revisiting the Boundary between ASR and NLU in the Age of Conversational Dialog Systems

Dec 10, 2021
Manaal Faruqui, Dilek Hakkani-Tür

Figure 1 for Revisiting the Boundary between ASR and NLU in the Age of Conversational Dialog Systems
Figure 2 for Revisiting the Boundary between ASR and NLU in the Age of Conversational Dialog Systems
Figure 3 for Revisiting the Boundary between ASR and NLU in the Age of Conversational Dialog Systems
Figure 4 for Revisiting the Boundary between ASR and NLU in the Age of Conversational Dialog Systems
Viaarxiv icon

Late reverberation suppression using U-nets

Oct 05, 2021
Diego León, Felipe Tobar

Figure 1 for Late reverberation suppression using U-nets
Figure 2 for Late reverberation suppression using U-nets
Figure 3 for Late reverberation suppression using U-nets
Figure 4 for Late reverberation suppression using U-nets
Viaarxiv icon