Alert button

"speech recognition": models, code, and papers
Alert button

Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Time

Add code
Bookmark button
Alert button
Aug 03, 2023
Xinfeng Li, Chen Yan, Xuancun Lu, Zihan Zeng, Xiaoyu Ji, Wenyuan Xu

Figure 1 for Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Time
Figure 2 for Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Time
Figure 3 for Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Time
Figure 4 for Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Time
Viaarxiv icon

Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition

Add code
Bookmark button
Alert button
Feb 22, 2023
Yuchen Hu, Chen Chen, Ruizhe Li, Qiushi Zhu, Eng Siong Chng

Figure 1 for Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition
Figure 2 for Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition
Figure 3 for Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition
Figure 4 for Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition
Viaarxiv icon

Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Tim

Add code
Bookmark button
Alert button
Aug 02, 2023
Xinfeng Li, Chen Yan, Xuancun Lu, Zihan Zeng, Xiaoyu Ji, Wenyuan Xu

Figure 1 for Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Tim
Figure 2 for Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Tim
Figure 3 for Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Tim
Figure 4 for Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Tim
Viaarxiv icon

Sparks of Large Audio Models: A Survey and Outlook

Add code
Bookmark button
Alert button
Aug 24, 2023
Siddique Latif, Moazzam Shoukat, Fahad Shamshad, Muhammad Usama, Heriberto Cuayáhuitl, Björn W. Schuller

Figure 1 for Sparks of Large Audio Models: A Survey and Outlook
Figure 2 for Sparks of Large Audio Models: A Survey and Outlook
Figure 3 for Sparks of Large Audio Models: A Survey and Outlook
Figure 4 for Sparks of Large Audio Models: A Survey and Outlook
Viaarxiv icon

Sim-T: Simplify the Transformer Network by Multiplexing Technique for Speech Recognition

Apr 11, 2023
Guangyong Wei, Zhikui Duan, Shiren Li, Guangguang Yang, Xinmei Yu, Junhua Li

Figure 1 for Sim-T: Simplify the Transformer Network by Multiplexing Technique for Speech Recognition
Figure 2 for Sim-T: Simplify the Transformer Network by Multiplexing Technique for Speech Recognition
Figure 3 for Sim-T: Simplify the Transformer Network by Multiplexing Technique for Speech Recognition
Figure 4 for Sim-T: Simplify the Transformer Network by Multiplexing Technique for Speech Recognition
Viaarxiv icon

Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model

Sep 05, 2023
Yuezhou Zhang, Amos A Folarin, Judith Dineley, Pauline Conde, Valeria de Angel, Shaoxiong Sun, Yatharth Ranjan, Zulqarnain Rashid, Callum Stewart, Petroula Laiou, Heet Sankesara, Linglong Qian, Faith Matcham, Katie M White, Carolin Oetzmann, Femke Lamers, Sara Siddi, Sara Simblett, Björn W. Schuller, Srinivasan Vairavan, Til Wykes, Josep Maria Haro, Brenda WJH Penninx, Vaibhav A Narayan, Matthew Hotopf, Richard JB Dobson, Nicholas Cummins, RADAR-CNS consortium

Figure 1 for Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model
Figure 2 for Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model
Figure 3 for Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model
Figure 4 for Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model
Viaarxiv icon

End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations

Aug 15, 2023
Bolaji Yusuf, Jan Cernocky, Murat Saraclar

Viaarxiv icon

Decoupled Structure for Improved Adaptability of End-to-End Models

Aug 25, 2023
Keqi Deng, Philip C. Woodland

Viaarxiv icon

Self-Supervised Masked Digital Elevation Models Encoding for Low-Resource Downstream Tasks

Sep 06, 2023
Priyam Mazumdar, Aiman Soliman, Volodymyr Kindratenko, Luigi Marini, Kenton McHenry

Viaarxiv icon

O-1: Self-training with Oracle and 1-best Hypothesis

Aug 14, 2023
Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran, Kartik Audhkhasi

Figure 1 for O-1: Self-training with Oracle and 1-best Hypothesis
Figure 2 for O-1: Self-training with Oracle and 1-best Hypothesis
Figure 3 for O-1: Self-training with Oracle and 1-best Hypothesis
Figure 4 for O-1: Self-training with Oracle and 1-best Hypothesis
Viaarxiv icon