Alert button

"speech": models, code, and papers
Alert button

AutoAD II: The Sequel -- Who, When, and What in Movie Audio Description

Oct 10, 2023
Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman

Figure 1 for AutoAD II: The Sequel -- Who, When, and What in Movie Audio Description
Figure 2 for AutoAD II: The Sequel -- Who, When, and What in Movie Audio Description
Figure 3 for AutoAD II: The Sequel -- Who, When, and What in Movie Audio Description
Figure 4 for AutoAD II: The Sequel -- Who, When, and What in Movie Audio Description
Viaarxiv icon

Sparse Finetuning for Inference Acceleration of Large Language Models

Add code
Bookmark button
Alert button
Oct 10, 2023
Eldar Kurtic, Denis Kuznedelev, Elias Frantar, Michael Goin, Dan Alistarh

Viaarxiv icon

Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Tim

Add code
Bookmark button
Alert button
Aug 02, 2023
Xinfeng Li, Chen Yan, Xuancun Lu, Zihan Zeng, Xiaoyu Ji, Wenyuan Xu

Figure 1 for Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Tim
Figure 2 for Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Tim
Figure 3 for Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Tim
Figure 4 for Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Tim
Viaarxiv icon

Cross-modal Alignment with Optimal Transport for CTC-based ASR

Sep 24, 2023
Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai

Viaarxiv icon

Wiki-En-ASR-Adapt: Large-scale synthetic dataset for English ASR Customization

Sep 29, 2023
Alexandra Antonova

Viaarxiv icon

Multi-Channel MOSRA: Mean Opinion Score and Room Acoustics Estimation Using Simulated Data and a Teacher Model

Sep 21, 2023
Jozef Coldenhoff, Andrew Harper, Paul Kendrick, Tijana Stojkovic, Milos Cernak

Viaarxiv icon

Acoustic-to-Articulatory Speech Inversion Features for Mispronunciation Detection of /r/ in Child Speech Sound Disorders

May 25, 2023
Nina R Benway, Yashish M Siriwardena, Jonathan L Preston, Elaine Hitchcock, Tara McAllister, Carol Espy-Wilson

Figure 1 for Acoustic-to-Articulatory Speech Inversion Features for Mispronunciation Detection of /r/ in Child Speech Sound Disorders
Figure 2 for Acoustic-to-Articulatory Speech Inversion Features for Mispronunciation Detection of /r/ in Child Speech Sound Disorders
Figure 3 for Acoustic-to-Articulatory Speech Inversion Features for Mispronunciation Detection of /r/ in Child Speech Sound Disorders
Figure 4 for Acoustic-to-Articulatory Speech Inversion Features for Mispronunciation Detection of /r/ in Child Speech Sound Disorders
Viaarxiv icon

Gesper: A Restoration-Enhancement Framework for General Speech Reconstruction

Jun 14, 2023
Wenzhe Liu, Yupeng Shi, Jun Chen, Wei Rao, Shulin He, Andong Li, Yannan Wang, Zhiyong Wu

Figure 1 for Gesper: A Restoration-Enhancement Framework for General Speech Reconstruction
Figure 2 for Gesper: A Restoration-Enhancement Framework for General Speech Reconstruction
Figure 3 for Gesper: A Restoration-Enhancement Framework for General Speech Reconstruction
Figure 4 for Gesper: A Restoration-Enhancement Framework for General Speech Reconstruction
Viaarxiv icon

Don't Stop Self-Supervision: Accent Adaptation of Speech Representations via Residual Adapters

Jul 02, 2023
Anshu Bhatia, Sanchit Sinha, Saket Dingliwal, Karthik Gopalakrishnan, Sravan Bodapati, Katrin Kirchhoff

Figure 1 for Don't Stop Self-Supervision: Accent Adaptation of Speech Representations via Residual Adapters
Figure 2 for Don't Stop Self-Supervision: Accent Adaptation of Speech Representations via Residual Adapters
Figure 3 for Don't Stop Self-Supervision: Accent Adaptation of Speech Representations via Residual Adapters
Figure 4 for Don't Stop Self-Supervision: Accent Adaptation of Speech Representations via Residual Adapters
Viaarxiv icon

A comparative study of Grid and Natural sentences effects on Normal-to-Lombard conversion

Sep 19, 2023
Hongyang Chen, Yuhong Yang, Qingmu Liu, Baifeng Li, Weiping Tu, Song Lin

Figure 1 for A comparative study of Grid and Natural sentences effects on Normal-to-Lombard conversion
Figure 2 for A comparative study of Grid and Natural sentences effects on Normal-to-Lombard conversion
Figure 3 for A comparative study of Grid and Natural sentences effects on Normal-to-Lombard conversion
Viaarxiv icon