"speech": models, code, and papers

Efficient Transformer-based Speech Enhancement Using Long Frames and STFT Magnitudes

Jun 23, 2022
Danilo de Oliveira, Tal Peer, Timo Gerkmann

BrainBERT: Self-supervised representation learning for intracranial recordings

Feb 28, 2023
Christopher Wang, Vighnesh Subramaniam, Adam Uri Yaari, Gabriel Kreiman, Boris Katz, Ignacio Cases, Andrei Barbu

Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained Models

Feb 15, 2023
Abteen Ebrahimi, Arya D. McCarthy, Arturo Oncevay, Luis Chiruzzo, John E. Ortega, Gustavo A. Giménez-Lugo, Rolando Coto-Solano, Katharina Kann

Separator-Transducer-Segmenter: Streaming Recognition and Segmentation of Multi-party Speech

May 10, 2022
Ilya Sklyar, Anna Piunova, Christian Osendorfer

Supervised Acoustic Embeddings And Their Transferability Across Languages

Jan 03, 2023
Sreepratha Ram, Hanan Aldarmaki

Efficiency 360: Efficient Vision Transformers

Feb 17, 2023
Badri N. Patro, Vijay Srinivas Agneeswaran

Minimum Processing Near-end Listening Enhancement

Oct 31, 2022
Andreas Jonas Fuglsig, Jesper Jensen, Zheng-Hua Tan, Lars Søndergaard Bertelsen, Jens Christian Lindof, Jan Østergaard

Data Augmentation with Locally-time Reversed Speech for Automatic Speech Recognition

Oct 09, 2021
Si-Ioi Ng, Tan Lee

Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation

Mar 24, 2022
Xian Liu, Qianyi Wu, Hang Zhou, Yinghao Xu, Rui Qian, Xinyi Lin, Xiaowei Zhou, Wayne Wu, Bo Dai, Bolei Zhou

Adaptable End-to-End ASR Models using Replaceable Internal LMs and Residual Softmax

Feb 16, 2023
Keqi Deng, Philip C. Woodland
