"speech": models, code, and papers

I run as fast as a rabbit, can you? A Multilingual Simile Dialogue Dataset

Jun 09, 2023
Longxuan Ma, Weinan Zhang, Shuhan Zhou, Churui Sun, Changxin Ke, Ting Liu

Via arXiv

Feature Selection using Sparse Adaptive Bottleneck Centroid-Encoder

Jun 09, 2023
Tomojit Ghosh, Michael Kirby

Via arXiv

Transformer-based Sequence Labeling for Audio Classification based on MFCCs

Apr 30, 2023
C. S. Sonali, Chinmayi B S, Ahana Balasubramanian

Via arXiv

High Fidelity Speech Enhancement with Band-split RNN

Dec 01, 2022
Jianwei Yu, Yi Luo, Hangting Chen, Rongzhi Gu, Chao Weng

Via arXiv

Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text Pretraining

Jan 30, 2023
Takaaki Saeki, Soumi Maiti, Xinjian Li, Shinji Watanabe, Shinnosuke Takamichi, Hiroshi Saruwatari

Via arXiv

Antisemitic Messages? A Guide to High-Quality Annotation and a Labeled Dataset of Tweets

Apr 28, 2023
Gunther Jikeli, Sameer Karali, Daniel Miehling, Katharina Soemer

Via arXiv

LongFNT: Long-form Speech Recognition with Factorized Neural Transducer

Nov 17, 2022
Xun Gong, Yu Wu, Jinyu Li, Shujie Liu, Rui Zhao, Xie Chen, Yanmin Qian

Via arXiv

Joint unsupervised and supervised learning for context-aware language identification

Apr 14, 2023
Jinseok Park, Hyung Yong Kim, Jihwan Park, Byeong-Yeol Kim, Shukjae Choi, Yunkyu Lim

Via arXiv

XNOR-FORMER: Learning Accurate Approximations in Long Speech Transformers

Oct 29, 2022
Roshan Sharma, Bhiksha Raj

Via arXiv

Towards Building Voice-based Conversational Recommender Systems: Datasets, Potential Solutions, and Prospects

Jun 14, 2023
Xinghua Qu, Hongyang Liu, Zhu Sun, Xiang Yin, Yew Soon Ong, Lu Lu, Zejun Ma

Via arXiv