"speech": models, code, and papers

Rudolf Christoph Eucken at SemEval-2023 Task 4: An Ensemble Approach for Identifying Human Values from Arguments

May 09, 2023
Sougata Saha, Rohini Srihari

Via arXiv

Persistent Laplacian-enhanced Algorithm for Scarcely Labeled Data Classification

May 25, 2023
Gokul Bhusal, Ekaterina Merkurjev, Guo-Wei Wei

Via arXiv

SpeechLM: Enhanced Speech Pre-Training with Unpaired Textual Data

Sep 30, 2022
Ziqiang Zhang, Sanyuan Chen, Long Zhou, Yu Wu, Shuo Ren, Shujie Liu, Zhuoyuan Yao, Xun Gong, Lirong Dai, Jinyu Li, Furu Wei

Via arXiv

Confidence Score Based Speaker Adaptation of Conformer Speech Recognition Systems

Feb 15, 2023
Jiajun Deng, Xurong Xie, Tianzi Wang, Mingyu Cui, Boyang Xue, Zengrui Jin, Guinan Li, Shujie Hu, Xunying Liu

Via arXiv

DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement

Dec 15, 2022
Dongheon Lee, Jung-Woo Choi

Via arXiv

Persian Typographical Error Type Detection using Many-to-Many Deep Neural Networks on Algorithmically-Generated Misspellings

May 19, 2023
Mohammad Dehghani, Heshaam Faili

Via arXiv

Joint Speech Transcription and Translation: Pseudo-Labeling with Out-of-Distribution Data

Dec 20, 2022
Mozhdeh Gheini, Tatiana Likhomanenko, Matthias Sperber, Hendra Setiawan

Via arXiv

Training Transitive and Commutative Multimodal Transformers with LoReTTa

May 23, 2023
Manuel Tran, Amal Lahiani, Yashin Dicente Cid, Fabian J. Theis, Tingying Peng, Eldad Klaiman

Via arXiv

Distance-based Weight Transfer from Near-field to Far-field Speaker Verification

Mar 15, 2023
Li Zhang, Qing Wang, Hongji Wang, Yue Li, Wei Rao, Yannan Wang, Lei Xie

Via arXiv

Neural Feature Predictor and Discriminative Residual Coding for Low-Bitrate Speech Coding

Nov 04, 2022
Haici Yang, Wootaek Lim, Minje Kim

Via arXiv