Alert button

"speech": models, code, and papers
Alert button

Parmesan: mathematical concept extraction for education

Add code
Bookmark button
Alert button
Jul 17, 2023
Jacob Collard, Valeria de Paiva, Eswaran Subrahmanian

Figure 1 for Parmesan: mathematical concept extraction for education
Figure 2 for Parmesan: mathematical concept extraction for education
Figure 3 for Parmesan: mathematical concept extraction for education
Figure 4 for Parmesan: mathematical concept extraction for education
Viaarxiv icon

Ed-Fed: A generic federated learning framework with resource-aware client selection for edge devices

Jul 14, 2023
Zitha Sasindran, Harsha Yelchuri, T. V. Prabhakar

Figure 1 for Ed-Fed: A generic federated learning framework with resource-aware client selection for edge devices
Figure 2 for Ed-Fed: A generic federated learning framework with resource-aware client selection for edge devices
Figure 3 for Ed-Fed: A generic federated learning framework with resource-aware client selection for edge devices
Figure 4 for Ed-Fed: A generic federated learning framework with resource-aware client selection for edge devices
Viaarxiv icon

Improving RNN-Transducers with Acoustic LookAhead

Jul 11, 2023
Vinit S. Unni, Ashish Mittal, Preethi Jyothi, Sunita Sarawagi

Figure 1 for Improving RNN-Transducers with Acoustic LookAhead
Figure 2 for Improving RNN-Transducers with Acoustic LookAhead
Figure 3 for Improving RNN-Transducers with Acoustic LookAhead
Figure 4 for Improving RNN-Transducers with Acoustic LookAhead
Viaarxiv icon

Conformers are All You Need for Visual Speech Recogntion

Feb 17, 2023
Oscar Chang, Hank Liao, Dmitriy Serdyuk, Ankit Shah, Olivier Siohan

Figure 1 for Conformers are All You Need for Visual Speech Recogntion
Figure 2 for Conformers are All You Need for Visual Speech Recogntion
Figure 3 for Conformers are All You Need for Visual Speech Recogntion
Figure 4 for Conformers are All You Need for Visual Speech Recogntion
Viaarxiv icon

Speech-based Age and Gender Prediction with Transformers

Add code
Bookmark button
Alert button
Jun 29, 2023
Felix Burkhardt, Johannes Wagner, Hagen Wierstorf, Florian Eyben, Björn Schuller

Figure 1 for Speech-based Age and Gender Prediction with Transformers
Figure 2 for Speech-based Age and Gender Prediction with Transformers
Figure 3 for Speech-based Age and Gender Prediction with Transformers
Figure 4 for Speech-based Age and Gender Prediction with Transformers
Viaarxiv icon

TranssionADD: A multi-frame reinforcement based sequence tagging model for audio deepfake detection

Jun 27, 2023
Jie Liu, Zhiba Su, Hui Huang, Caiyan Wan, Quanxiu Wang, Jiangli Hong, Benlai Tang, Fengjie Zhu

Figure 1 for TranssionADD: A multi-frame reinforcement based sequence tagging model for audio deepfake detection
Figure 2 for TranssionADD: A multi-frame reinforcement based sequence tagging model for audio deepfake detection
Figure 3 for TranssionADD: A multi-frame reinforcement based sequence tagging model for audio deepfake detection
Viaarxiv icon

Deep Transfer Learning for Automatic Speech Recognition: Towards Better Generalization

Apr 27, 2023
Hamza Kheddar, Yassine Himeur, Somaya Al-Maadeed, Abbes Amira, Faycal Bensaali

Figure 1 for Deep Transfer Learning for Automatic Speech Recognition: Towards Better Generalization
Figure 2 for Deep Transfer Learning for Automatic Speech Recognition: Towards Better Generalization
Figure 3 for Deep Transfer Learning for Automatic Speech Recognition: Towards Better Generalization
Figure 4 for Deep Transfer Learning for Automatic Speech Recognition: Towards Better Generalization
Viaarxiv icon

Adapting Offline Speech Translation Models for Streaming with Future-Aware Distillation and Inference

Add code
Bookmark button
Alert button
Mar 14, 2023
Biao Fu, Kai Fan, Minpeng Liao, Zhongqiang Huang, Boxing Chen, Yidong Chen, Xiaodong Shi

Figure 1 for Adapting Offline Speech Translation Models for Streaming with Future-Aware Distillation and Inference
Figure 2 for Adapting Offline Speech Translation Models for Streaming with Future-Aware Distillation and Inference
Figure 3 for Adapting Offline Speech Translation Models for Streaming with Future-Aware Distillation and Inference
Figure 4 for Adapting Offline Speech Translation Models for Streaming with Future-Aware Distillation and Inference
Viaarxiv icon

MERLIon CCS Challenge Evaluation Plan

Add code
Bookmark button
Alert button
May 31, 2023
Leibny Paola Garcia Perera, Y. H. Victoria Chua, Hexin Liu, Fei Ting Woon, Andy W. H. Khong, Justin Dauwels, Sanjeev Khudanpur, Suzy J. Styles

Figure 1 for MERLIon CCS Challenge Evaluation Plan
Figure 2 for MERLIon CCS Challenge Evaluation Plan
Figure 3 for MERLIon CCS Challenge Evaluation Plan
Figure 4 for MERLIon CCS Challenge Evaluation Plan
Viaarxiv icon

Multilingual Multi-Figurative Language Detection

Add code
Bookmark button
Alert button
May 31, 2023
Huiyuan Lai, Antonio Toral, Malvina Nissim

Figure 1 for Multilingual Multi-Figurative Language Detection
Figure 2 for Multilingual Multi-Figurative Language Detection
Figure 3 for Multilingual Multi-Figurative Language Detection
Figure 4 for Multilingual Multi-Figurative Language Detection
Viaarxiv icon