"speech": models, code, and papers

Online Binaural Speech Separation of Moving Speakers With a Wavesplit Network

Mar 13, 2023
Cong Han, Nima Mesgarani

ITALIC: An Italian Intent Classification Dataset

Jun 14, 2023
Alkis Koudounas, Moreno La Quatra, Lorenzo Vaiani, Luca Colomba, Giuseppe Attanasio, Eliana Pastor, Luca Cagliero, Elena Baralis

Pyramid Multi-branch Fusion DCNN with Multi-Head Self-Attention for Mandarin Speech Recognition

Mar 23, 2023
Kai Liu, Hailiang Xiong, Gangqiang Yang, Zhengfeng Du, Yewen Cao, Danyal Shah

A Federated Approach for Hate Speech Detection

Feb 18, 2023
Jay Gala, Deep Gandhi, Jash Mehta, Zeerak Talat

AlignAtt: Using Attention-based Audio-Translation Alignments as a Guide for Simultaneous Speech Translation

May 19, 2023
Sara Papi, Marco Turchi, Matteo Negri

Ensemble knowledge distillation of self-supervised speech models

Feb 24, 2023
Kuan-Po Huang, Tzu-hsun Feng, Yu-Kuan Fu, Tsu-Yuan Hsu, Po-Chieh Yen, Wei-Cheng Tseng, Kai-Wei Chang, Hung-yi Lee

Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers

May 23, 2023
Jan Silovsky, Liuhui Deng, Arturo Argueta, Tresi Arvizo, Roger Hsiao, Sasha Kuznietsov, Yiu-Chang Lin, Xiaoqiang Xiao, Yuanyuan Zhang

A Vector Quantized Approach for Text to Speech Synthesis on Real-World Spontaneous Speech

Feb 08, 2023
Li-Wei Chen, Shinji Watanabe, Alexander Rudnicky

Deep Neural Networks and Brain Alignment: Brain Encoding and Decoding (Survey)

Jul 17, 2023
Subba Reddy Oota, Manish Gupta, Raju S. Bapi, Gael Jobard, Frederic Alexandre, Xavier Hinaut

DIVERSIFY: A General Framework for Time Series Out-of-distribution Detection and Generalization

Aug 04, 2023
Wang Lu, Jindong Wang, Xinwei Sun, Yiqiang Chen, Xiangyang Ji, Qiang Yang, Xing Xie
